pith. machine review for the scientific record. sign in

Yu-Gang Jiang

Identifiers

No identifiers captured yet.

Papers (49)

  1. GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization cs.RO · 2026 · author #20
  2. World Action Models: The Next Frontier in Embodied AI cs.RO · 2026 · author #14
  3. Attention Itself Could Retrieve.RetrieveVGGT: Training-Free Long Context Streaming 3D Reconstruction via Query-Key Similarity Retrieval cs.CV · 2026 · author #4
  4. From Synthetic to Real: Toward Identity-Consistent Makeup Transfer with Synthetic and Real Data cs.CV · 2026 · author #5
  5. ML-Bench&Guard: Policy-Grounded Multilingual Safety Benchmark and Guardrail for Large Language Models cs.CL · 2026 · author #4
  6. CL-bench Life: Can Language Models Learn from Real-Life Context? cs.CL · 2026 · author #36
  7. Spatiotemporal Sycophancy: Negation-Based Gaslighting in Video Large Language Models cs.CV · 2026 · author #6
  8. SpatialImaginer: Towards Adaptive Visual Imagination for Spatial Reasoning cs.CV · 2026 · author #7
  9. ROSE: Retrieval-Oriented Segmentation Enhancement cs.CV · 2026 · author #4
  10. HazardArena: Evaluating Semantic Safety in Vision-Language-Action Models cs.RO · 2026 · author #11
  11. CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generation cs.CV · 2026 · author #13
  12. AssemLM: Spatial Reasoning Multimodal Large Language Models for Robotic Assembly cs.RO · 2026 · author #6
  13. Steering the Verifiability of Multimodal AI Hallucinations cs.AI · 2026 · author #7
  14. Safety in Embodied AI: A Survey of Risks, Attacks, and Defenses cs.CR · 2026 · author #34
  15. Robotic Grasping and Placement Controlled by EEG-Based Hybrid Visual and Motor Imagery cs.RO · 2026 · author #5
  16. Memory in the Age of AI Agents cs.CL · 2025 · author #46
  17. Boosting Reasoning in Large Multimodal Models via Activation Replay cs.CV · 2025 · author #7
  18. Unify Robot Actions in Camera Frame cs.RO · 2025 · author #12
  19. Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety cs.CR · 2025 · author #48
  20. Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation cs.CV · 2024 · author #11
  21. Black-box Adversarial Attacks on Video Recognition Models cs.LG · 2019 · author #5
  22. A Multi-task Neural Approach for Emotion Attribution, Classification and Summarization cs.LG · 2018 · author #5
  23. Instance-level Sketch-based Retrieval by Deep Triplet Classification Siamese Network cs.CV · 2018 · author #5
  24. Composite Binary Decomposition Networks cs.LG · 2018 · author #5
  25. Non-local NetVLAD Encoding for Video Classification cs.CV · 2018 · author #6
  26. Object Detection from Scratch with Deep Supervision cs.CV · 2018 · author #4
  27. NAIS: Neural Attentive Item Similarity Model for Recommendation cs.IR · 2018 · author #5
  28. Recurrent Fusion Network for Image Captioning cs.CV · 2018 · author #3
  29. Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks cs.CV · 2018 · author #6
  30. Social Anchor-Unit Graph Regularized Tensor Completion for Large-Scale Image Retagging cs.CV · 2018 · author #4
  31. Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images cs.CV · 2018 · author #6
  32. Learning to score the figure skating sports videos cs.MM · 2018 · author #5
  33. Pose-Normalized Image Generation for Person Re-identification cs.CV · 2017 · author #7
  34. Dual Skipping Networks cs.CV · 2017 · author #3
  35. Recent Advances in Zero-shot Recognition cs.CV · 2017 · author #3
  36. Multi-scale Deep Learning Architectures for Person Re-identification cs.CV · 2017 · author #3
  37. DSOD: Learning Deeply Supervised Object Detectors from Scratch cs.CV · 2017 · author #4
  38. Learning Fashion Compatibility with Bidirectional LSTMs cs.CV · 2017 · author #3
  39. Aggregating Frame-level Features for Large-Scale Video Classification cs.CV · 2017 · author #6
  40. Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification cs.MM · 2017 · author #1
  41. Weakly Supervised Dense Video Captioning cs.CV · 2017 · author #6
  42. Iterative Object and Part Transfer for Fine-Grained Recognition cs.CV · 2017 · author #2
  43. Deep Learning for Video Classification and Captioning cs.CV · 2016 · author #4
  44. The THUMOS Challenge on Action Recognition for Videos "in the Wild" cs.CV · 2016 · author #3
  45. Heterogeneous Knowledge Transfer in Video Emotion Recognition, Attribution and Summarization cs.CV · 2015 · author #3
  46. Fusing Multi-Stream Deep Networks for Video Classification cs.CV · 2015 · author #2
  47. Evaluating Two-Stream CNN for Video Classification cs.CV · 2015 · author #5
  48. Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification cs.CV · 2015 · author #3
  49. Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks cs.CV · 2015 · author #1

Mentions

No mention provenance yet.

Frequent Coauthors