Yu-Gang Jiang
Identifiers
No identifiers captured yet.
Papers (49)
- GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization cs.RO · 2026 · author #20
- World Action Models: The Next Frontier in Embodied AI cs.RO · 2026 · author #14
- Attention Itself Could Retrieve.RetrieveVGGT: Training-Free Long Context Streaming 3D Reconstruction via Query-Key Similarity Retrieval cs.CV · 2026 · author #4
- From Synthetic to Real: Toward Identity-Consistent Makeup Transfer with Synthetic and Real Data cs.CV · 2026 · author #5
- ML-Bench&Guard: Policy-Grounded Multilingual Safety Benchmark and Guardrail for Large Language Models cs.CL · 2026 · author #4
- CL-bench Life: Can Language Models Learn from Real-Life Context? cs.CL · 2026 · author #36
- Spatiotemporal Sycophancy: Negation-Based Gaslighting in Video Large Language Models cs.CV · 2026 · author #6
- SpatialImaginer: Towards Adaptive Visual Imagination for Spatial Reasoning cs.CV · 2026 · author #7
- ROSE: Retrieval-Oriented Segmentation Enhancement cs.CV · 2026 · author #4
- HazardArena: Evaluating Semantic Safety in Vision-Language-Action Models cs.RO · 2026 · author #11
- CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generation cs.CV · 2026 · author #13
- AssemLM: Spatial Reasoning Multimodal Large Language Models for Robotic Assembly cs.RO · 2026 · author #6
- Steering the Verifiability of Multimodal AI Hallucinations cs.AI · 2026 · author #7
- Safety in Embodied AI: A Survey of Risks, Attacks, and Defenses cs.CR · 2026 · author #34
- Robotic Grasping and Placement Controlled by EEG-Based Hybrid Visual and Motor Imagery cs.RO · 2026 · author #5
- Memory in the Age of AI Agents cs.CL · 2025 · author #46
- Boosting Reasoning in Large Multimodal Models via Activation Replay cs.CV · 2025 · author #7
- Unify Robot Actions in Camera Frame cs.RO · 2025 · author #12
- Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety cs.CR · 2025 · author #48
- Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation cs.CV · 2024 · author #11
- Black-box Adversarial Attacks on Video Recognition Models cs.LG · 2019 · author #5
- A Multi-task Neural Approach for Emotion Attribution, Classification and Summarization cs.LG · 2018 · author #5
- Instance-level Sketch-based Retrieval by Deep Triplet Classification Siamese Network cs.CV · 2018 · author #5
- Composite Binary Decomposition Networks cs.LG · 2018 · author #5
- Non-local NetVLAD Encoding for Video Classification cs.CV · 2018 · author #6
- Object Detection from Scratch with Deep Supervision cs.CV · 2018 · author #4
- NAIS: Neural Attentive Item Similarity Model for Recommendation cs.IR · 2018 · author #5
- Recurrent Fusion Network for Image Captioning cs.CV · 2018 · author #3
- Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks cs.CV · 2018 · author #6
- Social Anchor-Unit Graph Regularized Tensor Completion for Large-Scale Image Retagging cs.CV · 2018 · author #4
- Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images cs.CV · 2018 · author #6
- Learning to score the figure skating sports videos cs.MM · 2018 · author #5
- Pose-Normalized Image Generation for Person Re-identification cs.CV · 2017 · author #7
- Dual Skipping Networks cs.CV · 2017 · author #3
- Recent Advances in Zero-shot Recognition cs.CV · 2017 · author #3
- Multi-scale Deep Learning Architectures for Person Re-identification cs.CV · 2017 · author #3
- DSOD: Learning Deeply Supervised Object Detectors from Scratch cs.CV · 2017 · author #4
- Learning Fashion Compatibility with Bidirectional LSTMs cs.CV · 2017 · author #3
- Aggregating Frame-level Features for Large-Scale Video Classification cs.CV · 2017 · author #6
- Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification cs.MM · 2017 · author #1
- Weakly Supervised Dense Video Captioning cs.CV · 2017 · author #6
- Iterative Object and Part Transfer for Fine-Grained Recognition cs.CV · 2017 · author #2
- Deep Learning for Video Classification and Captioning cs.CV · 2016 · author #4
- The THUMOS Challenge on Action Recognition for Videos "in the Wild" cs.CV · 2016 · author #3
- Heterogeneous Knowledge Transfer in Video Emotion Recognition, Attribution and Summarization cs.CV · 2015 · author #3
- Fusing Multi-Stream Deep Networks for Video Classification cs.CV · 2015 · author #2
- Evaluating Two-Stream CNN for Video Classification cs.CV · 2015 · author #5
- Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification cs.CV · 2015 · author #3
- Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks cs.CV · 2015 · author #1
Mentions
No mention provenance yet.
Frequent Coauthors
- Zuxuan Wu 17 shared papers
- Xiangyang Xue 16 shared papers
- Yanwei Fu 11 shared papers
- Xingjun Ma 6 shared papers
- Xuanjing Huang 5 shared papers
- Jianguo Li 4 shared papers
- Jingjing Chen 4 shared papers
- Shaoxiang Chen 4 shared papers
- Wei Liu 4 shared papers
- Xi Wang 4 shared papers
- Yunhan Zhao 4 shared papers
- Zhiqiang Shen 4 shared papers
- Bo Li 3 shared papers
- Cong Wang 3 shared papers
- Hao Ye 3 shared papers
- James Bailey 3 shared papers
- Lin Ma 3 shared papers
- Qi Zhang 3 shared papers
- Tao Gui 3 shared papers
- Tao Xiang 3 shared papers