Yu-Gang Jiang

Identifiers

name variant Yu-Gang Jiang 0.60 · backfill

Papers (83)

Seeing Touch from Motion: A Unified Modality-Aware Visuo-Tactile Policy with Tactile Motion Correlation cs.RO · 2026 · author #10
Advancing Omnimodal Embodied Agents from Isolated Skills to Everyday Physical Autonomy cs.RO · 2026 · author #10
Event-Aware Instructed Assistant for Referring Video Segmentation cs.CV · 2026 · author #4
Unison: Benchmarking Unified Multimodal Models via Synergistic Understanding and Generation cs.CV · 2026 · author #4
MambaADv2: Evolving Duality-enhanced State Space Model for Unsupervised Anomaly Detection cs.CV · 2026 · author #7
ThinkingVLA: Interleaved Vision and Language Reasoning for Robotic Manipulation cs.RO · 2026 · author #11
RepWAM: World Action Modeling with Representation Visual-Action Tokenizers cs.CV · 2026 · author #7
ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations cs.CV · 2026 · author #18
IDEAL: In-DEpth ALignment Makes A Discrete Representation AutoEncoder cs.CV · 2026 · author #7
UniDexTok: A Unified Dexterous Hand Tokenizer from Real Data cs.RO · 2026 · author #7
OmniGen-AR: AutoRegressive Any-to-Image Generation cs.CV · 2026 · author #7
Teach Multimodal Recommendation Model to See via Personalized Visual Extraction and Adaptive Learning cs.IR · 2026 · author #5 as printed: Yu-gang Jiang
Two Bridges, One Pathway: From VLMs to Generalizable VLAs with Embodied Trajectory-Coupled Data cs.RO · 2026 · author #14
DisCo: World Models with Discrete Camera Motion Control cs.CV · 2026 · author #4
Coarse-to-Control: Action-Token Planning for Vision-Language-Action Models cs.RO · 2026 · author #12
ActiveMimic: Egocentric Video Pretraining with Active Perception cs.RO · 2026 · author #7
EvoMemNav: Efficient Self-Evolving Fine-Grained Memory for Zero-Shot Embodied Navigation cs.CV · 2026 · author #6
Constitutional On-Policy Safe Distillation cs.LG · 2026 · author #11
BraveGuard: From Open-World Threats to Safer Computer-Use Agents cs.CR · 2026 · author #16
CameraNoise: Enabling Faithful Camera Control in Video Diffusion through Geometry-Flow-Guided Noise Warping cs.CV · 2026 · author #14
VLA-Pro: Cross-Task Procedural Memory Transfer for Vision-Language-Action Models cs.RO · 2026 · author #6
Baton: Explicit Semantic Blueprints for Joint Video-Audio Generation cs.CV · 2026 · author #12
Afford-VLA: Action-Aligned Visual Planning via Internalized Affordance cs.RO · 2026 · author #9
A Survey of Large Audio Language Models: Generalization, Trustworthiness, and Outlook cs.SD · 2026 · author #32
Resolving Representation Ambiguity in Feedforward Novel View Synthesis Transformer via Semantic-Spatial Decoupling cs.CV · 2026 · author #7 as printed: Yu-gang Jiang
Bench2Drive-Robust: Benchmarking Closed-Loop Autonomous Driving under Deployment Perturbations cs.RO · 2026 · author #11
TAME: Test-Time Adversarial Prompt Tuning via Mixture-of-Experts for Vision-Language Models cs.CV · 2026 · author #9
DarkLLM: Learning Language-Driven Adversarial Attacks with Large Language Models cs.CR · 2026 · author #10
GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization cs.RO · 2026 · author #20
World Action Models: The Next Frontier in Embodied AI cs.RO · 2026 · author #14
Attention Itself Could Retrieve.RetrieveVGGT: Training-Free Long Context Streaming 3D Reconstruction via Query-Key Similarity Retrieval cs.CV · 2026 · author #4
From Synthetic to Real: Toward Identity-Consistent Makeup Transfer with Synthetic and Real Data cs.CV · 2026 · author #5
ML-Bench&Guard: Policy-Grounded Multilingual Safety Benchmark and Guardrail for Large Language Models cs.CL · 2026 · author #4
CL-bench Life: Can Language Models Learn from Real-Life Context? cs.CL · 2026 · author #36
Agentic Harness Engineering: Observability-Driven Automatic Evolution of Coding-Agent Harnesses cs.CL · 2026 · author #11
Spatiotemporal Sycophancy: Negation-Based Gaslighting in Video Large Language Models cs.CV · 2026 · author #6
SpatialImaginer: Towards Adaptive Visual Imagination for Spatial Reasoning cs.CV · 2026 · author #7
ROSE: Retrieval-Oriented Segmentation Enhancement cs.CV · 2026 · author #4
HazardArena: Evaluating Semantic Safety in Vision-Language-Action Models cs.RO · 2026 · author #11
CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generation cs.CV · 2026 · author #13
AssemLM: A Spatial Reasoning Multimodal Large Language Model for Robotic Assembly cs.RO · 2026 · author #7
Steering the Verifiability of Multimodal AI Hallucinations cs.AI · 2026 · author #7
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook cs.AI · 2026 · author #38
Safety in Embodied AI: A Survey of Risks, Attacks, and Defenses cs.CR · 2026 · author #38
Robotic Grasping and Placement Controlled by EEG-Based Hybrid Visual and Motor Imagery cs.RO · 2026 · author #5
SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents cs.CL · 2026 · author #20
Just Ask: Curious Code Agents Reveal System Prompts in Frontier LLMs cs.AI · 2026 · author #7
Memory in the Age of AI Agents cs.CL · 2025 · author #46
Boosting Reasoning in Large Multimodal Models via Activation Replay cs.CV · 2025 · author #7
Unify Robot Actions in Camera Frame cs.RO · 2025 · author #12
Ask-to-Clarify: Resolving Instruction Ambiguity through Multi-turn Dialogue cs.RO · 2025 · author #8
LeakyCLIP: Extracting Training Data from CLIP cs.CR · 2025 · author #6
Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety cs.CR · 2025 · author #48
Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation cs.CV · 2024 · author #11
Black-box Adversarial Attacks on Video Recognition Models cs.LG · 2019 · author #5
A Multi-task Neural Approach for Emotion Attribution, Classification and Summarization cs.LG · 2018 · author #5
Instance-level Sketch-based Retrieval by Deep Triplet Classification Siamese Network cs.CV · 2018 · author #5
Composite Binary Decomposition Networks cs.LG · 2018 · author #5
Non-local NetVLAD Encoding for Video Classification cs.CV · 2018 · author #6
Object Detection from Scratch with Deep Supervision cs.CV · 2018 · author #4
NAIS: Neural Attentive Item Similarity Model for Recommendation cs.IR · 2018 · author #5
Recurrent Fusion Network for Image Captioning cs.CV · 2018 · author #3
Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks cs.CV · 2018 · author #6
Social Anchor-Unit Graph Regularized Tensor Completion for Large-Scale Image Retagging cs.CV · 2018 · author #4
Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images cs.CV · 2018 · author #6
Learning to score the figure skating sports videos cs.MM · 2018 · author #5
Pose-Normalized Image Generation for Person Re-identification cs.CV · 2017 · author #7
Dual Skipping Networks cs.CV · 2017 · author #3
Recent Advances in Zero-shot Recognition cs.CV · 2017 · author #3
Multi-scale Deep Learning Architectures for Person Re-identification cs.CV · 2017 · author #3
DSOD: Learning Deeply Supervised Object Detectors from Scratch cs.CV · 2017 · author #4
Learning Fashion Compatibility with Bidirectional LSTMs cs.CV · 2017 · author #3
Aggregating Frame-level Features for Large-Scale Video Classification cs.CV · 2017 · author #6
Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification cs.MM · 2017 · author #1
Weakly Supervised Dense Video Captioning cs.CV · 2017 · author #6
Iterative Object and Part Transfer for Fine-Grained Recognition cs.CV · 2017 · author #2
Deep Learning for Video Classification and Captioning cs.CV · 2016 · author #4
The THUMOS Challenge on Action Recognition for Videos "in the Wild" cs.CV · 2016 · author #3
Heterogeneous Knowledge Transfer in Video Emotion Recognition, Attribution and Summarization cs.CV · 2015 · author #3
Fusing Multi-Stream Deep Networks for Video Classification cs.CV · 2015 · author #2
Evaluating Two-Stream CNN for Video Classification cs.CV · 2015 · author #5
Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification cs.CV · 2015 · author #3
Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks cs.CV · 2015 · author #1

Mentions

2606.29941 #10 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2601.21233 #7 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2606.27251 #10 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2606.26994 #4 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2606.26984 #4 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2606.23126 #7 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2606.17937 #11 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2606.03089 #11 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2606.13674 #7 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2604.08983 #7 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2606.11188 #18 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2606.11096 #7 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2606.10683 #7 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2606.09156 #7 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2606.09082 #5 · arxiv_oai · confidence 0.70 Yu-gang Jiang
2606.08520 #14 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2606.07967 #4 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2606.07107 #12 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2604.02029 #38 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
1511.04798 #3 · backfill · confidence 0.70 Yu-Gang Jiang
1509.06086 #2 · backfill · confidence 0.70 Yu-Gang Jiang
2606.06194 #7 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2509.15061 #8 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
1504.01920 #5 · backfill · confidence 0.70 Yu-Gang Jiang
1504.01561 #3 · backfill · confidence 0.70 Yu-Gang Jiang
2606.03509 #6 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
1502.07209 #1 · backfill · confidence 0.70 Yu-Gang Jiang
2605.12369 #20 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2606.01166 #16 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2602.12984 #20 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2605.30774 #14 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2605.29562 #6 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2605.25195 #12 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2605.02900 #38 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2605.24203 #9 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2508.00756 #6 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2605.20266 #32 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2605.18868 #10 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2605.18599 #7 · arxiv_oai · confidence 0.70 Yu-gang Jiang
2605.18059 #11 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2605.17577 #9 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
2604.25850 #11 · arxiv_oai · confidence 0.70 Yu-Gang Jiang

Frequent Coauthors

Zuxuan Wu 32 shared papers
Xiangyang Xue 17 shared papers
Xingjun Ma 15 shared papers
Yanwei Fu 13 shared papers
Xuanjing Huang 8 shared papers
Xipeng Qiu 7 shared papers
Ziyi Ye 7 shared papers
Junke Wang 6 shared papers
Xiaosong Jia 6 shared papers
Jingjing Chen 5 shared papers
Tao Gui 5 shared papers
Xiang Zheng 5 shared papers
Xin Wang 5 shared papers
Yixu Wang 5 shared papers
Bo Li 4 shared papers
Cong Wang 4 shared papers
Henghui Ding 4 shared papers
Jiaming Zhang 4 shared papers
Jianguo Li 4 shared papers
Jingjing Gong 4 shared papers