Identifiers
-
name variant
Yu-Gang Jiang
0.60 · backfill
Papers (83)
-
Seeing Touch from Motion: A Unified Modality-Aware Visuo-Tactile Policy with Tactile Motion Correlation
cs.RO · 2026 · author #10
-
Advancing Omnimodal Embodied Agents from Isolated Skills to Everyday Physical Autonomy
cs.RO · 2026 · author #10
-
Event-Aware Instructed Assistant for Referring Video Segmentation
cs.CV · 2026 · author #4
-
Unison: Benchmarking Unified Multimodal Models via Synergistic Understanding and Generation
cs.CV · 2026 · author #4
-
MambaADv2: Evolving Duality-enhanced State Space Model for Unsupervised Anomaly Detection
cs.CV · 2026 · author #7
-
ThinkingVLA: Interleaved Vision and Language Reasoning for Robotic Manipulation
cs.RO · 2026 · author #11
-
RepWAM: World Action Modeling with Representation Visual-Action Tokenizers
cs.CV · 2026 · author #7
-
ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations
cs.CV · 2026 · author #18
-
IDEAL: In-DEpth ALignment Makes A Discrete Representation AutoEncoder
cs.CV · 2026 · author #7
-
UniDexTok: A Unified Dexterous Hand Tokenizer from Real Data
cs.RO · 2026 · author #7
-
OmniGen-AR: AutoRegressive Any-to-Image Generation
cs.CV · 2026 · author #7
-
Teach Multimodal Recommendation Model to See via Personalized Visual Extraction and Adaptive Learning
cs.IR · 2026 · author #5
as printed: Yu-gang Jiang
-
Two Bridges, One Pathway: From VLMs to Generalizable VLAs with Embodied Trajectory-Coupled Data
cs.RO · 2026 · author #14
-
DisCo: World Models with Discrete Camera Motion Control
cs.CV · 2026 · author #4
-
Coarse-to-Control: Action-Token Planning for Vision-Language-Action Models
cs.RO · 2026 · author #12
-
ActiveMimic: Egocentric Video Pretraining with Active Perception
cs.RO · 2026 · author #7
-
EvoMemNav: Efficient Self-Evolving Fine-Grained Memory for Zero-Shot Embodied Navigation
cs.CV · 2026 · author #6
-
Constitutional On-Policy Safe Distillation
cs.LG · 2026 · author #11
-
BraveGuard: From Open-World Threats to Safer Computer-Use Agents
cs.CR · 2026 · author #16
-
CameraNoise: Enabling Faithful Camera Control in Video Diffusion through Geometry-Flow-Guided Noise Warping
cs.CV · 2026 · author #14
-
VLA-Pro: Cross-Task Procedural Memory Transfer for Vision-Language-Action Models
cs.RO · 2026 · author #6
-
Baton: Explicit Semantic Blueprints for Joint Video-Audio Generation
cs.CV · 2026 · author #12
-
Afford-VLA: Action-Aligned Visual Planning via Internalized Affordance
cs.RO · 2026 · author #9
-
A Survey of Large Audio Language Models: Generalization, Trustworthiness, and Outlook
cs.SD · 2026 · author #32
-
Resolving Representation Ambiguity in Feedforward Novel View Synthesis Transformer via Semantic-Spatial Decoupling
cs.CV · 2026 · author #7
as printed: Yu-gang Jiang
-
Bench2Drive-Robust: Benchmarking Closed-Loop Autonomous Driving under Deployment Perturbations
cs.RO · 2026 · author #11
-
TAME: Test-Time Adversarial Prompt Tuning via Mixture-of-Experts for Vision-Language Models
cs.CV · 2026 · author #9
-
DarkLLM: Learning Language-Driven Adversarial Attacks with Large Language Models
cs.CR · 2026 · author #10
-
GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization
cs.RO · 2026 · author #20
-
World Action Models: The Next Frontier in Embodied AI
cs.RO · 2026 · author #14
-
Attention Itself Could Retrieve. RetrieveVGGT: Training-Free Long Context Streaming 3D Reconstruction via Query-Key Similarity Retrieval
cs.CV · 2026 · author #4
-
From Synthetic to Real: Toward Identity-Consistent Makeup Transfer with Synthetic and Real Data
cs.CV · 2026 · author #5
-
ML-Bench&Guard: Policy-Grounded Multilingual Safety Benchmark and Guardrail for Large Language Models
cs.CL · 2026 · author #4
-
CL-bench Life: Can Language Models Learn from Real-Life Context?
cs.CL · 2026 · author #36
-
Agentic Harness Engineering: Observability-Driven Automatic Evolution of Coding-Agent Harnesses
cs.CL · 2026 · author #11
-
Spatiotemporal Sycophancy: Negation-Based Gaslighting in Video Large Language Models
cs.CV · 2026 · author #6
-
SpatialImaginer: Towards Adaptive Visual Imagination for Spatial Reasoning
cs.CV · 2026 · author #7
-
ROSE: Retrieval-Oriented Segmentation Enhancement
cs.CV · 2026 · author #4
-
HazardArena: Evaluating Semantic Safety in Vision-Language-Action Models
cs.RO · 2026 · author #11
-
CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generation
cs.CV · 2026 · author #13
-
AssemLM: A Spatial Reasoning Multimodal Large Language Model for Robotic Assembly
cs.RO · 2026 · author #7
-
Steering the Verifiability of Multimodal AI Hallucinations
cs.AI · 2026 · author #7
-
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook
cs.AI · 2026 · author #38
-
Safety in Embodied AI: A Survey of Risks, Attacks, and Defenses
cs.CR · 2026 · author #38
-
Robotic Grasping and Placement Controlled by EEG-Based Hybrid Visual and Motor Imagery
cs.RO · 2026 · author #5
-
SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents
cs.CL · 2026 · author #20
-
Just Ask: Curious Code Agents Reveal System Prompts in Frontier LLMs
cs.AI · 2026 · author #7
-
Memory in the Age of AI Agents
cs.CL · 2025 · author #46
-
Boosting Reasoning in Large Multimodal Models via Activation Replay
cs.CV · 2025 · author #7
-
Unify Robot Actions in Camera Frame
cs.RO · 2025 · author #12
-
Ask-to-Clarify: Resolving Instruction Ambiguity through Multi-turn Dialogue
cs.RO · 2025 · author #8
-
LeakyCLIP: Extracting Training Data from CLIP
cs.CR · 2025 · author #6
-
Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety
cs.CR · 2025 · author #48
-
Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
cs.CV · 2024 · author #11
-
Black-box Adversarial Attacks on Video Recognition Models
cs.LG · 2019 · author #5
-
A Multi-task Neural Approach for Emotion Attribution, Classification and Summarization
cs.LG · 2018 · author #5
-
Instance-level Sketch-based Retrieval by Deep Triplet Classification Siamese Network
cs.CV · 2018 · author #5
-
Composite Binary Decomposition Networks
cs.LG · 2018 · author #5
-
Non-local NetVLAD Encoding for Video Classification
cs.CV · 2018 · author #6
-
Object Detection from Scratch with Deep Supervision
cs.CV · 2018 · author #4
-
NAIS: Neural Attentive Item Similarity Model for Recommendation
cs.IR · 2018 · author #5
-
Recurrent Fusion Network for Image Captioning
cs.CV · 2018 · author #3
-
Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks
cs.CV · 2018 · author #6
-
Social Anchor-Unit Graph Regularized Tensor Completion for Large-Scale Image Retagging
cs.CV · 2018 · author #4
-
Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images
cs.CV · 2018 · author #6
-
Learning to score the figure skating sports videos
cs.MM · 2018 · author #5
-
Pose-Normalized Image Generation for Person Re-identification
cs.CV · 2017 · author #7
-
Dual Skipping Networks
cs.CV · 2017 · author #3
-
Recent Advances in Zero-shot Recognition
cs.CV · 2017 · author #3
-
Multi-scale Deep Learning Architectures for Person Re-identification
cs.CV · 2017 · author #3
-
DSOD: Learning Deeply Supervised Object Detectors from Scratch
cs.CV · 2017 · author #4
-
Learning Fashion Compatibility with Bidirectional LSTMs
cs.CV · 2017 · author #3
-
Aggregating Frame-level Features for Large-Scale Video Classification
cs.CV · 2017 · author #6
-
Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification
cs.MM · 2017 · author #1
-
Weakly Supervised Dense Video Captioning
cs.CV · 2017 · author #6
-
Iterative Object and Part Transfer for Fine-Grained Recognition
cs.CV · 2017 · author #2
-
Deep Learning for Video Classification and Captioning
cs.CV · 2016 · author #4
-
The THUMOS Challenge on Action Recognition for Videos "in the Wild"
cs.CV · 2016 · author #3
-
Heterogeneous Knowledge Transfer in Video Emotion Recognition, Attribution and Summarization
cs.CV · 2015 · author #3
-
Fusing Multi-Stream Deep Networks for Video Classification
cs.CV · 2015 · author #2
-
Evaluating Two-Stream CNN for Video Classification
cs.CV · 2015 · author #5
-
Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification
cs.CV · 2015 · author #3
-
Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks
cs.CV · 2015 · author #1
Mentions
-
2606.29941
#10 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2601.21233
#7 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2606.27251
#10 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2606.26994
#4 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2606.26984
#4 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2606.23126
#7 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2606.17937
#11 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2606.03089
#11 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2606.13674
#7 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2604.08983
#7 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2606.11188
#18 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2606.11096
#7 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2606.10683
#7 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2606.09156
#7 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2606.09082
#5 · arxiv_oai · confidence 0.70
Yu-gang Jiang
-
2606.08520
#14 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2606.07967
#4 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2606.07107
#12 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2604.02029
#38 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
1511.04798
#3 · backfill · confidence 0.70
Yu-Gang Jiang
-
1509.06086
#2 · backfill · confidence 0.70
Yu-Gang Jiang
-
2606.06194
#7 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2509.15061
#8 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
1504.01920
#5 · backfill · confidence 0.70
Yu-Gang Jiang
-
1504.01561
#3 · backfill · confidence 0.70
Yu-Gang Jiang
-
2606.03509
#6 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
1502.07209
#1 · backfill · confidence 0.70
Yu-Gang Jiang
-
2605.12369
#20 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2606.01166
#16 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2602.12984
#20 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2605.30774
#14 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2605.29562
#6 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2605.25195
#12 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2605.02900
#38 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2605.24203
#9 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2508.00756
#6 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2605.20266
#32 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2605.18868
#10 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2605.18599
#7 · arxiv_oai · confidence 0.70
Yu-gang Jiang
-
2605.18059
#11 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2605.17577
#9 · arxiv_oai · confidence 0.70
Yu-Gang Jiang
-
2604.25850
#11 · arxiv_oai · confidence 0.70
Yu-Gang Jiang