Yu-Gang Jiang

Identifiers

  • name variant Yu-Gang Jiang 0.60 · backfill

Papers (83)

  1. Seeing Touch from Motion: A Unified Modality-Aware Visuo-Tactile Policy with Tactile Motion Correlation cs.RO · 2026 · author #10
  2. Advancing Omnimodal Embodied Agents from Isolated Skills to Everyday Physical Autonomy cs.RO · 2026 · author #10
  3. Event-Aware Instructed Assistant for Referring Video Segmentation cs.CV · 2026 · author #4
  4. Unison: Benchmarking Unified Multimodal Models via Synergistic Understanding and Generation cs.CV · 2026 · author #4
  5. MambaADv2: Evolving Duality-enhanced State Space Model for Unsupervised Anomaly Detection cs.CV · 2026 · author #7
  6. ThinkingVLA: Interleaved Vision and Language Reasoning for Robotic Manipulation cs.RO · 2026 · author #11
  7. RepWAM: World Action Modeling with Representation Visual-Action Tokenizers cs.CV · 2026 · author #7
  8. ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations cs.CV · 2026 · author #18
  9. IDEAL: In-DEpth ALignment Makes A Discrete Representation AutoEncoder cs.CV · 2026 · author #7
  10. UniDexTok: A Unified Dexterous Hand Tokenizer from Real Data cs.RO · 2026 · author #7
  11. OmniGen-AR: AutoRegressive Any-to-Image Generation cs.CV · 2026 · author #7
  12. Teach Multimodal Recommendation Model to See via Personalized Visual Extraction and Adaptive Learning cs.IR · 2026 · author #5 as printed: Yu-gang Jiang
  13. Two Bridges, One Pathway: From VLMs to Generalizable VLAs with Embodied Trajectory-Coupled Data cs.RO · 2026 · author #14
  14. DisCo: World Models with Discrete Camera Motion Control cs.CV · 2026 · author #4
  15. Coarse-to-Control: Action-Token Planning for Vision-Language-Action Models cs.RO · 2026 · author #12
  16. ActiveMimic: Egocentric Video Pretraining with Active Perception cs.RO · 2026 · author #7
  17. EvoMemNav: Efficient Self-Evolving Fine-Grained Memory for Zero-Shot Embodied Navigation cs.CV · 2026 · author #6
  18. Constitutional On-Policy Safe Distillation cs.LG · 2026 · author #11
  19. BraveGuard: From Open-World Threats to Safer Computer-Use Agents cs.CR · 2026 · author #16
  20. CameraNoise: Enabling Faithful Camera Control in Video Diffusion through Geometry-Flow-Guided Noise Warping cs.CV · 2026 · author #14
  21. VLA-Pro: Cross-Task Procedural Memory Transfer for Vision-Language-Action Models cs.RO · 2026 · author #6
  22. Baton: Explicit Semantic Blueprints for Joint Video-Audio Generation cs.CV · 2026 · author #12
  23. Afford-VLA: Action-Aligned Visual Planning via Internalized Affordance cs.RO · 2026 · author #9
  24. A Survey of Large Audio Language Models: Generalization, Trustworthiness, and Outlook cs.SD · 2026 · author #32
  25. Resolving Representation Ambiguity in Feedforward Novel View Synthesis Transformer via Semantic-Spatial Decoupling cs.CV · 2026 · author #7 as printed: Yu-gang Jiang
  26. Bench2Drive-Robust: Benchmarking Closed-Loop Autonomous Driving under Deployment Perturbations cs.RO · 2026 · author #11
  27. TAME: Test-Time Adversarial Prompt Tuning via Mixture-of-Experts for Vision-Language Models cs.CV · 2026 · author #9
  28. DarkLLM: Learning Language-Driven Adversarial Attacks with Large Language Models cs.CR · 2026 · author #10
  29. GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization cs.RO · 2026 · author #20
  30. World Action Models: The Next Frontier in Embodied AI cs.RO · 2026 · author #14
  31. Attention Itself Could Retrieve. RetrieveVGGT: Training-Free Long Context Streaming 3D Reconstruction via Query-Key Similarity Retrieval cs.CV · 2026 · author #4
  32. From Synthetic to Real: Toward Identity-Consistent Makeup Transfer with Synthetic and Real Data cs.CV · 2026 · author #5
  33. ML-Bench&Guard: Policy-Grounded Multilingual Safety Benchmark and Guardrail for Large Language Models cs.CL · 2026 · author #4
  34. CL-bench Life: Can Language Models Learn from Real-Life Context? cs.CL · 2026 · author #36
  35. Agentic Harness Engineering: Observability-Driven Automatic Evolution of Coding-Agent Harnesses cs.CL · 2026 · author #11
  36. Spatiotemporal Sycophancy: Negation-Based Gaslighting in Video Large Language Models cs.CV · 2026 · author #6
  37. SpatialImaginer: Towards Adaptive Visual Imagination for Spatial Reasoning cs.CV · 2026 · author #7
  38. ROSE: Retrieval-Oriented Segmentation Enhancement cs.CV · 2026 · author #4
  39. HazardArena: Evaluating Semantic Safety in Vision-Language-Action Models cs.RO · 2026 · author #11
  40. CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generation cs.CV · 2026 · author #13
  41. AssemLM: A Spatial Reasoning Multimodal Large Language Model for Robotic Assembly cs.RO · 2026 · author #7
  42. Steering the Verifiability of Multimodal AI Hallucinations cs.AI · 2026 · author #7
  43. The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook cs.AI · 2026 · author #38
  44. Safety in Embodied AI: A Survey of Risks, Attacks, and Defenses cs.CR · 2026 · author #38
  45. Robotic Grasping and Placement Controlled by EEG-Based Hybrid Visual and Motor Imagery cs.RO · 2026 · author #5
  46. SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents cs.CL · 2026 · author #20
  47. Just Ask: Curious Code Agents Reveal System Prompts in Frontier LLMs cs.AI · 2026 · author #7
  48. Memory in the Age of AI Agents cs.CL · 2025 · author #46
  49. Boosting Reasoning in Large Multimodal Models via Activation Replay cs.CV · 2025 · author #7
  50. Unify Robot Actions in Camera Frame cs.RO · 2025 · author #12
  51. Ask-to-Clarify: Resolving Instruction Ambiguity through Multi-turn Dialogue cs.RO · 2025 · author #8
  52. LeakyCLIP: Extracting Training Data from CLIP cs.CR · 2025 · author #6
  53. Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety cs.CR · 2025 · author #48
  54. Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation cs.CV · 2024 · author #11
  55. Black-box Adversarial Attacks on Video Recognition Models cs.LG · 2019 · author #5
  56. A Multi-task Neural Approach for Emotion Attribution, Classification and Summarization cs.LG · 2018 · author #5
  57. Instance-level Sketch-based Retrieval by Deep Triplet Classification Siamese Network cs.CV · 2018 · author #5
  58. Composite Binary Decomposition Networks cs.LG · 2018 · author #5
  59. Non-local NetVLAD Encoding for Video Classification cs.CV · 2018 · author #6
  60. Object Detection from Scratch with Deep Supervision cs.CV · 2018 · author #4
  61. NAIS: Neural Attentive Item Similarity Model for Recommendation cs.IR · 2018 · author #5
  62. Recurrent Fusion Network for Image Captioning cs.CV · 2018 · author #3
  63. Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks cs.CV · 2018 · author #6
  64. Social Anchor-Unit Graph Regularized Tensor Completion for Large-Scale Image Retagging cs.CV · 2018 · author #4
  65. Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images cs.CV · 2018 · author #6
  66. Learning to score the figure skating sports videos cs.MM · 2018 · author #5
  67. Pose-Normalized Image Generation for Person Re-identification cs.CV · 2017 · author #7
  68. Dual Skipping Networks cs.CV · 2017 · author #3
  69. Recent Advances in Zero-shot Recognition cs.CV · 2017 · author #3
  70. Multi-scale Deep Learning Architectures for Person Re-identification cs.CV · 2017 · author #3
  71. DSOD: Learning Deeply Supervised Object Detectors from Scratch cs.CV · 2017 · author #4
  72. Learning Fashion Compatibility with Bidirectional LSTMs cs.CV · 2017 · author #3
  73. Aggregating Frame-level Features for Large-Scale Video Classification cs.CV · 2017 · author #6
  74. Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification cs.MM · 2017 · author #1
  75. Weakly Supervised Dense Video Captioning cs.CV · 2017 · author #6
  76. Iterative Object and Part Transfer for Fine-Grained Recognition cs.CV · 2017 · author #2
  77. Deep Learning for Video Classification and Captioning cs.CV · 2016 · author #4
  78. The THUMOS Challenge on Action Recognition for Videos "in the Wild" cs.CV · 2016 · author #3
  79. Heterogeneous Knowledge Transfer in Video Emotion Recognition, Attribution and Summarization cs.CV · 2015 · author #3
  80. Fusing Multi-Stream Deep Networks for Video Classification cs.CV · 2015 · author #2
  81. Evaluating Two-Stream CNN for Video Classification cs.CV · 2015 · author #5
  82. Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification cs.CV · 2015 · author #3
  83. Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks cs.CV · 2015 · author #1

Mentions

  • 2606.29941 #10 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2601.21233 #7 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2606.27251 #10 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2606.26994 #4 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2606.26984 #4 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2606.23126 #7 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2606.17937 #11 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2606.03089 #11 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2606.13674 #7 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2604.08983 #7 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2606.11188 #18 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2606.11096 #7 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2606.10683 #7 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2606.09156 #7 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2606.09082 #5 · arxiv_oai · confidence 0.70 Yu-gang Jiang
  • 2606.08520 #14 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2606.07967 #4 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2606.07107 #12 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2604.02029 #38 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 1511.04798 #3 · backfill · confidence 0.70 Yu-Gang Jiang
  • 1509.06086 #2 · backfill · confidence 0.70 Yu-Gang Jiang
  • 2606.06194 #7 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2509.15061 #8 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 1504.01920 #5 · backfill · confidence 0.70 Yu-Gang Jiang
  • 1504.01561 #3 · backfill · confidence 0.70 Yu-Gang Jiang
  • 2606.03509 #6 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 1502.07209 #1 · backfill · confidence 0.70 Yu-Gang Jiang
  • 2605.12369 #20 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2606.01166 #16 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2602.12984 #20 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.30774 #14 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.29562 #6 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.25195 #12 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.02900 #38 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.24203 #9 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2508.00756 #6 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.20266 #32 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.18868 #10 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.18599 #7 · arxiv_oai · confidence 0.70 Yu-gang Jiang
  • 2605.18059 #11 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.17577 #9 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2604.25850 #11 · arxiv_oai · confidence 0.70 Yu-Gang Jiang

Frequent Coauthors