Zuxuan Wu
Identifiers
- name variant Zuxuan Wu 0.60 · backfill
Papers (51)
- Seeing Touch from Motion: A Unified Modality-Aware Visuo-Tactile Policy with Tactile Motion Correlation cs.RO · 2026 · author #9
- Unified Multimodal Autoregressive Modeling with Shared Context-Visual Tokenizer is Key to Unification cs.CV · 2026 · author #9
- ThinkingVLA: Interleaved Vision and Language Reasoning for Robotic Manipulation cs.RO · 2026 · author #10
- RepWAM: World Action Modeling with Representation Visual-Action Tokenizers cs.CV · 2026 · author #6
- ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations cs.CV · 2026 · author #16
- IDEAL: In-DEpth ALignment Makes A Discrete Representation AutoEncoder cs.CV · 2026 · author #8
- OmniGen-AR: AutoRegressive Any-to-Image Generation cs.CV · 2026 · author #6
- DisCo: World Models with Discrete Camera Motion Control cs.CV · 2026 · author #5
- ActiveMimic: Egocentric Video Pretraining with Active Perception cs.RO · 2026 · author #6
- EvoMemNav: Efficient Self-Evolving Fine-Grained Memory for Zero-Shot Embodied Navigation cs.CV · 2026 · author #5
- CameraNoise: Enabling Faithful Camera Control in Video Diffusion through Geometry-Flow-Guided Noise Warping cs.CV · 2026 · author #13
- VLA-Pro: Cross-Task Procedural Memory Transfer for Vision-Language-Action Models cs.RO · 2026 · author #5
- Compositional Text-to-Image Generation Via Region-aware Bimodal Direct Preference Optimization cs.CV · 2026 · author #4
- Channel-wise Vector Quantization cs.CV · 2026 · author #5
- Baton: Explicit Semantic Blueprints for Joint Video-Audio Generation cs.CV · 2026 · author #11
- DecQ: Detail-Condensing Queries for Enhanced Reconstruction and Generation in Representation Autoencoders cs.CV · 2026 · author #4
- Resolving Representation Ambiguity in Feedforward Novel View Synthesis Transformer via Semantic-Spatial Decoupling cs.CV · 2026 · author #4
- Bench2Drive-Robust: Benchmarking Closed-Loop Autonomous Driving under Deployment Perturbations cs.RO · 2026 · author #8
- DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models cs.LG · 2026 · author #10
- GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization cs.RO · 2026 · author #18
- Attention Itself Could Retrieve.RetrieveVGGT: Training-Free Long Context Streaming 3D Reconstruction via Query-Key Similarity Retrieval cs.CV · 2026 · author #3
- GaMMA: Towards Joint Global-Temporal Music Understanding in Large Multimodal Models cs.SD · 2026 · author #6
- HazardArena: Evaluating Semantic Safety in Vision-Language-Action Models cs.RO · 2026 · author #8
- CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generation cs.CV · 2026 · author #12
- Steering the Verifiability of Multimodal AI Hallucinations cs.AI · 2026 · author #5
- HAD: Combining Hierarchical Diffusion with Metric-Decoupled RL for End-to-End Driving cs.RO · 2026 · author #7
- Safety in Embodied AI: A Survey of Risks, Attacks, and Defenses cs.CR · 2026 · author #23
- Unify Robot Actions in Camera Frame cs.RO · 2025 · author #11
- PreferThinker: Reasoning-based Personalized Image Preference Assessment cs.AI · 2025 · author #8
- Ask-to-Clarify: Resolving Instruction Ambiguity through Multi-turn Dialogue cs.RO · 2025 · author #7
- Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety cs.CR · 2025 · author #17
- Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation cs.CV · 2024 · author #10
- ACE: Adapting to Changing Environments for Semantic Segmentation cs.CV · 2019 · author #1
- An Analysis of Pre-Training on Object Detection cs.CV · 2019 · author #4
- The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation cs.AI · 2019 · author #2
- Compatible and Diverse Fashion Image Inpainting cs.CV · 2019 · author #2
- Self-Monitoring Navigation Agent via Auxiliary Progress Estimation cs.AI · 2019 · author #3
- AdaFrame: Adaptive Frame Selection for Fast Video Recognition cs.CV · 2018 · author #1
- DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation cs.CV · 2018 · author #1
- VITON: An Image-based Virtual Try-on Network cs.CV · 2017 · author #2
- BlockDrop: Dynamic Inference Paths in Residual Networks cs.CV · 2017 · author #1
- Automatic Spatially-aware Fashion Concept Discovery cs.CV · 2017 · author #2
- Learning Fashion Compatibility with Bidirectional LSTMs cs.CV · 2017 · author #2
- Aggregating Frame-level Features for Large-Scale Video Classification cs.CV · 2017 · author #5
- Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification cs.MM · 2017 · author #2
- Weakly-Supervised Spatial Context Networks cs.CV · 2017 · author #1
- Deep Learning for Video Classification and Captioning cs.CV · 2016 · author #1
- Fusing Multi-Stream Deep Networks for Video Classification cs.CV · 2015 · author #1
- Evaluating Two-Stream CNN for Video Classification cs.CV · 2015 · author #2
- Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification cs.CV · 2015 · author #1
- Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks cs.CV · 2015 · author #2
Mentions
- 2606.29941 #9 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2511.00609 #8 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2606.18249 #9 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2606.17937 #10 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2606.13674 #6 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2606.11188 #16 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2606.11096 #8 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2606.09156 #6 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2606.07967 #5 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 1509.06086 #1 · backfill · confidence 0.70 Zuxuan Wu
- 2606.06194 #6 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2509.15061 #7 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 1504.01920 #2 · backfill · confidence 0.70 Zuxuan Wu
- 1504.01561 #1 · backfill · confidence 0.70 Zuxuan Wu
- 2606.03509 #5 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 1502.07209 #2 · backfill · confidence 0.70 Zuxuan Wu
- 2605.12369 #18 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2605.30774 #13 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2605.29562 #5 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2605.28615 #4 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2605.26089 #5 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2605.25195 #11 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2605.02900 #23 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2605.22777 #4 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2605.18599 #4 · arxiv_oai · confidence 0.70 Zuxuan Wu
- 2605.18059 #8 · arxiv_oai · confidence 0.70 Zuxuan Wu
Frequent Coauthors
- Yu-Gang Jiang 32 shared papers
- Larry S. Davis 10 shared papers
- Junke Wang 6 shared papers
- Xintong Han 6 shared papers
- Xiangyang Xue 5 shared papers
- Xiaosong Jia 5 shared papers
- Xingjun Ma 5 shared papers
- Ziyi Ye 5 shared papers
- Xi Wang 4 shared papers
- Yitong Chen 4 shared papers
- Caiming Xiong 3 shared papers
- Chih-Yao Ma 3 shared papers
- Cong Wang 3 shared papers
- Guojin Zhong 3 shared papers
- Hao Ye 3 shared papers
- Junchi Yan 3 shared papers
- Shengqi Xu 3 shared papers
- Tianyi Lu 3 shared papers
- Weilin Huang 3 shared papers
- Xiang Zheng 3 shared papers