Zuxuan Wu — Pith Author Registry

Identifiers

name variant Zuxuan Wu 0.60 · backfill

Papers (51)

Seeing Touch from Motion: A Unified Modality-Aware Visuo-Tactile Policy with Tactile Motion Correlation cs.RO · 2026 · author #9
Unified Multimodal Autoregressive Modeling with Shared Context-Visual Tokenizer is Key to Unification cs.CV · 2026 · author #9
ThinkingVLA: Interleaved Vision and Language Reasoning for Robotic Manipulation cs.RO · 2026 · author #10
RepWAM: World Action Modeling with Representation Visual-Action Tokenizers cs.CV · 2026 · author #6
ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations cs.CV · 2026 · author #16
IDEAL: In-DEpth ALignment Makes A Discrete Representation AutoEncoder cs.CV · 2026 · author #8
OmniGen-AR: AutoRegressive Any-to-Image Generation cs.CV · 2026 · author #6
DisCo: World Models with Discrete Camera Motion Control cs.CV · 2026 · author #5
ActiveMimic: Egocentric Video Pretraining with Active Perception cs.RO · 2026 · author #6
EvoMemNav: Efficient Self-Evolving Fine-Grained Memory for Zero-Shot Embodied Navigation cs.CV · 2026 · author #5
CameraNoise: Enabling Faithful Camera Control in Video Diffusion through Geometry-Flow-Guided Noise Warping cs.CV · 2026 · author #13
VLA-Pro: Cross-Task Procedural Memory Transfer for Vision-Language-Action Models cs.RO · 2026 · author #5
Compositional Text-to-Image Generation Via Region-aware Bimodal Direct Preference Optimization cs.CV · 2026 · author #4
Channel-wise Vector Quantization cs.CV · 2026 · author #5
Baton: Explicit Semantic Blueprints for Joint Video-Audio Generation cs.CV · 2026 · author #11
DecQ: Detail-Condensing Queries for Enhanced Reconstruction and Generation in Representation Autoencoders cs.CV · 2026 · author #4
Resolving Representation Ambiguity in Feedforward Novel View Synthesis Transformer via Semantic-Spatial Decoupling cs.CV · 2026 · author #4
Bench2Drive-Robust: Benchmarking Closed-Loop Autonomous Driving under Deployment Perturbations cs.RO · 2026 · author #8
DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models cs.LG · 2026 · author #10
GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization cs.RO · 2026 · author #18
Attention Itself Could Retrieve. RetrieveVGGT: Training-Free Long Context Streaming 3D Reconstruction via Query-Key Similarity Retrieval cs.CV · 2026 · author #3
GaMMA: Towards Joint Global-Temporal Music Understanding in Large Multimodal Models cs.SD · 2026 · author #6
HazardArena: Evaluating Semantic Safety in Vision-Language-Action Models cs.RO · 2026 · author #8
CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generation cs.CV · 2026 · author #12
Steering the Verifiability of Multimodal AI Hallucinations cs.AI · 2026 · author #5
HAD: Combining Hierarchical Diffusion with Metric-Decoupled RL for End-to-End Driving cs.RO · 2026 · author #7
Safety in Embodied AI: A Survey of Risks, Attacks, and Defenses cs.CR · 2026 · author #23
Unify Robot Actions in Camera Frame cs.RO · 2025 · author #11
PreferThinker: Reasoning-based Personalized Image Preference Assessment cs.AI · 2025 · author #8
Ask-to-Clarify: Resolving Instruction Ambiguity through Multi-turn Dialogue cs.RO · 2025 · author #7
Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety cs.CR · 2025 · author #17
Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation cs.CV · 2024 · author #10
ACE: Adapting to Changing Environments for Semantic Segmentation cs.CV · 2019 · author #1
An Analysis of Pre-Training on Object Detection cs.CV · 2019 · author #4
The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation cs.AI · 2019 · author #2
Compatible and Diverse Fashion Image Inpainting cs.CV · 2019 · author #2
Self-Monitoring Navigation Agent via Auxiliary Progress Estimation cs.AI · 2019 · author #3
AdaFrame: Adaptive Frame Selection for Fast Video Recognition cs.CV · 2018 · author #1
DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation cs.CV · 2018 · author #1
VITON: An Image-based Virtual Try-on Network cs.CV · 2017 · author #2
BlockDrop: Dynamic Inference Paths in Residual Networks cs.CV · 2017 · author #1
Automatic Spatially-aware Fashion Concept Discovery cs.CV · 2017 · author #2
Learning Fashion Compatibility with Bidirectional LSTMs cs.CV · 2017 · author #2
Aggregating Frame-level Features for Large-Scale Video Classification cs.CV · 2017 · author #5
Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification cs.MM · 2017 · author #2
Weakly-Supervised Spatial Context Networks cs.CV · 2017 · author #1
Deep Learning for Video Classification and Captioning cs.CV · 2016 · author #1
Fusing Multi-Stream Deep Networks for Video Classification cs.CV · 2015 · author #1
Evaluating Two-Stream CNN for Video Classification cs.CV · 2015 · author #2
Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification cs.CV · 2015 · author #1
Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks cs.CV · 2015 · author #2

Mentions

2606.29941 #9 · arxiv_oai · confidence 0.70 Zuxuan Wu
2511.00609 #8 · arxiv_oai · confidence 0.70 Zuxuan Wu
2606.18249 #9 · arxiv_oai · confidence 0.70 Zuxuan Wu
2606.17937 #10 · arxiv_oai · confidence 0.70 Zuxuan Wu
2606.13674 #6 · arxiv_oai · confidence 0.70 Zuxuan Wu
2606.11188 #16 · arxiv_oai · confidence 0.70 Zuxuan Wu
2606.11096 #8 · arxiv_oai · confidence 0.70 Zuxuan Wu
2606.09156 #6 · arxiv_oai · confidence 0.70 Zuxuan Wu
2606.07967 #5 · arxiv_oai · confidence 0.70 Zuxuan Wu
1509.06086 #1 · backfill · confidence 0.70 Zuxuan Wu
2606.06194 #6 · arxiv_oai · confidence 0.70 Zuxuan Wu
2509.15061 #7 · arxiv_oai · confidence 0.70 Zuxuan Wu
1504.01920 #2 · backfill · confidence 0.70 Zuxuan Wu
1504.01561 #1 · backfill · confidence 0.70 Zuxuan Wu
2606.03509 #5 · arxiv_oai · confidence 0.70 Zuxuan Wu
1502.07209 #2 · backfill · confidence 0.70 Zuxuan Wu
2605.12369 #18 · arxiv_oai · confidence 0.70 Zuxuan Wu
2605.30774 #13 · arxiv_oai · confidence 0.70 Zuxuan Wu
2605.29562 #5 · arxiv_oai · confidence 0.70 Zuxuan Wu
2605.28615 #4 · arxiv_oai · confidence 0.70 Zuxuan Wu
2605.26089 #5 · arxiv_oai · confidence 0.70 Zuxuan Wu
2605.25195 #11 · arxiv_oai · confidence 0.70 Zuxuan Wu
2605.02900 #23 · arxiv_oai · confidence 0.70 Zuxuan Wu
2605.22777 #4 · arxiv_oai · confidence 0.70 Zuxuan Wu
2605.18599 #4 · arxiv_oai · confidence 0.70 Zuxuan Wu
2605.18059 #8 · arxiv_oai · confidence 0.70 Zuxuan Wu

Frequent Coauthors

Yu-Gang Jiang 32 shared papers
Larry S. Davis 10 shared papers
Junke Wang 6 shared papers
Xintong Han 6 shared papers
Xiangyang Xue 5 shared papers
Xiaosong Jia 5 shared papers
Xingjun Ma 5 shared papers
Ziyi Ye 5 shared papers
Xi Wang 4 shared papers
Yitong Chen 4 shared papers
Caiming Xiong 3 shared papers
Chih-Yao Ma 3 shared papers
Cong Wang 3 shared papers
Guojin Zhong 3 shared papers
Hao Ye 3 shared papers
Junchi Yan 3 shared papers
Shengqi Xu 3 shared papers
Tianyi Lu 3 shared papers
Weilin Huang 3 shared papers
Xiang Zheng 3 shared papers