pith. sign in

Zuxuan Wu

Identifiers

  • name variant Zuxuan Wu 0.60 · backfill

Papers (51)

  1. Seeing Touch from Motion: A Unified Modality-Aware Visuo-Tactile Policy with Tactile Motion Correlation cs.RO · 2026 · author #9
  2. Unified Multimodal Autoregressive Modeling with Shared Context-Visual Tokenizer is Key to Unification cs.CV · 2026 · author #9
  3. ThinkingVLA: Interleaved Vision and Language Reasoning for Robotic Manipulation cs.RO · 2026 · author #10
  4. RepWAM: World Action Modeling with Representation Visual-Action Tokenizers cs.CV · 2026 · author #6
  5. ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations cs.CV · 2026 · author #16
  6. IDEAL: In-DEpth ALignment Makes A Discrete Representation AutoEncoder cs.CV · 2026 · author #8
  7. OmniGen-AR: AutoRegressive Any-to-Image Generation cs.CV · 2026 · author #6
  8. DisCo: World Models with Discrete Camera Motion Control cs.CV · 2026 · author #5
  9. ActiveMimic: Egocentric Video Pretraining with Active Perception cs.RO · 2026 · author #6
  10. EvoMemNav: Efficient Self-Evolving Fine-Grained Memory for Zero-Shot Embodied Navigation cs.CV · 2026 · author #5
  11. CameraNoise: Enabling Faithful Camera Control in Video Diffusion through Geometry-Flow-Guided Noise Warping cs.CV · 2026 · author #13
  12. VLA-Pro: Cross-Task Procedural Memory Transfer for Vision-Language-Action Models cs.RO · 2026 · author #5
  13. Compositional Text-to-Image Generation Via Region-aware Bimodal Direct Preference Optimization cs.CV · 2026 · author #4
  14. Channel-wise Vector Quantization cs.CV · 2026 · author #5
  15. Baton: Explicit Semantic Blueprints for Joint Video-Audio Generation cs.CV · 2026 · author #11
  16. DecQ: Detail-Condensing Queries for Enhanced Reconstruction and Generation in Representation Autoencoders cs.CV · 2026 · author #4
  17. Resolving Representation Ambiguity in Feedforward Novel View Synthesis Transformer via Semantic-Spatial Decoupling cs.CV · 2026 · author #4
  18. Bench2Drive-Robust: Benchmarking Closed-Loop Autonomous Driving under Deployment Perturbations cs.RO · 2026 · author #8
  19. DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models cs.LG · 2026 · author #10
  20. GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization cs.RO · 2026 · author #18
  21. Attention Itself Could Retrieve.RetrieveVGGT: Training-Free Long Context Streaming 3D Reconstruction via Query-Key Similarity Retrieval cs.CV · 2026 · author #3
  22. GaMMA: Towards Joint Global-Temporal Music Understanding in Large Multimodal Models cs.SD · 2026 · author #6
  23. HazardArena: Evaluating Semantic Safety in Vision-Language-Action Models cs.RO · 2026 · author #8
  24. CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generation cs.CV · 2026 · author #12
  25. Steering the Verifiability of Multimodal AI Hallucinations cs.AI · 2026 · author #5
  26. HAD: Combining Hierarchical Diffusion with Metric-Decoupled RL for End-to-End Driving cs.RO · 2026 · author #7
  27. Safety in Embodied AI: A Survey of Risks, Attacks, and Defenses cs.CR · 2026 · author #23
  28. Unify Robot Actions in Camera Frame cs.RO · 2025 · author #11
  29. PreferThinker: Reasoning-based Personalized Image Preference Assessment cs.AI · 2025 · author #8
  30. Ask-to-Clarify: Resolving Instruction Ambiguity through Multi-turn Dialogue cs.RO · 2025 · author #7
  31. Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety cs.CR · 2025 · author #17
  32. Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation cs.CV · 2024 · author #10
  33. ACE: Adapting to Changing Environments for Semantic Segmentation cs.CV · 2019 · author #1
  34. An Analysis of Pre-Training on Object Detection cs.CV · 2019 · author #4
  35. The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation cs.AI · 2019 · author #2
  36. Compatible and Diverse Fashion Image Inpainting cs.CV · 2019 · author #2
  37. Self-Monitoring Navigation Agent via Auxiliary Progress Estimation cs.AI · 2019 · author #3
  38. AdaFrame: Adaptive Frame Selection for Fast Video Recognition cs.CV · 2018 · author #1
  39. DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation cs.CV · 2018 · author #1
  40. VITON: An Image-based Virtual Try-on Network cs.CV · 2017 · author #2
  41. BlockDrop: Dynamic Inference Paths in Residual Networks cs.CV · 2017 · author #1
  42. Automatic Spatially-aware Fashion Concept Discovery cs.CV · 2017 · author #2
  43. Learning Fashion Compatibility with Bidirectional LSTMs cs.CV · 2017 · author #2
  44. Aggregating Frame-level Features for Large-Scale Video Classification cs.CV · 2017 · author #5
  45. Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification cs.MM · 2017 · author #2
  46. Weakly-Supervised Spatial Context Networks cs.CV · 2017 · author #1
  47. Deep Learning for Video Classification and Captioning cs.CV · 2016 · author #1
  48. Fusing Multi-Stream Deep Networks for Video Classification cs.CV · 2015 · author #1
  49. Evaluating Two-Stream CNN for Video Classification cs.CV · 2015 · author #2
  50. Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification cs.CV · 2015 · author #1
  51. Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks cs.CV · 2015 · author #2

Mentions

  • 2606.29941 #9 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2511.00609 #8 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2606.18249 #9 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2606.17937 #10 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2606.13674 #6 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2606.11188 #16 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2606.11096 #8 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2606.09156 #6 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2606.07967 #5 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 1509.06086 #1 · backfill · confidence 0.70 Zuxuan Wu
  • 2606.06194 #6 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2509.15061 #7 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 1504.01920 #2 · backfill · confidence 0.70 Zuxuan Wu
  • 1504.01561 #1 · backfill · confidence 0.70 Zuxuan Wu
  • 2606.03509 #5 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 1502.07209 #2 · backfill · confidence 0.70 Zuxuan Wu
  • 2605.12369 #18 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2605.30774 #13 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2605.29562 #5 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2605.28615 #4 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2605.26089 #5 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2605.25195 #11 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2605.02900 #23 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2605.22777 #4 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2605.18599 #4 · arxiv_oai · confidence 0.70 Zuxuan Wu
  • 2605.18059 #8 · arxiv_oai · confidence 0.70 Zuxuan Wu

Frequent Coauthors