pith. machine review for the scientific record. sign in

Zuxuan Wu

Identifiers

No identifiers captured yet.

Papers (31)

  1. DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models cs.LG · 2026 · author #10
  2. GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization cs.RO · 2026 · author #18
  3. Attention Itself Could Retrieve.RetrieveVGGT: Training-Free Long Context Streaming 3D Reconstruction via Query-Key Similarity Retrieval cs.CV · 2026 · author #3
  4. GaMMA: Towards Joint Global-Temporal Music Understanding in Large Multimodal Models cs.SD · 2026 · author #6
  5. HazardArena: Evaluating Semantic Safety in Vision-Language-Action Models cs.RO · 2026 · author #8
  6. CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generation cs.CV · 2026 · author #12
  7. Steering the Verifiability of Multimodal AI Hallucinations cs.AI · 2026 · author #5
  8. HAD: Combining Hierarchical Diffusion with Metric-Decoupled RL for End-to-End Driving cs.RO · 2026 · author #7
  9. Safety in Embodied AI: A Survey of Risks, Attacks, and Defenses cs.CR · 2026 · author #22
  10. Unify Robot Actions in Camera Frame cs.RO · 2025 · author #11
  11. Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety cs.CR · 2025 · author #17
  12. Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation cs.CV · 2024 · author #10
  13. ACE: Adapting to Changing Environments for Semantic Segmentation cs.CV · 2019 · author #1
  14. An Analysis of Pre-Training on Object Detection cs.CV · 2019 · author #4
  15. The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation cs.AI · 2019 · author #2
  16. Compatible and Diverse Fashion Image Inpainting cs.CV · 2019 · author #2
  17. Self-Monitoring Navigation Agent via Auxiliary Progress Estimation cs.AI · 2019 · author #3
  18. AdaFrame: Adaptive Frame Selection for Fast Video Recognition cs.CV · 2018 · author #1
  19. DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation cs.CV · 2018 · author #1
  20. VITON: An Image-based Virtual Try-on Network cs.CV · 2017 · author #2
  21. BlockDrop: Dynamic Inference Paths in Residual Networks cs.CV · 2017 · author #1
  22. Automatic Spatially-aware Fashion Concept Discovery cs.CV · 2017 · author #2
  23. Learning Fashion Compatibility with Bidirectional LSTMs cs.CV · 2017 · author #2
  24. Aggregating Frame-level Features for Large-Scale Video Classification cs.CV · 2017 · author #5
  25. Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification cs.MM · 2017 · author #2
  26. Weakly-Supervised Spatial Context Networks cs.CV · 2017 · author #1
  27. Deep Learning for Video Classification and Captioning cs.CV · 2016 · author #1
  28. Fusing Multi-Stream Deep Networks for Video Classification cs.CV · 2015 · author #1
  29. Evaluating Two-Stream CNN for Video Classification cs.CV · 2015 · author #2
  30. Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification cs.CV · 2015 · author #1
  31. Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks cs.CV · 2015 · author #2

Mentions

No mention provenance yet.

Frequent Coauthors