pith. sign in

Yaowei Wang

Identifiers

  • name variant Yaowei Wang 0.60 · backfill

Papers (23)

  1. GenEraser: Generalizable Video Object Removal via Balanced Text-Mask Guidance and Decoupled Locator-Preserver cs.CV · 2026 · author #5
  2. RadioFormer3D: Weakly Supervised 3D Radio Map Estimation in Low-Altitude Airspace via Generative Modeling cs.CV · 2026 · author #5
  3. CVSearch: Empowering Multimodal LLMs with Cognitive Visual Search for High-Resolution Image Perception cs.CV · 2026 · author #7
  4. SegCompass: Exploring Interpretable Alignment with Sparse Autoencoders for Enhanced Reasoning Segmentation cs.CV · 2026 · author #7
  5. CPC-VAR:Continual Personalized and Compositional Generation in Visual Autoregressive Models cs.CV · 2026 · author #7
  6. Dance Across Shifts: Forward-Facilitation Continual Test-Time Adaptation through Dynamic Style Bridging cs.CV · 2026 · author #5
  7. MARS: Technical Report for the CASTLE Challenge at EgoVis 2026 cs.CV · 2026 · author #6
  8. Revisiting Uncertainty: On Evidential Learning for Partially Relevant Video Retrieval cs.CV · 2026 · author #7
  9. CAST: Mitigating Object Hallucination in Large Vision-Language Models via Caption-Guided Visual Attention Steering cs.CV · 2026 · author #10
  10. Beyond Heuristics: Learnable Density Control for 3D Gaussian Splatting cs.CV · 2026 · author #5
  11. Efficient Adversarial Training via Criticality-Aware Fine-Tuning cs.CV · 2026 · author #4
  12. Latent-Condensed Transformer for Efficient Long Context Modeling cs.CL · 2026 · author #7
  13. Imagine Before Concentration: Diffusion-Guided Registers Enhance Partially Relevant Video Retrieval cs.CV · 2026 · author #5
  14. From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents cs.CV · 2026 · author #6
  15. MCAT: Scaling Many-to-Many Speech-to-Text Translation with MLLMs to 70 Languages cs.CL · 2025 · author #10
  16. HiPrune: Hierarchical Attention for Efficient Token Pruning in Vision-Language Models cs.CV · 2025 · author #8
  17. Norm$\times$Direction: Restoring the Missing Query Norm in Vision Linear Attention cs.LG · 2025 · author #5
  18. VMamba: Visual State Space Model cs.CV · 2024 · author #6
  19. ODN: Opening the Deep Network for Open-set Action Recognition cs.CV · 2019 · author #3
  20. Deep Transfer Learning for Person Re-identification cs.CV · 2016 · author #2
  21. Learning long-term dependencies for action recognition with a biologically-inspired deep network cs.CV · 2016 · author #3
  22. Joint Network based Attention for Action Recognition cs.CV · 2016 · author #3
  23. Sequential Deep Trajectory Descriptor for Action Recognition with Three-stream CNN cs.CV · 2016 · author #3

Mentions

  • 2605.30045 #5 · arxiv_oai · confidence 0.70 Yaowei Wang
  • 2605.29538 #5 · arxiv_oai · confidence 0.70 Yaowei Wang
  • 2506.21137 #5 · arxiv_oai · confidence 0.70 Yaowei Wang
  • 2605.23655 #7 · arxiv_oai · confidence 0.70 Yaowei Wang
  • 2605.22658 #7 · arxiv_oai · confidence 0.70 Yaowei Wang
  • 2605.19750 #7 · arxiv_oai · confidence 0.70 Yaowei Wang
  • 2605.18608 #5 · arxiv_oai · confidence 0.70 Yaowei Wang
  • 2605.18176 #6 · arxiv_oai · confidence 0.70 Yaowei Wang
  • 2401.10166 #6 · arxiv_oai · confidence 0.70 Yaowei Wang

Frequent Coauthors