Xiangyu Yue

Identifiers

  • name variant Xiangyu Yue 0.60 · backfill

Papers (28)

  1. X-Stream: Exploring MLLMs as Multiplexers for Multi-Stream Understanding cs.CV · 2026 · author #12
  2. $\tau_0$-WM: A Unified Video-Action World Model for Robotic Manipulation cs.RO · 2026 · author #18
  3. Learning Structural Latent Points for Efficient Visual Representations in Robotic Manipulation cs.RO · 2026 · author #10
  4. From Web to Pixels: Bringing Agentic Search into Visual Perception cs.CV · 2026 · author #6
  5. BitLM: Unlocking Multi-Token Language Generation with Bitwise Continuous Diffusion cs.CL · 2026 · author #6
  6. OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents cs.CV · 2026 · author #8
  7. OpenGame: Open Agentic Coding for Games cs.SE · 2026 · author #11
  8. A Progressive Training Strategy for Vision-Language Models to Counteract Spatio-Temporal Hallucinations in Embodied Reasoning cs.AI · 2026 · author #11
  9. Gen-Searcher: Reinforcing Agentic Search for Image Generation cs.CV · 2026 · author #10
  10. RISE: Self-Improving Robot Policy with Compositional World Model cs.RO · 2026 · author #12
  11. MVISTA-4D: View-Consistent 4D World Model with Test-Time Action Inference for Robotic Manipulation cs.CV · 2026 · author #11
  12. Exploring Reasoning Reward Model for Agents cs.AI · 2026 · author #10
  13. AdaTooler-V: Adaptive Tool-Use for Images and Videos cs.CV · 2025 · author #11
  14. OneThinker: All-in-one Reasoning Model for Image and Video cs.CV · 2025 · author #14
  15. SpaceVista: All-Scale Visual Spatial Reasoning from mm to km cs.CV · 2025 · author #11
  16. VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning cs.CV · 2025 · author #10
  17. ReSim: Reliable World Simulation for Autonomous Driving cs.CV · 2025 · author #9
  18. MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence cs.CV · 2025 · author #10
  19. Video-R1: Reinforcing Video Reasoning in MLLMs cs.CV · 2025 · author #10
  20. LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model cs.CV · 2023 · author #10
  21. Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge cs.CV · 2018 · author #398
  22. A Novel Domain Adaptation Framework for Medical Image Segmentation cs.CV · 2018 · author #5
  23. Scenic: A Language for Scenario Specification and Scene Generation cs.PL · 2018 · author #4
  24. SqueezeSegV2: Improved Model Structure and Unsupervised Domain Adaptation for Road-Object Segmentation from a LiDAR Point Cloud cs.CV · 2018 · author #4
  25. Counterexample-Guided Data Augmentation cs.LG · 2018 · author #3
  26. A LiDAR Point Cloud Generator: from a Virtual World to Autonomous Driving cs.CV · 2018 · author #1
  27. Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions cs.CV · 2017 · author #3
  28. SqueezeSeg: Convolutional Neural Nets with Recurrent CRF for Real-Time Road-Object Segmentation from 3D LiDAR Point Cloud cs.CV · 2017 · author #3

Mentions

  • 2606.02482 #12 · arxiv_oai · confidence 0.70 Xiangyu Yue
  • 2606.01027 #18 · arxiv_oai · confidence 0.70 Xiangyu Yue
  • 2510.08555 #10 · arxiv_oai · confidence 0.70 Xiangyu Yue
  • 2602.09878 #11 · arxiv_oai · confidence 0.70 Xiangyu Yue
  • 2510.09606 #11 · arxiv_oai · confidence 0.70 Xiangyu Yue
  • 2505.23764 #10 · arxiv_oai · confidence 0.70 Xiangyu Yue
  • 2603.28767 #10 · arxiv_oai · confidence 0.70 Xiangyu Yue
  • 2605.21258 #10 · arxiv_oai · confidence 0.70 Xiangyu Yue

Frequent Coauthors