Yueting Zhuang

Identifiers

  • name variant Yueting Zhuang 0.60 · backfill

Papers (41)

  1. VisualThink-VLA: Visual Intermediate Reasoning for Effective and Low-Latency Vision-Language-Action Policies cs.CV · 2026 · author #12
  2. MAIGO: Mitigating Lost-in-Conversation with History-Cleaned On-Policy Self-Distillation cs.CL · 2026 · author #8
  3. InstructSAM: Segment Any Instance with Any Instructions cs.CV · 2026 · author #8
  4. CrossView Suite: Harnessing Cross-view Spatial Intelligence of MLLMs with Dataset, Model and Benchmark cs.CV · 2026 · author #7
  5. Self-Distilled Agentic Reinforcement Learning cs.LG · 2026 · author #10
  6. Milestone-Guided Policy Learning for Long-Horizon Language Agents cs.CL · 2026 · author #9
  7. SpatialFusion: Endowing Unified Image Generation with Intrinsic 3D Geometric Awareness cs.CV · 2026 · author #6
  8. SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments cs.CV · 2026 · author #18
  9. UI-Zoomer: Uncertainty-Driven Adaptive Zoom-In for GUI Grounding cs.CV · 2026 · author #10
  10. UI-Copilot: Advancing Long-Horizon GUI Automation via Tool-Integrated Policy Optimization cs.LG · 2026 · author #10
  11. LMMs Meet Object-Centric Vision: Understanding, Segmentation, Editing and Generation cs.CV · 2026 · author #9
  12. ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents cs.LG · 2026 · author #6
  13. Seeing but Not Thinking: Routing Distraction in Multimodal Mixture-of-Experts cs.CV · 2026 · author #10
  14. KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation cs.AI · 2026 · author #15
  15. Rendering Multi-Human and Multi-Object with 3D Gaussian Splatting cs.CV · 2026 · author #5
  16. SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization cs.LG · 2026 · author #9
  17. GroundAct: Can LLM Agents Ground Actions in Environmental States? cs.CL · 2025 · author #11
  18. HeartcareGPT: A Unified Multimodal ECG Suite for Dual Signal-Image Modeling and Understanding cs.LG · 2025 · author #11
  19. Physically Plausible Human-Object Rendering from Sparse Views via 3D Gaussian Splatting cs.GR · 2025 · author #4
  20. HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face cs.CL · 2023 · author #6
  21. Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference cs.CL · 2019 · author #4
  22. Weak Supervision Enhanced Generative Network for Question Generation cs.CL · 2019 · author #6
  23. KCAT: A Knowledge-Constraint Typing Annotation Tool cs.AI · 2019 · author #5
  24. ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering cs.CV · 2019 · author #6
  25. Posterior-regularized REINFORCE for Instance Selection in Distant Supervision cs.CL · 2019 · author #6
  26. Improving Distantly-supervised Entity Typing with Compact Latent Space Clustering cs.CL · 2019 · author #6
  27. What Makes a Good Team? A Large-scale Study on the Effect of Team Composition in Honor of Kings cs.CY · 2019 · author #5
  28. Cross-relation Cross-bag Attention for Distantly-supervised Relation Extraction cs.CL · 2018 · author #5
  29. To Stay or to Leave: Churn Prediction for Urban Migrants in the Initial Period cs.SI · 2018 · author #5
  30. Representation Learning for Scale-free Networks cs.SI · 2017 · author #5
  31. Deeply-Learned Part-Aligned Representations for Person Re-Identification cs.CV · 2017 · author #4
  32. Video Question Answering via Attribute-Augmented Attention Network Learning cs.CV · 2017 · author #6
  33. Urban Dreams of Migrants: A Case Study of Migrant Integration in Shanghai cs.CY · 2017 · author #5
  34. Zero-Shot Recognition using Dual Visual-Semantic Mapping Paths cs.CV · 2017 · author #5
  35. Task-driven Visual Saliency and Attention-based Visual Question Answering cs.CV · 2017 · author #4
  36. Deep Learning Driven Visual Path Prediction from a Single Image cs.CV · 2016 · author #8
  37. Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning cs.CV · 2015 · author #5
  38. DeepSaliency: Multi-Task Deep Neural Network Model for Salient Object Detection cs.CV · 2015 · author #6
  39. Online Metric-Weighted Linear Representations for Robust Visual Tracking cs.CV · 2015 · author #5
  40. Metric Learning Driven Multi-Task Structured Output Optimization for Robust Keypoint Tracking cs.CV · 2014 · author #5
  41. Local and global approaches of affinity propagation clustering for large scale data cs.LG · 2009 · author #4

Mentions

  • 2605.30011 #12 · arxiv_oai · confidence 0.70 Yueting Zhuang
  • 2508.05614 #11 · arxiv_oai · confidence 0.70 Yueting Zhuang
  • 2605.27186 #8 · arxiv_oai · confidence 0.70 Yueting Zhuang
  • 2605.26102 #8 · arxiv_oai · confidence 0.70 Yueting Zhuang
  • 2605.18621 #7 · arxiv_oai · confidence 0.70 Yueting Zhuang
  • 2604.02268 #9 · arxiv_oai · confidence 0.70 Yueting Zhuang
  • 0910.1650 #4 · backfill · confidence 0.70 Yueting Zhuang

Frequent Coauthors