Haodong Duan

Identifiers

  • name variant Haodong Duan 0.60 · backfill

Papers (19)

  1. Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games · cs.CV · 2026 · author #4
  2. Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields · cs.AI · 2026 · author #22
  3. Can Retrieval Heads See Images? Multimodal Retrieval Heads in Long-Context Vision-Language Models · cs.CV · 2026 · author #10
  4. OpenCompass: A Universal Evaluation Platform for Large Language Models · cs.CL · 2026 · author #3
  5. Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context · cs.CV · 2026 · author #3
  6. WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation · cs.CL · 2026 · author #12
  7. Forge: Quality-Aware Reinforcement Learning for NP-Hard Optimization in LLMs · cs.AI · 2026 · author #6
  8. OPT-BENCH: Evaluating the Iterative Self-Optimization of LLM Agents in Large-Scale Search Spaces · cs.AI · 2026 · author #5
  9. InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency · cs.CV · 2025 · author #34
  10. MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence · cs.CV · 2025 · author #9
  11. Visual-RFT: Visual Reinforcement Fine-Tuning · cs.CV · 2025 · author #6
  12. Human Cognitive Benchmarks Reveal Foundational Visual Gaps in MLLMs · cs.CV · 2025 · author #10
  13. InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output · cs.CV · 2024 · author #8
  14. Are We on the Right Way for Evaluating Large Vision-Language Models? · cs.CV · 2024 · author #7
  15. InternLM2 Technical Report · cs.CL · 2024 · author #12
  16. InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model · cs.CV · 2024 · author #9
  17. InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition · cs.CV · 2023 · author #8
  18. MMBench: Is Your Multi-modal Model an All-around Player? · cs.CV · 2023 · author #2
  19. SRPGAN: Perceptual Generative Adversarial Network for Single Image Super Resolution · cs.CV · 2017 · author #2

Mentions

  • 2606.19338 #4 · arxiv_oai · confidence 0.70 Haodong Duan
  • 2606.11042 #22 · arxiv_oai · confidence 0.70 Haodong Duan
  • 2605.27243 #10 · arxiv_oai · confidence 0.70 Haodong Duan
  • 2505.23764 #9 · arxiv_oai · confidence 0.70 Haodong Duan
  • 2605.19276 #3 · arxiv_oai · confidence 0.70 Haodong Duan
  • 2309.15112 #8 · arxiv_oai · confidence 0.70 Haodong Duan
  • 2407.03320 #8 · arxiv_oai · confidence 0.70 Haodong Duan
  • 2401.16420 #9 · arxiv_oai · confidence 0.70 Haodong Duan

Frequent Coauthors