Haodong Duan — Pith Author Registry

Identifiers

name variant Haodong Duan 0.60 · backfill

Papers (19)

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games cs.CV · 2026 · author #4
Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields cs.AI · 2026 · author #22
Can Retrieval Heads See Images? Multimodal Retrieval Heads in Long-Context Vision-Language Models cs.CV · 2026 · author #10
OpenCompass: A Universal Evaluation Platform for Large Language Models cs.CL · 2026 · author #3
Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context cs.CV · 2026 · author #3
WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation cs.CL · 2026 · author #12
Forge: Quality-Aware Reinforcement Learning for NP-Hard Optimization in LLMs cs.AI · 2026 · author #6
OPT-BENCH: Evaluating the Iterative Self-Optimization of LLM Agents in Large-Scale Search Spaces cs.AI · 2026 · author #5
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency cs.CV · 2025 · author #34
MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence cs.CV · 2025 · author #9
Visual-RFT: Visual Reinforcement Fine-Tuning cs.CV · 2025 · author #6
Human Cognitive Benchmarks Reveal Foundational Visual Gaps in MLLMs cs.CV · 2025 · author #10
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output cs.CV · 2024 · author #8
Are We on the Right Way for Evaluating Large Vision-Language Models? cs.CV · 2024 · author #7
InternLM2 Technical Report cs.CL · 2024 · author #12
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model cs.CV · 2024 · author #9
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition cs.CV · 2023 · author #8
MMBench: Is Your Multi-modal Model an All-around Player? cs.CV · 2023 · author #2
SRPGAN: Perceptual Generative Adversarial Network for Single Image Super Resolution cs.CV · 2017 · author #2

Mentions

2606.19338 #4 · arxiv_oai · confidence 0.70 Haodong Duan
2606.11042 #22 · arxiv_oai · confidence 0.70 Haodong Duan
2605.27243 #10 · arxiv_oai · confidence 0.70 Haodong Duan
2505.23764 #9 · arxiv_oai · confidence 0.70 Haodong Duan
2605.19276 #3 · arxiv_oai · confidence 0.70 Haodong Duan
2309.15112 #8 · arxiv_oai · confidence 0.70 Haodong Duan
2407.03320 #8 · arxiv_oai · confidence 0.70 Haodong Duan
2401.16420 #9 · arxiv_oai · confidence 0.70 Haodong Duan

Frequent Coauthors

Dahua Lin 11 shared papers
Kai Chen 10 shared papers
Jiaqi Wang 9 shared papers
Songyang Zhang 7 shared papers
Yuhang Zang 7 shared papers
Conghui He 6 shared papers
Xiaoyi Dong 6 shared papers
Yu Qiao 6 shared papers
Pan Zhang 5 shared papers
Wenwei Zhang 5 shared papers
Bin Wang 4 shared papers
Hang Yan 4 shared papers
Jingwen Li 4 shared papers
Linke Ouyang 4 shared papers
Maosong Cao 4 shared papers
Shengyuan Ding 4 shared papers
Wei Li 4 shared papers
Xingcheng Zhang 4 shared papers
Xinyue Zhang 4 shared papers
Xinyu Fang 4 shared papers