Haitao Mi

Identifiers

  • name variant Haitao Mi 0.60 · backfill

Papers (23)

  1. Learning to Build the Environment: Self-Evolving Reasoning RL via Verifiable Environment Synthesis cs.AI · 2026 · author #6
  2. DeltaRubric: Generative Multimodal Reward Modeling via Joint Planning and Verification cs.CL · 2026 · author #7
  3. Reinforcing Multimodal Reasoning Against Visual Degradation cs.CV · 2026 · author #7
  4. Measure Twice, Click Once: Co-evolving Proposer and Visual Critic via Reinforcement Learning for GUI Grounding cs.LG · 2026 · author #6
  5. Too Correct to Learn: Reinforcement Learning on Saturated Reasoning Data cs.LG · 2026 · author #5
  6. Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration cs.AI · 2026 · author #7
  7. Verified Critical Step Optimization for LLM Agents cs.CL · 2026 · author #7
  8. Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification cs.AI · 2026 · author #6
  9. WebAggregator: Enhancing Compositional Reasoning Capabilities of Deep Research Agent Foundation Models cs.CL · 2025 · author #11
  10. Self-Rewarding Vision-Language Model via Reasoning Decomposition cs.CV · 2025 · author #10
  11. R-Zero: Self-Evolving Reasoning LLM from Zero Data cs.LG · 2025 · author #8
  12. Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training cs.AI · 2025 · author #18
  13. DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning cs.CL · 2025 · author #14
  14. Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs cs.CL · 2024 · author #13
  15. Scaling Synthetic Data Creation with 1,000,000,000 Personas cs.CL · 2024 · author #5
  16. Multi-Perspective Context Matching for Machine Comprehension cs.CL · 2016 · author #2
  17. Temporal Attention Model for Neural Machine Translation cs.CL · 2016 · author #2
  18. Supervised Attentions for Neural Machine Translation cs.CL · 2016 · author #1
  19. Sense Embedding Learning for Word Sense Induction cs.CL · 2016 · author #3
  20. Vocabulary Manipulation for Neural Machine Translation cs.CL · 2016 · author #1
  21. Coverage Embedding Models for Neural Machine Translation cs.CL · 2016 · author #1
  22. Sentence Similarity Learning by Lexical Decomposition and Composition cs.CL · 2016 · author #2
  23. Semi-supervised Clustering for Short Text via Deep Representation Learning cs.CL · 2016 · author #2

Mentions

  • 2504.11456 #14 · arxiv_oai · confidence 0.70 Haitao Mi
  • 2406.20094 #5 · arxiv_oai · confidence 0.70 Haitao Mi

Frequent Coauthors