Zhiyuan Zhao

Identifiers

  • name variant Zhiyuan Zhao 0.60 · backfill

Papers (9)

  1. Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models · cs.CV · 2026 · author #6
  2. Towards Realistic Open-Vocabulary Remote Sensing Segmentation: Benchmark and Baseline · cs.CV · 2026 · author #5
  3. MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale · cs.CV · 2026 · author #5
  4. HunyuanImage 3.0 Technical Report · cs.CV · 2025 · author #71
  5. MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing · cs.CV · 2025 · author #6
  6. A Visual Reinforcement Learning-Based Separate Primitive Policy for Peg-in-Hole Tasks · cs.RO · 2025 · author #5
  7. MinerU: An Open-Source Solution for Precise Document Content Extraction · cs.CV · 2024 · author #6
  8. Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization · cs.CV · 2023 · author #1
  9. InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition · cs.CV · 2023 · author #7

Mentions

  • 2504.14820 #5 · arxiv_oai · confidence 0.70 Zhiyuan Zhao
  • 2309.15112 #7 · arxiv_oai · confidence 0.70 Zhiyuan Zhao
  • 2509.22186 #6 · arxiv_oai · confidence 0.70 Zhiyuan Zhao
  • 2311.16839 #1 · arxiv_oai · confidence 0.70 Zhiyuan Zhao
  • 2409.18839 #6 · arxiv_oai · confidence 0.70 Zhiyuan Zhao
  • 2509.23951 #71 · arxiv_oai · confidence 0.70 Zhiyuan Zhao

Frequent Coauthors