pith. sign in

Guangtao Zhai

Identifiers

  • name variant Guangtao Zhai 0.60 · backfill

Papers (39)

  1. DroneIQA-VLE: Multi-Task Drone Image Quality Assessment via Vision-Language Ensemble cs.CV · 2026 · author #6
  2. Beyond Single Character: Evaluating MLLMs for Sentence-Level Oracle Bone Inscription Understanding cs.CV · 2026 · author #4
  3. LatentRevise: Learning from Zero-Hit Reasoning cs.CL · 2026 · author #4
  4. LEIQ-Assessor: Multi-dimensional Quality Assessment of Low-light Enhanced Images via Multi-task Learning cs.CV · 2026 · author #7
  5. In-context Region-based Drag: Drag Any Region to Any Shape cs.CV · 2026 · author #5
  6. RoboProcessBench: Benchmarking Process-Aware Understanding in Vision-Language Robotic Manipulation cs.RO · 2026 · author #10
  7. Emergent Misalignment Can Be Induced by Sycophancy and Reversed via Alignment Gating cs.CL · 2026 · author #9
  8. Towards Characterizing Scientific Image Utility and Upgradability cs.CV · 2026 · author #9
  9. LL-Bench: Rethinking Low-Level Vision Evaluation in the Era of Large-Scale Generative Models cs.CV · 2026 · author #8
  10. ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research cs.LG · 2026 · author #47
  11. DyCoRM: Dynamic Criterion-Aware Reward Modeling for Text-to-Image Generation cs.CV · 2026 · author #7
  12. Efficient One-Step Diffusion Restoration Model with Compact Token Compression and Linear Attention cs.CV · 2026 · author #5
  13. Sketch Then Paint: Hierarchical Reinforcement Learning for Diffusion Multi-Modal Large Language Models cs.AI · 2026 · author #10
  14. PVRF: All-in-one Adverse Weather Removal via Prior-modulated and Velocity-constrained Rectified Flow cs.CV · 2026 · author #7
  15. GeoR-Bench: Evaluating Geoscience Visual Reasoning cs.CV · 2026 · author #10
  16. ReasonEdit: Towards Interpretable Image Editing Evaluation via Reinforcement Learning cs.CV · 2026 · author #6
  17. EditRefiner: A Human-Aligned Agentic Framework for Image Editing Refinement cs.CV · 2026 · author #12
  18. DynT2I-Eval: A Dynamic Evaluation Framework for Text-to-Image Models cs.CV · 2026 · author #5
  19. LoViF 2026 The First Challenge on Holistic Quality Assessment for 4D World Model (PhyScore) cs.CV · 2026 · author #18
  20. MirrorBench: Evaluating Self-centric Intelligence in MLLMs by Introducing a Mirror cs.AI · 2026 · author #6
  21. DPC-VQA: Decoupling Quality Perception and Residual Calibration for Video Quality Assessment cs.CV · 2026 · author #6
  22. AT-ADD: All-Type Audio Deepfake Detection Challenge Evaluation Plan cs.SD · 2026 · author #13
  23. ITIScore: An Image-to-Text-to-Image Rating Framework for the Image Captioning Ability of MLLMs cs.CV · 2026 · author #8
  24. UniDial-EvalKit: A Unified Toolkit for Evaluating Multi-Faceted Conversational Abilities cs.CL · 2026 · author #10
  25. FUMO: Prior-Modulated Diffusion for Single Image Reflection Removal cs.CV · 2026 · author #3
  26. SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond cs.LG · 2026 · author #17
  27. ELIQ: A Label-Free Framework for Quality Assessment of Evolving AI-Generated Images cs.CV · 2026 · author #8
  28. EEmo-Logic: A Unified Dataset and Multi-Stage Framework for Comprehensive Image-Evoked Emotion Assessment cs.CV · 2026 · author #6
  29. Robust Mesh Saliency Ground Truth Acquisition in VR via View Cone Sampling and Manifold Diffusion cs.CV · 2026 · author #10
  30. Generalizable Video Quality Assessment via Weak-to-Strong Learning cs.CV · 2025 · author #8
  31. Q-Agent: Quality-Driven Chain-of-Thought Image Restoration Agent through Robust Multimodal Large Language Model eess.IV · 2025 · author #10
  32. Subjective and Objective Quality-of-Experience Evaluation Study for Live Video Streaming cs.MM · 2024 · author #10
  33. Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels cs.CV · 2023 · author #13
  34. An Algorithm for Transmitting VR Video Based on Adaptive Modulation cs.NI · 2019 · author #3
  35. Adversarial Attacks against Deep Saliency Models cs.CV · 2019 · author #3
  36. Quality Assessment of Free-viewpoint Videos by Quantifying the Elastic Changes of Multi-Scale Motion Trajectories cs.MM · 2019 · author #5
  37. Invariance Analysis of Saliency Models versus Human Gaze During Scene Free Viewing cs.CV · 2018 · author #3
  38. Terahertz Security Image Quality Assessment by No-reference Model Observers cs.CV · 2017 · author #3
  39. Temporal Psychovisual Modulation: a new paradigm of information display cs.ET · 2012 · author #2

Mentions

  • 2607.00416 #6 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 2603.19036 #3 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 2606.31169 #4 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 2606.29938 #4 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 2606.29752 #7 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 2606.25907 #5 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 2606.13040 #10 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 2606.09068 #9 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 2606.07591 #47 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 2606.03401 #9 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 2606.02535 #8 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 2603.23160 #10 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 2605.25876 #7 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 2602.01173 #6 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 2505.03631 #8 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 2605.23451 #5 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 1201.1601 #2 · backfill · confidence 0.70 Guangtao Zhai
  • 2605.16842 #10 · arxiv_oai · confidence 0.70 Guangtao Zhai
  • 2312.17090 #13 · arxiv_oai · confidence 0.70 Guangtao Zhai

Frequent Coauthors