pith. machine review for the scientific record. sign in

Jian Luan

Identifiers

No identifiers captured yet.

Papers (19)

  1. PROVE: A Perceptual RemOVal cohErence Benchmark for Visual Media cs.CV · 2026 · author #9
  2. Beyond Binary: Reframing GUI Critique as Continuous Semantic Alignment cs.LG · 2026 · author #8
  3. How Mobile World Model Guides GUI Agents? cs.AI · 2026 · author #11
  4. Reducing Linguistic Hallucination in LM-Based Speech Enhancement via Noise-Invariant Acoustic-Semantic Distillation eess.AS · 2026 · author #8
  5. Listening with Time: Precise Temporal Awareness for Long-Form Audio Understanding eess.AS · 2026 · author #8
  6. TTS-PRISM: A Perceptual Reasoning and Interpretable Speech Model for Fine-Grained Diagnosis cs.CL · 2026 · author #10
  7. ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling cs.MM · 2026 · author #13
  8. Doc-V*:Coarse-to-Fine Interactive Visual Reasoning for Multi-Page Document VQA cs.CL · 2026 · author #10
  9. Q-Mask: Query-driven Causal Masks for Text Anchoring in OCR-Oriented Vision-Language Models cs.CV · 2026 · author #10
  10. Borderless Long Speech Synthesis cs.SD · 2026 · author #15
  11. From Ideal to Real: Stable Video Object Removal under Imperfect Conditions cs.CV · 2026 · author #7
  12. Visual Para-Thinker: Divide-and-Conquer Reasoning for Visual Comprehension cs.CV · 2026 · author #8
  13. Video-OPD: Efficient Post-Training of Multimodal Large Language Models for Temporal Video Grounding via On-Policy Distillation cs.CV · 2026 · author #9
  14. Restoring Exploration after Post-Training: Latent Exploration Decoding for Large Reasoning Models cs.CL · 2026 · author #9
  15. REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding cs.CV · 2025 · author #10
  16. Revisiting Entropy in Reinforcement Learning for Large Reasoning Models cs.CL · 2025 · author #8
  17. MECAT: A Multi-Experts Constructed Benchmark for Fine-Grained Audio Understanding Tasks eess.AS · 2025 · author #10
  18. Mobile GUI Agents under Real-world Threats: Are We There Yet? cs.CR · 2025 · author #7
  19. End-to-End Optimization of LLM-Driven Multi-Agent Search Systems via Heterogeneous-Group-Based Reinforcement Learning cs.LG · 2025 · author #5

Mentions

No mention provenance yet.

Frequent Coauthors