pith. sign in

Jiayu Liu

Identifiers

  • name variant Jiayu Liu 0.60 · backfill

Papers (21)

  1. PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems cs.AI · 2026 · author #1
  2. BioInsight: Multi-Agent Orchestration for Interactive Biomedical Knowledge Discovery cs.AI · 2026 · author #7
  3. AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints cs.CL · 2026 · author #1
  4. Brick-Composer: Using MLLMs for Assembly with Diverse Bricks cs.AI · 2026 · author #7
  5. SocraticPO: Policy Optimization via Interactive Guidance cs.LG · 2026 · author #5
  6. $\Psi$-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues cs.LG · 2026 · author #3
  7. MemGuard: Preventing Memory Contamination in Long-Term Memory-Augmented Large Language Models cs.CL · 2026 · author #4
  8. UserHarness: Harnessing User Minds for Stronger Agent Theory-of-Mind cs.CL · 2026 · author #2
  9. Advancing Creative Physical Intelligence in Large Multimodal Models cs.AI · 2026 · author #3
  10. Termination-Dependent Surface States and Magnetic Fingerprints of Chiral Helimagnet Cr1/3TaS2 cond-mat.mtrl-sci · 2026 · author #9
  11. CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing cs.AI · 2026 · author #3
  12. Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration? cs.CL · 2026 · author #5
  13. Step-Level Sparse Autoencoder for Reasoning Process Interpretation cs.LG · 2026 · author #2
  14. NOVA: NOise-aware Verbal Confidence CAlibration for Robust Large Language Models in RAG Systems cs.CL · 2026 · author #1
  15. Linear Dynamics in the RLVR Training of Large Language Models cs.LG · 2026 · author #2
  16. CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents cs.AI · 2025 · author #1
  17. Deep Thinking by Markov Chain of Continuous Thoughts cs.LG · 2025 · author #1
  18. Pruning Long Chain-of-Thought of Large Reasoning Models via Small-Scale Preference Optimization cs.AI · 2025 · author #2
  19. Rethinking Prospect Theory for LLMs: Revealing the Instability of Decision-Making under Epistemic Uncertainty cs.AI · 2025 · author #3
  20. Revisiting Epistemic Markers in Confidence Estimation: Can Markers Accurately Reflect Large Language Models' Uncertainty? cs.CL · 2025 · author #1
  21. Machine learning of phase transitions in the percolation and XY models cond-mat.stat-mech · 2018 · author #2

Mentions

  • 2511.02734 #1 · arxiv_oai · confidence 0.70 Jiayu Liu
  • 2606.22388 #1 · arxiv_oai · confidence 0.70 Jiayu Liu
  • 2606.20997 #7 · arxiv_oai · confidence 0.70 Jiayu Liu
  • 2601.11004 #1 · arxiv_oai · confidence 0.70 Jiayu Liu
  • 2606.09887 #5 · arxiv_oai · confidence 0.70 Jiayu Liu
  • 2606.05622 #1 · arxiv_oai · confidence 0.70 Jiayu Liu
  • 2606.05445 #7 · arxiv_oai · confidence 0.70 Jiayu Liu
  • 2606.02754 #3 · arxiv_oai · confidence 0.70 Jiayu Liu
  • 2603.03202 #5 · arxiv_oai · confidence 0.70 Jiayu Liu
  • 2603.03031 #2 · arxiv_oai · confidence 0.70 Jiayu Liu
  • 2605.28009 #4 · arxiv_oai · confidence 0.70 Jiayu Liu
  • 2605.27721 #2 · arxiv_oai · confidence 0.70 Jiayu Liu
  • 2605.26396 #3 · arxiv_oai · confidence 0.70 Jiayu Liu
  • 2605.24415 #9 · arxiv_oai · confidence 0.70 Jiayu Liu
  • 2601.04537 #2 · arxiv_oai · confidence 0.70 Jiayu Liu

Frequent Coauthors