FunReason : Enhancing large language models' function calling via self-refinement multiscale loss and automated data refinement

Bingguang Hao et al · 2025 · arXiv 2505.20192

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

RubricRefine: Improving Tool-Use Agent Reliability with Training-Free Pre-Execution Refinement

cs.LG · 2026-05-10 · unverdicted · novelty 7.0 · 3 refs

RubricRefine is a training-free pre-execution method that creates rubrics to score and fix inter-tool contract violations in agent code, reaching 0.86 average on M3ToolEval across seven models with zero executions and lower latency.

Failure Makes the Agent Stronger: Enhancing Accuracy through Structured Reflection for Reliable Tool Interactions

cs.CV · 2025-09-23 · unverdicted · novelty 5.0

Structured reflection makes error diagnosis and repair an explicit trainable step that improves reliability and reduces redundant calls in tool-using LLM agents.

Data-Driven Function Calling Improvements in Large Language Model for Online Financial QA

cs.IR · 2026-04-07 · unverdicted · novelty 3.0

A pipeline of dataset construction from prior work, AugFC parameter augmentation, and two-step LLM training improves function calling for financial APIs and is running in production.

R2IF: Aligning Reasoning with Decisions via Composite Rewards for Interpretable LLM Function Calling

cs.LG · 2026-04-22

citing papers explorer

Showing 1 of 1 citing paper after filters.

R2IF: Aligning Reasoning with Decisions via Composite Rewards for Interpretable LLM Function Calling cs.LG · 2026-04-22 · unreviewed · ref 34

FunReason : Enhancing large language models' function calling via self-refinement multiscale loss and automated data refinement

fields

years

verdicts

representative citing papers

citing papers explorer