Tool documentation enabl es zero-shot tool-usage with large language models

Cheng-Y u Hsieh, Si-An Chen, Chun-Liang Li, Y asuhisa Fujii, Alexander Ratner, Chen-Y u Lee, Ranjay Krishna, Tomas Pﬁster · 2023 · arXiv 2308.00675

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

read on arXiv browse 6 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Both Ends Count! Just How Good are LLM Agents at "Text-to-Big SQL"?

cs.DB · 2026-02-25 · unverdicted · novelty 7.0

New Text-to-Big SQL metrics show that LLM agents must balance accuracy with cost and speed at scale, where GPT-4o trades some accuracy for up to 12x speedup and GPT-5.2 proves more cost-effective than Gemini 3 Pro on large inputs.

From REST to MCP: An Empirical Study of API Wrapping and Automated Server Generation for LLM Agents

cs.SE · 2025-07-21 · unverdicted · novelty 7.0

First large-scale empirical analysis of MCP server construction shows predominant REST wrapping with low operation exposure, plus an AutoMCP pipeline that improves automated generation success and reduces tool complexity.

Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments

cs.CL · 2025-08-12 · unverdicted · novelty 5.0 · 2 refs

An automated environment construction pipeline plus verifiable rewards enables RL training that improves LLM tool-use performance across scales without harming general capabilities.

Agentic Reasoning for Large Language Models

cs.AI · 2026-01-18 · unverdicted · novelty 4.0

The survey structures agentic reasoning for LLMs into foundational, self-evolving, and collective multi-agent layers while distinguishing in-context orchestration from post-training optimization and reviewing applications across domains.

Bridging Language Models and Financial Analysis

q-fin.ST · 2025-03-14 · unverdicted · novelty 2.0

A survey synthesizing recent LLM research and assessing its applicability to financial data analysis.

A Comprehensive Overview of Large Language Models

cs.CL · 2023-07-12 · unverdicted · novelty 2.0

A survey paper providing an overview of Large Language Models, their background, and recent advances in the field.

citing papers explorer

Showing 6 of 6 citing papers.

Both Ends Count! Just How Good are LLM Agents at "Text-to-Big SQL"? cs.DB · 2026-02-25 · unverdicted · none · ref 24
New Text-to-Big SQL metrics show that LLM agents must balance accuracy with cost and speed at scale, where GPT-4o trades some accuracy for up to 12x speedup and GPT-5.2 proves more cost-effective than Gemini 3 Pro on large inputs.
From REST to MCP: An Empirical Study of API Wrapping and Automated Server Generation for LLM Agents cs.SE · 2025-07-21 · unverdicted · none · ref 16
First large-scale empirical analysis of MCP server construction shows predominant REST wrapping with low operation exposure, plus an AutoMCP pipeline that improves automated generation success and reduces tool complexity.
Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments cs.CL · 2025-08-12 · unverdicted · none · ref 16 · 2 links
An automated environment construction pipeline plus verifiable rewards enables RL training that improves LLM tool-use performance across scales without harming general capabilities.
Agentic Reasoning for Large Language Models cs.AI · 2026-01-18 · unverdicted · none · ref 215
The survey structures agentic reasoning for LLMs into foundational, self-evolving, and collective multi-agent layers while distinguishing in-context orchestration from post-training optimization and reviewing applications across domains.
Bridging Language Models and Financial Analysis q-fin.ST · 2025-03-14 · unverdicted · none · ref 38
A survey synthesizing recent LLM research and assessing its applicability to financial data analysis.
A Comprehensive Overview of Large Language Models cs.CL · 2023-07-12 · unverdicted · none · ref 218
A survey paper providing an overview of Large Language Models, their background, and recent advances in the field.

Tool documentation enabl es zero-shot tool-usage with large language models

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer