Towards uncertainty-aware language agent, 2024

Jiuzhou Han, Wray Buntine, Ehsan Shareghi · 2024 · arXiv 2401.14016

3 Pith papers cite this work.

representative citing papers

Proper Scoring Rules for Agentic Uncertainty Quantification

cs.AI · 2026-05-23 · unverdicted · novelty 7.0

Introduces Trajectory Proper Score (TPS) as a strictly proper family of trajectory-level scoring rules that elicits the complete prefix-conditioned success probability process.

Helicase: Uncertainty-Guided Supply Chain Knowledge Graph Construction with Autonomous Multi-Agent LLMs

cs.AI · 2026-05-26 · unverdicted · novelty 6.0

Helicase proposes an autonomous multi-agent LLM framework for uncertainty-guided supply chain knowledge graph construction evaluated on the new SCQA benchmark of 80 queries.

Strategic Decision Support for AI Agents

cs.AI · 2026-06-10 · unverdicted · novelty 5.0

The paper introduces an optimization framework for AI agents to strategically seek support, proving a threshold policy on support value and providing an online algorithm to control missed-support error without distributional assumptions.

citing papers explorer

Proper Scoring Rules for Agentic Uncertainty Quantification · cs.AI · 2026-05-23 · unverdicted · none · ref 16
Helicase: Uncertainty-Guided Supply Chain Knowledge Graph Construction with Autonomous Multi-Agent LLMs · cs.AI · 2026-05-26 · unverdicted · none · ref 9
Strategic Decision Support for AI Agents · cs.AI · 2026-06-10 · unverdicted · none · ref 29
