RadialRouter: Structured representation for efficient and robust large language models routing. arXiv preprint arXiv:2506.03880, 2025

Jin, R · 2025 · arXiv 2506.03880

2 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

TACO: Tool-Augmented Credit Optimization for Agentic Tool Use

cs.MA · 2026-06-29 · unverdicted · novelty 6.0

TACO combines Differential Answer-Probe Reward (DAPR) and Outcome-Gated Advantage Routing (OGAR) to assign credit to tool calls in agentic visual reasoning, producing accuracy gains on multimodal benchmarks.

SeqRoute: Global Budget-Aware Sequential LLM Routing via Offline Reinforcement Learning

cs.LG · 2026-05-25 · unverdicted · novelty 6.0

SeqRoute applies offline RL with CQL and Hindsight Budget Relabeling to sequential LLM routing under global budgets, claiming 6.0-73.5% cost reduction, maintained or improved quality, and under 1% bankruptcy rate.

citing papers explorer


SeqRoute: Global Budget-Aware Sequential LLM Routing via Offline Reinforcement Learning

cs.LG · 2026-05-25 · unverdicted · none · ref 11
