IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory

IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory · 2025 · arXiv 2506.01048

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

SWE-Router: Routing in Multi-turn Agentic Software Engineering Tasks

cs.SE · 2026-06-30 · unverdicted · novelty 6.0

SWE-Router introduces trajectory-conditioned value-based routing for LLM agents on SWE tasks, with a Bayes-optimality theorem and empirical cost savings while retaining most strong-model performance.

The Routing Plateau: Understanding and Breaking the Accuracy Limits of LLM Routers

cs.LG · 2026-05-27 · unverdicted · novelty 6.0

Across 21 routing methods and 5 benchmarks, LLM routers converge to similar accuracy, below the oracle, because they learn global performance trends rather than fine-grained query signals.

IR3DE: A Linear Router for Large Language Models

cs.CL · 2026-06-04 · unverdicted · novelty 5.0

IR3DE is a ridge regression router for domain-expert LLMs that matches or exceeds baselines in language modeling and reasoning tasks while allowing dynamic expert addition or removal without retraining.

Measuring Competency, Not Performance: Item-Aware Evaluation Across Medical Benchmarks

cs.CL · 2025-09-29 · conditional · novelty 5.0

MedIRT applies Item Response Theory to medical LLM benchmarks to separate latent competency from item difficulty and discrimination, producing more stable rankings than accuracy alone and revealing domain heterogeneity.

citing papers explorer

Measuring Competency, Not Performance: Item-Aware Evaluation Across Medical Benchmarks cs.CL · 2025-09-29 · conditional · none · ref 28
