In Proceedings of the International Conference on Distributed Artificial Intelligence (DAI)

Beyond GPT-5: Making LLMs cheaper, better via performance-efficiency optimized routing · arXiv 2508.12631

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

HyDRA: Hybrid Dynamic Routing Architecture for Heterogeneous LLM Pools

cs.CL · 2026-05-16 · unverdicted · novelty 6.0

HyDRA routes queries to cost-effective LLMs by predicting multi-dimensional capability requirements with a multi-head encoder and applying shortfall matching against configuration-defined model profiles, delivering up to 72.5 percent cost savings on coding benchmarks while remaining decoupled from specific ...

citing papers explorer


HyDRA: Hybrid Dynamic Routing Architecture for Heterogeneous LLM Pools cs.CL · 2026-05-16 · unverdicted · none · ref 19
