ThriftLLM: On cost-effective selection of large language models for classification queries

2025 · arXiv 2501.04901

4 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Adaptive Graph Refinement and Label Propagation with LLMs for Cost-Effective Entity Resolution

cs.CL · 2026-05-25 · unverdicted · novelty 6.0

Alper unifies entity resolution matching and clustering into an iterative graph refinement and probabilistic label propagation process that adaptively selects LLM queries via a budgeted greedy optimization to outperform cascaded pipelines on eight benchmarks.

Policy-Governed LLM Routing with Intent Matching for Instrument Laboratories

cs.CY · 2026-04-03 · conditional · novelty 6.0

A governed LLM routing system for lab tutoring raises challenge-alignment from 0.90 to 0.98, boosts productive-struggle time, and cuts token costs by two-thirds while preserving answer accuracy.

Semantic Data Processing with Holistic Data Understanding

cs.DB · 2026-04-03 · unverdicted · novelty 6.0

HoldUp uses LLM-guided clustering to provide holistic dataset context for semantic operators, yielding up to 33% higher classification accuracy and 30% higher scoring accuracy than row-by-row LLM processing across 15 datasets.

Online LLM Selection via Constrained Bandits with Time-Varying Demand

cs.LG · 2026-06-16 · unverdicted · novelty 5.0

Develops a constrained bandit algorithm for online LLM selection under packing and covering constraints with time-varying demand, claiming sublinear regret and constraint violations versus an offline full-information benchmark.

citing papers explorer

Adaptive Graph Refinement and Label Propagation with LLMs for Cost-Effective Entity Resolution
cs.CL · 2026-05-25 · unverdicted · none · ref 25

Policy-Governed LLM Routing with Intent Matching for Instrument Laboratories
cs.CY · 2026-04-03 · conditional · none · ref 10

Semantic Data Processing with Holistic Data Understanding
cs.DB · 2026-04-03 · unverdicted · none · ref 25

Online LLM Selection via Constrained Bandits with Time-Varying Demand
cs.LG · 2026-06-16 · unverdicted · none · ref 13
