Cost-Effective Online Multi-LLM Selection with Versatile Reward Models

2 Pith papers cite this work. Polarity classification is still indexing.

fields: cs.DB (1), cs.LG (1)

years: 2026 (2)

representative citing papers

Continuous Semantic Caching for Low-Cost LLM Serving

cs.LG · 2026-04-21 · unverdicted · novelty 7.0

Establishes the first rigorous framework for continuous semantic caching of LLM responses using ε-net discretization and kernel ridge regression, with sublinear regret bounds.

citing papers explorer

Showing 2 of 2 citing papers.

  • Continuous Semantic Caching for Low-Cost LLM Serving cs.LG · 2026-04-21 · unverdicted · none · ref 7

    Establishes the first rigorous framework for continuous semantic caching of LLM responses using ε-net discretization and kernel ridge regression, with sublinear regret bounds.

  • KRONE: Scalable LLM-Augmented Log Anomaly Detection via Hierarchical Abstraction cs.DB · 2026-02-07 · conditional · none · ref 73

    KRONE derives semantic execution hierarchies from flat logs to enable modular multi-level anomaly detection with hybrid local and nested-aware detectors plus limited LLM use, delivering 10% F1 gains and over 100x data efficiency on benchmarks and industrial data.