When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling

2025 · arXiv 2510.15346

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

SpecFed: Accelerating Federated LLM Inference with Speculative Decoding and Compressed Transmission

eess.SP · 2026-04-28 · unverdicted · novelty 5.0

SpecFed accelerates federated LLM inference via speculative decoding for parallel processing and top-K compression with server-side reconstruction, achieving high fidelity with reduced communication overhead.

