ISBN 9798400703867

Association for Computing Machinery · 2025

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

When Hidden States Drift: Can KV Caches Rescue Long-Range Speculative Decoding?

cs.CL · 2026-04-29 · unverdicted · novelty 6.0

KV cache reuse improves long-range draft acceptance in speculative decoding but delivers only marginal end-to-end speedups due to drafter limitations.

Accelerating RL Post-Training Rollouts via System-Integrated Speculative Decoding

cs.LG · 2026-04-29 · unverdicted · novelty 5.0

Speculative decoding integrated into NeMo-RL accelerates synchronous RL rollouts by 1.8x at 8B scale and projects up to 2.5x end-to-end training speedup at 235B scale when combined with asynchronous pipelines.

citing papers explorer

Showing 2 of 2 citing papers.

When Hidden States Drift: Can KV Caches Rescue Long-Range Speculative Decoding? cs.CL · 2026-04-29 · unverdicted · none · ref 9
KV cache reuse improves long-range draft acceptance in speculative decoding but delivers only marginal end-to-end speedups due to drafter limitations.
Accelerating RL Post-Training Rollouts via System-Integrated Speculative Decoding cs.LG · 2026-04-29 · unverdicted · none · ref 1
Speculative decoding integrated into NeMo-RL accelerates synchronous RL rollouts by 1.8x at 8B scale and projects up to 2.5x end-to-end training speedup at 235B scale when combined with asynchronous pipelines.

ISBN 9798400703867

fields

years

verdicts

representative citing papers

citing papers explorer