pith. sign in

Dart: Diffusion-inspired speculative decoding for fast llm inference

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

citation-role summary

background 3

citation-polarity summary

fields

cs.CL 4 cs.LG 2

years

2026 6

verdicts

UNVERDICTED 6

roles

background 3

polarities

background 3

representative citing papers

SpecBlock: Block-Iterative Speculative Decoding with Dynamic Tree Drafting

cs.CL · 2026-05-08 · unverdicted · novelty 7.0 · 2 refs

SpecBlock achieves 8-13% higher mean speedup than EAGLE-3 at 44-52% drafting cost via block-iterative drafting with hidden-state inheritance, dynamic rank-head branching, valid-prefix masking, and optional cost-aware bandit adaptation.

Accelerating Speculative Decoding with Block Diffusion Draft Trees

cs.CL · 2026-04-14 · unverdicted · novelty 6.0

DDTree builds a draft tree from a block diffusion drafter using a best-first heap on its output probabilities and verifies the tree in one target-model pass via an ancestor-only attention mask, increasing average accepted tokens per round.

citing papers explorer

Showing 6 of 6 citing papers.