Do we still need io schedulers for low-latency disks?,

· 2023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

DUAL-BLADE: Dual-Path NVMe-Direct KV-Cache Offloading for Edge LLM Inference

cs.DC · 2026-04-29 · unverdicted · novelty 6.0

DUAL-BLADE uses a dual-path KV-cache framework with NVMe-direct access to reduce prefill and decode latency by up to 33% and 42% while improving SSD utilization 2.2x under tight memory budgets.

citing papers explorer

Showing 1 of 1 citing paper.

DUAL-BLADE: Dual-Path NVMe-Direct KV-Cache Offloading for Edge LLM Inference cs.DC · 2026-04-29 · unverdicted · none · ref 36
DUAL-BLADE uses a dual-path KV-cache framework with NVMe-direct access to reduce prefill and decode latency by up to 33% and 42% while improving SSD utilization 2.2x under tight memory budgets.

Do we still need io schedulers for low-latency disks?,

fields

years

verdicts

representative citing papers

citing papers explorer