Title resolution pending

Gyeong-In Yu, Joo Seong Jeong, Geon-Woo Kim, Soojeong Kim, Byung-Gon Chun · 2022

2 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

KVDrive: A Holistic Multi-Tier KV Cache Management System for Long-Context LLM Inference

cs.CL · 2026-05-18 · unverdicted · novelty 6.0

KVDrive introduces a multi-tier KV cache management system that achieves up to 1.74x higher throughput for long-context LLM inference through adaptive cache placement, pipeline restructuring, and cross-tier coordination while preserving accuracy.

Performance Isolation and Semantic Determinism in Efficient GPU Spatial Sharing

cs.DC · 2026-03-16 · unverdicted · novelty 6.0

CoGPU resolves the tradeoff in GPU sharing by introducing GPU coroutines for semantic-preserving resource migration, delivering up to 79.2% higher training throughput and zero token mismatch in inference.

citing papers explorer

Showing 2 of 2 citing papers.

KVDrive: A Holistic Multi-Tier KV Cache Management System for Long-Context LLM Inference
cs.CL · 2026-05-18 · unverdicted · none · ref 39
Performance Isolation and Semantic Determinism in Efficient GPU Spatial Sharing
cs.DC · 2026-03-16 · unverdicted · none · ref 78
