2025.PIM Is All You Need: A CXL-Enabled GPU-Free System for Large Language Model Inference

Xin Tan, Yimin Jiang, Yitao Yang, Hong Xu · 2025 · DOI 10.1145/3676641

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Idleness is Relative: Exploiting Tool-Call Idle Windows for Offloading in Agentic Systems with MORI

cs.OS · 2026-05-30 · unverdicted · novelty 6.0

MORI improves throughput 20-71% and TTFT 18-43% over baselines by ranking programs on a continuous idleness spectrum and shifting the GPU-CPU boundary to match capacity in agentic LLM serving.

TokenStack: A Heterogeneous HBM-PIM Architecture and Runtime for Efficient LLM Inference

cs.AR · 2026-05-07 · unverdicted · novelty 6.0

TokenStack's heterogeneous HBM-PIM design with base-die control and topology-aware KV placement delivers 1.62x higher geometric-mean token throughput and 1.70x SLO-compliant serving capacity than AttAcc while cutting per-token energy by 30-47%.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Idleness is Relative: Exploiting Tool-Call Idle Windows for Offloading in Agentic Systems with MORI cs.OS · 2026-05-30 · unverdicted · none · ref 56
MORI improves throughput 20-71% and TTFT 18-43% over baselines by ranking programs on a continuous idleness spectrum and shifting the GPU-CPU boundary to match capacity in agentic LLM serving.
TokenStack: A Heterogeneous HBM-PIM Architecture and Runtime for Efficient LLM Inference cs.AR · 2026-05-07 · unverdicted · none · ref 11
TokenStack's heterogeneous HBM-PIM design with base-die control and topology-aware KV placement delivers 1.62x higher geometric-mean token throughput and 1.70x SLO-compliant serving capacity than AttAcc while cutting per-token energy by 30-47%.

2025.PIM Is All You Need: A CXL-Enabled GPU-Free System for Large Language Model Inference

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer