Efficient Memory Management for Large Language Model Serving with PagedAttention

2 Pith papers cite this work. Polarity classification is still indexing.

fields: cs.AR (1), cs.CL (1)

years: 2025 (1), 2023 (1)

representative citing papers

A Survey of Large Language Models

cs.CL · 2023-03-31 · accept · novelty 3.0

This survey reviews the background, key techniques, and evaluation methods for large language models, emphasizing emergent abilities that appear at large scales.

citing papers explorer

  • MIST: A Co-Design Framework for Heterogeneous, Multi-Stage LLM Inference
    cs.AR · 2025-04-14 · unverdicted · none · ref 35

    MIST is a new simulator for heterogeneous multi-stage LLM inference that combines hardware traces with analytical models to explore configuration trade-offs in hybrid CPU-accelerator systems.

  • A Survey of Large Language Models
    cs.CL · 2023-03-31 · accept · none · ref 212

    This survey reviews the background, key techniques, and evaluation methods for large language models, emphasizing emergent abilities that appear at large scales.