pith. sign in

Forecasting llm inference performance via hardware-agnostic analytical modeling,

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

fields

cs.DC 2 cs.LG 1

years

2026 3

verdicts

UNVERDICTED 3

clear filters

representative citing papers

Latency Prediction for LLM Inference on NPU Systems

cs.DC · 2026-06-16 · unverdicted · novelty 7.0

LENS predicts NPU LLM inference latency with 2.15% mean error by profiling each bucket with two E2E measurements and composing results to capture bucketing non-linearity.

citing papers explorer

Showing 2 of 2 citing papers after filters.