pith. sign in
Pith Number

pith:3UNJHOKX

pith:2026:3UNJHOKX3ZIP752Q66AOLWZAQP
not attested not anchored not stored refs pending

Taming Request Imbalance: SLO-Aware Scheduling for Disaggregated LLM Inference

Qipeng Wang, Zhendong Yang

Kairos improves TTFT SLO attainment by up to 24% and decode throughput by 19% in disaggregated LLM inference.

arxiv:2605.02329 v2 · 2026-05-04 · cs.DC

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{3UNJHOKX3ZIP752Q66AOLWZAQP}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

Experimental results demonstrate that, compared with state-of-the-art baselines, Kairos improves TTFT SLO attainment by up to 23.9%, TPOT SLO attainment by up to 27.1%, end-to-end SLO attainment by up to 33.8%, and decode throughput by up to 19.3%.

C2weakest assumption

That the prediction of prefill completion times is accurate enough to make good scheduling decisions and that the chosen online serving dataset accurately reflects production request patterns.

C3one line summary

Kairos improves SLO attainment and throughput in LLM serving by adapting to request length imbalance with priority scheduling and adaptive batching.

Formal links

2 machine-checked theorem links

Receipt and verification
First computed 2026-05-26T02:05:09.868224Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

dd1a93b957de50fff750f780e5db2083dded13054e9a0bbd29450f62ee6d2bfe

Aliases

arxiv: 2605.02329 · arxiv_version: 2605.02329v2 · doi: 10.48550/arxiv.2605.02329 · pith_short_12: 3UNJHOKX3ZIP · pith_short_16: 3UNJHOKX3ZIP752Q · pith_short_8: 3UNJHOKX
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/3UNJHOKX3ZIP752Q66AOLWZAQP \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: dd1a93b957de50fff750f780e5db2083dded13054e9a0bbd29450f62ee6d2bfe
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "2f81bf5886885711cede8487d5194cbe0e1b50b1eb314e43eb0b87ea60eee50e",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.DC",
    "submitted_at": "2026-05-04T08:29:47Z",
    "title_canon_sha256": "a1c0c0c6b87927f1d32f0c9a2fad3a0392d4fb5f372a6d8dc5d74e4adc743c03"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.02329",
    "kind": "arxiv",
    "version": 2
  }
}