pith. sign in
Pith Number

pith:TAIAO7EI

pith:2026:TAIAO7EI7K5AYX5NZ6EW4UKHKE
not attested not anchored not stored refs pending

RouterWise: Joint Resource Allocation and Routing for Latency-Aware Multi-Model LLM Serving

Adel N. Toosi, Christopher Leckie, Gholamreza Haffari, Hossein Hosseini Kasnavieh

Jointly tuning GPU shares and routing fractions across models raises output quality by up to 87 percent while meeting a fixed latency target.

arxiv:2604.10907 v2 · 2026-04-13 · cs.NI · cs.DC

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{TAIAO7EI7K5AYX5NZ6EW4UKHKE}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

even on the same GPU cluster, achievable output-quality score can vary by up to 87% across retained setups, highlighting that resource allocation is a key determinant of routing performance.

C2weakest assumption

The latency models obtained from system profiling accurately predict end-to-end latency when the routing policy induces a particular load on each model under a chosen resource allocation.

C3one line summary

Joint resource allocation and routing for multi-model LLM serving can produce up to 87% variation in achievable output quality across setups on the same GPU cluster.

Receipt and verification
First computed 2026-06-23T02:13:23.752366Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

9810077c88faba0c5fadcf896e51475139b565c320149bddc58b74d993b89d82

Aliases

arxiv: 2604.10907 · arxiv_version: 2604.10907v2 · doi: 10.48550/arxiv.2604.10907 · pith_short_12: TAIAO7EI7K5A · pith_short_16: TAIAO7EI7K5AYX5N · pith_short_8: TAIAO7EI
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/TAIAO7EI7K5AYX5NZ6EW4UKHKE \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 9810077c88faba0c5fadcf896e51475139b565c320149bddc58b74d993b89d82
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "d7290951045fc9ea0eb4d833ce03c9388bc1fc153ed5ab3ca2bb2dcc7ae7e211",
    "cross_cats_sorted": [
      "cs.DC"
    ],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.NI",
    "submitted_at": "2026-04-13T02:13:13Z",
    "title_canon_sha256": "c0c4483432a33db3252166058425750f8d4a3af39d32a9d3698efa9df1369c9b"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2604.10907",
    "kind": "arxiv",
    "version": 2
  }
}