pith:EMQ7WYLF
FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation
FlexServe allows ARM TrustZone to protect mobile LLM inference by switching memory and NPU modes on demand, cutting time to first token by over 10x versus rigid baselines.
arxiv:2603.09046 v3 · 2026-03-10 · cs.CR · cs.LG · cs.OS
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{EMQ7WYLFTUKJYMWXXJ2UJT63BF}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
FlexServe achieves an average 10.05× speedup in Time to First Token (TTFT) compared to the strawman, and an average 2.44× TTFT speedup compared to an optimized strawman with pipeline and secure NPU enabled. For multi-model agent workflows, the end-to-end speedup is up to 24.30× and 4.05× compared to the strawman and optimized strawman, respectively.
The flexible switching between protected and unprotected modes for memory and NPU does not introduce new security vulnerabilities or significant unmeasured overheads beyond the reported prototype benchmarks.
FlexServe achieves up to 10x faster time-to-first-token for secure LLM inference on mobile devices by using flexible resource isolation in TrustZone compared to standard approaches.
Cited by
Receipt and verification
| First computed | 2026-07-03T00:16:53.070116Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
2321fb61659d149c32d7ba7544cfdb09722cf9d742cfb062e82cc5fc51bec945
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/EMQ7WYLFTUKJYMWXXJ2UJT63BF \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 2321fb61659d149c32d7ba7544cfdb09722cf9d742cfb062e82cc5fc51bec945
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "f0f6015630047f5dd504b5c140c4b48f8d9e104b7d3f16d8d557b5bc563535f5",
"cross_cats_sorted": [
"cs.LG",
"cs.OS"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CR",
"submitted_at": "2026-03-10T00:31:25Z",
"title_canon_sha256": "f38ae1944cb60340e583f1ac9d2ee6b87b47688d41e4a10f9415ea7685367f7b"
},
"schema_version": "1.0",
"source": {
"id": "2603.09046",
"kind": "arxiv",
"version": 3
}
}