pith. sign in

Xunzhuo Liu

Identifiers

  • name variant Xunzhuo Liu 0.60 · backfill

Papers (5)

  1. Dual-Pool Token-Budget Routing for Cost-Efficient and Reliable LLM Serving cs.CL · 2026 · author #1
  2. The Workload-Router-Pool Architecture for LLM Inference Optimization: A Vision Paper from the vLLM Semantic Router Project cs.LG · 2026 · author #2
  3. The 1/W Law: An Analytical Study of Context-Length Routing Topology and GPU Generation Gains for LLM Inference Energy Efficiency cs.DC · 2026 · author #2
  4. Token-Budget-Aware Pool Routing for Cost-Efficient LLM Inference cs.DC · 2026 · author #2
  5. vLLM Semantic Router: Signal Driven Decision Routing for Mixture-of-Modality Models cs.NI · 2026 · author #1

Mentions

  • 2603.04444 #1 · arxiv_oai · confidence 0.70 Xunzhuo Liu

Frequent Coauthors