Serving long-context llms at the mobile edge: Test-time reinforcement learning-based model caching and inference offloading

· 2026

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Multi-Turn Reasoning LLMs for Task Offloading in Mobile Edge Computing

cs.LG · 2026-04-08 · unverdicted · novelty 6.0

COMLLM uses multi-turn LLM reasoning via GRPO and LACS to achieve near-optimal latency, better fairness, and zero-shot generalization to larger unseen network topologies in mobile edge computing task offloading.

citing papers explorer

Showing 1 of 1 citing paper.

Multi-Turn Reasoning LLMs for Task Offloading in Mobile Edge Computing cs.LG · 2026-04-08 · unverdicted · none · ref 5
COMLLM uses multi-turn LLM reasoning via GRPO and LACS to achieve near-optimal latency, better fairness, and zero-shot generalization to larger unseen network topologies in mobile edge computing task offloading.

Serving long-context llms at the mobile edge: Test-time reinforcement learning-based model caching and inference offloading

fields

years

verdicts

representative citing papers

citing papers explorer