URL https://www.aclweb

Association for Computational Linguistics · 2020

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

ZipMoE: Efficient On-Device MoE Serving via Lossless Compression and Cache-Affinity Scheduling

cs.DC · 2026-01-29 · unverdicted · novelty 5.0

ZipMoE delivers up to 72.77% lower inference latency and 6.76x higher throughput for on-device MoE models via lossless compression and cache-affinity scheduling with a claimed provable guarantee.

citing papers explorer

Showing 1 of 1 citing paper.

ZipMoE: Efficient On-Device MoE Serving via Lossless Compression and Cache-Affinity Scheduling cs.DC · 2026-01-29 · unverdicted · none · ref 15
ZipMoE delivers up to 72.77% lower inference latency and 6.76x higher throughput for on-device MoE models via lossless compression and cache-affinity scheduling with a claimed provable guarantee.

URL https://www.aclweb

fields

years

verdicts

representative citing papers

citing papers explorer