Proceedings of Machine Learning and Systems , year=

Efficiently Scaling Transformer Inference , author=

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Latent Cache Flow: Model-to-Model Communication Without Text

cs.LG · 2026-05-19 · unverdicted · novelty 6.0

Latent Cache Flow uses small adapters to jointly translate and compress KV caches between LLMs, enabling accurate communication even with mismatched contexts and outperforming both prior cache adapters and text in early tests.

citing papers explorer

Showing 1 of 1 citing paper.

Latent Cache Flow: Model-to-Model Communication Without Text cs.LG · 2026-05-19 · unverdicted · none · ref 5
Latent Cache Flow uses small adapters to jointly translate and compress KV caches between LLMs, enabling accurate communication even with mismatched contexts and outperforming both prior cache adapters and text in early tests.

Proceedings of Machine Learning and Systems , year=

fields

years

verdicts

representative citing papers

citing papers explorer