GreenCache dynamically manages LLM KV cache resources to reduce carbon emissions by 15.1% on average (up to 25.3%) while meeting latency constraints for over 90% of requests on real traces.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
verdicts
CONDITIONAL 2representative citing papers
Equilibria delivers per-container fairness controls and observability for CXL memory tiering, improving production workload performance by up to 52% over Linux TPP while suppressing noisy-neighbor interference.
citing papers explorer
-
Cache Your Prompt When It's Green: Carbon-Aware Caching for Large Language Model Serving
GreenCache dynamically manages LLM KV cache resources to reduce carbon emissions by 15.1% on average (up to 25.3%) while meeting latency constraints for over 90% of requests on real traces.
-
Equilibria: Fair Multi-Tenant CXL Memory Tiering At Scale
Equilibria delivers per-container fairness controls and observability for CXL memory tiering, improving production workload performance by up to 52% over Linux TPP while suppressing noisy-neighbor interference.