Title resolution pending

· 2025 · arXiv 2508.00904

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

WattGPU: Predicting Inference Power and Latency on Unseen GPUs and LLMs

cs.DC · 2026-07-02 · unverdicted · novelty 6.0

WattGPU ML models predict LLM inference power and latency on unseen GPUs with median errors of 3.4-13.5% using public data and show better performance than baselines.

Recover-LoRA for Aggressive Quantization: Reclaiming Accuracy in 2-Bit Language Models via Low-Rank Adaptation with Knowledge Distillation on Synthetic Data

cs.LG · 2026-06-02 · unverdicted · novelty 6.0

Recover-LoRA with synthetic-data distillation recovers 80-95% accuracy on most benchmarks after selective 2-bit quantization of MLP gate/up layers while delivering 7.5-23.3% throughput improvement.

citing papers explorer

Showing 2 of 2 citing papers.

WattGPU: Predicting Inference Power and Latency on Unseen GPUs and LLMs cs.DC · 2026-07-02 · unverdicted · none · ref 16
WattGPU ML models predict LLM inference power and latency on unseen GPUs with median errors of 3.4-13.5% using public data and show better performance than baselines.
Recover-LoRA for Aggressive Quantization: Reclaiming Accuracy in 2-Bit Language Models via Low-Rank Adaptation with Knowledge Distillation on Synthetic Data cs.LG · 2026-06-02 · unverdicted · none · ref 51
Recover-LoRA with synthetic-data distillation recovers 80-95% accuracy on most benchmarks after selective 2-bit quantization of MLP gate/up layers while delivering 7.5-23.3% throughput improvement.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer