Dan Alistarh

Identifiers

  • name variant Dan Alistarh 0.60 · backfill

Papers (32)

  1. Apertus LLM Family Expansion via Distillation and Quantization cs.LG · 2026 · author #4
  2. Grid Games: The Power of Multiple Grids for Quantizing Large Language Models cs.LG · 2026 · author #6
  3. MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning cs.CL · 2026 · author #3
  4. Statistically-Lossless Quantization of Large Language Models cs.LG · 2026 · author #3
  5. Model Compression with Exact Budget Constraints via Riemannian Manifolds cs.LG · 2026 · author #2
  6. GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling cs.CL · 2026 · author #7
  7. Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation cs.LG · 2026 · author #4
  8. WUSH: Near-Optimal Adaptive Transforms for LLM Quantization cs.LG · 2025 · author #5
  9. Expand Neurons, Not Parameters cs.LG · 2025 · author #5
  10. The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm cs.LG · 2025 · author #5
  11. The Journey Matters: Average Parameter Count over Pre-training Unifies Sparse and Dense Scaling Laws cs.LG · 2025 · author #6
  12. "Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization cs.LG · 2024 · author #5
  13. Scalable Mechanistic Neural Networks for Differential Equations and Machine Learning cs.LG · 2024 · author #4
  14. GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers cs.LG · 2022 · author #4
  15. Distributed Learning over Unreliable Networks cs.DC · 2018 · author #6
  16. The Convergence of Sparsified Gradient Methods cs.LG · 2018 · author #1
  17. Relaxed Schedulers Can Efficiently Parallelize Iterative Algorithms cs.DS · 2018 · author #1
  18. The Transactional Conflict Problem cs.DC · 2018 · author #1
  19. Byzantine Stochastic Gradient Descent cs.LG · 2018 · author #1
  20. The Convergence of Stochastic Gradient Descent in Asynchronous Shared Memory cs.DC · 2018 · author #1
  21. Model compression via distillation and quantization cs.NE · 2018 · author #3
  22. DataBright: Towards a Global Exchange for Decentralized Data Ownership and Trusted Computation cs.ET · 2018 · author #2
  23. The Power of Choice in Priority Scheduling cs.DS · 2017 · author #1
  24. Space-Optimal Majority in Population Protocols cs.DC · 2017 · author #1
  25. The ZipML Framework for Training Models with End-to-End Low Precision: The Cans, the Cannots, and a Little Bit of Deep Learning cs.LG · 2016 · author #4
  26. QSGD: Communication-Efficient SGD via Gradient Quantization and Encoding cs.LG · 2016 · author #1
  27. Time-Space Trade-offs in Population Protocols cs.DC · 2016 · author #1
  28. Polylogarithmic-Time Leader Election in Population Protocols Using Polylogarithmic States cs.DC · 2015 · author #1
  29. How to Elect a Leader Faster than a Tournament cs.DC · 2014 · author #1
  30. Inherent Limitations of Hybrid Transactional Memory cs.DC · 2014 · author #1
  31. The LevelArray: A Fast, Practical Long-Lived Renaming Algorithm cs.DC · 2014 · author #1
  32. Are Lock-Free Concurrent Algorithms Practically Wait-Free? cs.DC · 2013 · author #1

Mentions

  • 2501.12486 #6 · arxiv_oai · confidence 0.70 Dan Alistarh
  • 1502.05745 #1 · backfill · confidence 0.70 Dan Alistarh
  • 2601.22813 #4 · arxiv_oai · confidence 0.70 Dan Alistarh
  • 2512.00956 #5 · arxiv_oai · confidence 0.70 Dan Alistarh
  • 1411.1001 #1 · backfill · confidence 0.70 Dan Alistarh
  • 2410.06074 #4 · arxiv_oai · confidence 0.70 Dan Alistarh
  • 2510.04500 #5 · arxiv_oai · confidence 0.70 Dan Alistarh
  • 1405.5689 #1 · backfill · confidence 0.70 Dan Alistarh
  • 1405.5461 #1 · backfill · confidence 0.70 Dan Alistarh
  • 2605.29128 #4 · arxiv_oai · confidence 0.70 Dan Alistarh
  • 1311.3200 #1 · backfill · confidence 0.70 Dan Alistarh
  • 2411.02355 #5 · arxiv_oai · confidence 0.70 Dan Alistarh
  • 2604.18556 #7 · arxiv_oai · confidence 0.70 Dan Alistarh

Frequent Coauthors