Torsten Hoefler

Identifiers

  • name variant Torsten Hoefler 0.60 · backfill

Papers (31)

  1. Can AI Weather Models Predict Beyond Two Weeks? A Quantitative Benchmark and Analysis of Long Rollouts cs.LG · 2026 · author #4
  2. Confounder Detection via Treatment Intent: A New Observational Study Design stat.ME · 2026 · author #3
  3. Large Language Model Selection with Limited Annotations cs.CL · 2026 · author #4
  4. Grid Games: The Power of Multiple Grids for Quantizing Large Language Models cs.LG · 2026 · author #5
  5. ADELIA: Automatic Differentiation for Efficient Laplace Inference Approximations cs.DC · 2026 · author #9
  6. Resilient AI Supercomputer Networking using MRC and SRv6 cs.NI · 2026 · author #15
  7. SFT-then-RL Outperforms Mixed-Policy Methods for LLM Reasoning cs.LG · 2026 · author #3
  8. Earth System Foundation Model (ESFM): A unified framework for heterogeneous data integration and forecasting physics.ao-ph · 2026 · author #10
  9. An Engineering Journey Training Large Language Models at Scale on Alps: The Apertus Experience cs.DC · 2026 · author #11
  10. Process Reward Agents for Steering Knowledge-Intensive Reasoning cs.AI · 2026 · author #4
  11. SpaDA: A Spatial Dataflow Architecture Programming Language cs.DC · 2025 · author #3
  12. PICO: Performance Insights for Collective Operations cs.DC · 2025 · author #4
  13. The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm cs.LG · 2025 · author #4
  14. Assessing requirements to scale to practical quantum advantage quant-ph · 2022 · author #5
  15. GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers cs.LG · 2022 · author #3
  16. Graph Processing on FPGAs: Taxonomy, Survey, Challenges cs.DC · 2019 · author #5
  17. A Modular Benchmarking Infrastructure for High-Performance and Reproducible Deep Learning cs.DC · 2019 · author #6
  18. Augment your batch: better training with larger batches cs.LG · 2019 · author #5
  19. SimFS: A Simulation Data Virtualizing File System Interface cs.DC · 2019 · author #4
  20. The Convergence of Sparsified Gradient Methods cs.LG · 2018 · author #2
  21. Neural Code Comprehension: A Learnable Representation of Code Semantics cs.LG · 2018 · author #3
  22. Survey and Taxonomy of Lossless Graph Compression and Space-Efficient Graph Representations cs.DS · 2018 · author #2
  23. μ-cuDNN: Accelerating Deep Learning Frameworks with Micro-Batching cs.LG · 2018 · author #3
  24. Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis cs.LG · 2018 · author #2
  25. sPIN: High-performance streaming Processing in the Network cs.DC · 2017 · author #1
  26. Communication-Avoiding Parallel Algorithms for Solving Triangular Systems of Linear Equations cs.DC · 2016 · author #3
  27. Scaling betweenness centrality using communication-efficient sparse matrix multiplication cs.DC · 2016 · author #4
  28. AllConcur: Leaderless Concurrent Atomic Broadcast (Extended Version) cs.DC · 2016 · author #2
  29. SDNsec: Forwarding Accountability for the SDN Data Plane cs.NI · 2016 · author #4
  30. A communication-avoiding parallel algorithm for the symmetric eigenvalue problem cs.DC · 2016 · author #4
  31. Sparse Tensor Algebra as a Parallel Programming Model cs.MS · 2015 · author #2

Mentions

  • 2605.30184 #4 · arxiv_oai · confidence 0.70 Torsten Hoefler
  • 2605.26413 #3 · arxiv_oai · confidence 0.70 Torsten Hoefler
  • 2605.24981 #4 · arxiv_oai · confidence 0.70 Torsten Hoefler

Frequent Coauthors