Torsten Hoefler — Pith Author Registry

Identifiers

name variant Torsten Hoefler 0.60 · backfill

Papers (31)

Can AI Weather Models Predict Beyond Two Weeks? A Quantitative Benchmark and Analysis of Long Rollouts cs.LG · 2026 · author #4
Confounder Detection via Treatment Intent: A New Observational Study Design stat.ME · 2026 · author #3
Large Language Model Selection with Limited Annotations cs.CL · 2026 · author #4
Grid Games: The Power of Multiple Grids for Quantizing Large Language Models cs.LG · 2026 · author #5
ADELIA: Automatic Differentiation for Efficient Laplace Inference Approximations cs.DC · 2026 · author #9
Resilient AI Supercomputer Networking using MRC and SRv6 cs.NI · 2026 · author #15
SFT-then-RL Outperforms Mixed-Policy Methods for LLM Reasoning cs.LG · 2026 · author #3
Earth System Foundation Model (ESFM): A unified framework for heterogeneous data integration and forecasting physics.ao-ph · 2026 · author #10
An Engineering Journey Training Large Language Models at Scale on Alps: The Apertus Experience cs.DC · 2026 · author #11
Process Reward Agents for Steering Knowledge-Intensive Reasoning cs.AI · 2026 · author #4
SpaDA: A Spatial Dataflow Architecture Programming Language cs.DC · 2025 · author #3
PICO: Performance Insights for Collective Operations cs.DC · 2025 · author #4
The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm cs.LG · 2025 · author #4
Assessing requirements to scale to practical quantum advantage quant-ph · 2022 · author #5
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers cs.LG · 2022 · author #3
Graph Processing on FPGAs: Taxonomy, Survey, Challenges cs.DC · 2019 · author #5
A Modular Benchmarking Infrastructure for High-Performance and Reproducible Deep Learning cs.DC · 2019 · author #6
Augment your batch: better training with larger batches cs.LG · 2019 · author #5
SimFS: A Simulation Data Virtualizing File System Interface cs.DC · 2019 · author #4
The Convergence of Sparsified Gradient Methods cs.LG · 2018 · author #2
Neural Code Comprehension: A Learnable Representation of Code Semantics cs.LG · 2018 · author #3
Survey and Taxonomy of Lossless Graph Compression and Space-Efficient Graph Representations cs.DS · 2018 · author #2
{\mu}-cuDNN: Accelerating Deep Learning Frameworks with Micro-Batching cs.LG · 2018 · author #3
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis cs.LG · 2018 · author #2
sPIN: High-performance streaming Processing in the Network cs.DC · 2017 · author #1
Communication-Avoiding Parallel Algorithms for Solving Triangular Systems of Linear Equations cs.DC · 2016 · author #3
Scaling betweenness centrality using communication-efficient sparse matrix multiplication cs.DC · 2016 · author #4
AllConcur: Leaderless Concurrent Atomic Broadcast (Extended Version) cs.DC · 2016 · author #2
SDNsec: Forwarding Accountability for the SDN Data Plane cs.NI · 2016 · author #4
A communication-avoiding parallel algorithm for the symmetric eigenvalue problem cs.DC · 2016 · author #4
Sparse Tensor Algebra as a Parallel Programming Model cs.MS · 2015 · author #2

Mentions

2605.30184 #4 · arxiv_oai · confidence 0.70 Torsten Hoefler
2605.26413 #3 · arxiv_oai · confidence 0.70 Torsten Hoefler
2605.24981 #4 · arxiv_oai · confidence 0.70 Torsten Hoefler

Frequent Coauthors

Tal Ben-Nun 7 shared papers
Dan Alistarh 4 shared papers
Edgar Solomonik 4 shared papers
Maciej Besta 4 shared papers
Alexandros Nikolaos Ziogas 2 shared papers
Benedikt Soja 2 shared papers
Fanny Lehmann 2 shared papers
Firat Ozdemir 2 shared papers
Imanol Schlag 2 shared papers
Patrik Okanovic 2 shared papers
Salvatore Di Girolamo 2 shared papers
Sebastian Schemm 2 shared papers
Siddhartha Mishra 2 shared papers
Thomas Schulthess 2 shared papers
Yun Cheng 2 shared papers
Aarthi Sundaram 1 shared papers
Abdul Kabbani 1 shared papers
Abhishek Dosi 1 shared papers
Adrian Perrig 1 shared papers
Adrian Popa 1 shared papers