Georg Hager

Identifiers

name variant Georg Hager 0.60 · backfill

Papers (59)

Architectural Trade-offs in the Energy-Efficient Era: A Comparative Study of power-capping NVIDIA H100 and H200 cs.PF · 2026 · author #3
Wattlytics: A Web Platform for Co-Optimizing Performance, Energy, and TCO in HPC Clusters cs.DC · 2026 · author #2
Cache Blocking of Distributed-Memory Parallel Matrix Power Kernels cs.DC · 2024 · author #4
Algebraic Temporal Blocking for Sparse Iterative Solvers on Multi-Core CPUs math.NA · 2023 · author #3
Benefits from using mixed precision computations in the ELPA-AEO and ESSEX-II eigensolver projects physics.comp-ph · 2018 · author #9
On the accuracy and usefulness of analytic energy models for contemporary multicore processors cs.PF · 2018 · author #2
Validation of hardware events for successful performance pattern identification in High Performance Computing cs.DC · 2017 · author #3
CRAFT: A library for easier application-level Checkpoint/Restart and Automatic Fault Tolerance cs.DC · 2017 · author #5
LIKWID Monitoring Stack: A flexible framework enabling job specific performance monitoring for the masses cs.DC · 2017 · author #3
An analysis of core- and chip-level architectural features in four generations of Intel server processors cs.PF · 2017 · author #2
Kerncraft: A Tool for Analytic Performance Modeling of Loop Kernels cs.PF · 2017 · author #3
Performance analysis of the Kahan-enhanced scalar product on current multi- and manycore processors cs.PF · 2016 · author #5
Analysis of Intel's Haswell Microarchitecture Using The ECM Model and Microbenchmarks cs.DC · 2015 · author #4
Optimization of an electromagnetics code with multicore wavefront diamond blocking and multi-dimensional intra-tile parallelization cs.CE · 2015 · author #3
Multi-dimensional intra-tile parallelization for memory-starved stencil computations cs.DC · 2015 · author #2
High-performance implementation of Chebyshev filter diagonalization for interior eigenvalue computations math.NA · 2015 · author #6
Automatic Loop Kernel Analysis and Performance Modeling With Kerncraft cs.PF · 2015 · author #2
GHOST: Building blocks for high performance sparse linear algebra on heterogeneous systems cs.DC · 2015 · author #9
Short Note on Costs of Floating Point Operations on current x86-64 Architectures: Denormals, Overflow, Underflow, and Division by Zero cs.PF · 2015 · author #3
Building a fault tolerant application using the GASPI communication layer cs.DC · 2015 · author #6
Performance analysis of the Kahan-enhanced scalar product on current multicore processors cs.PF · 2015 · author #4
Electron confinement in graphene with gate-defined quantum dots cond-mat.mes-hall · 2015 · author #2
Towards energy efficiency and maximum computational intensity for stencil algorithms using wavefront diamond temporal blocking cs.PF · 2014 · author #2
Performance Engineering of the Kernel Polynomial Method on Large-Scale CPU-GPU Systems cs.CE · 2014 · author #2
Quantifying performance bottlenecks of stencil computations using the Execution-Cache-Memory model cs.PF · 2014 · author #3
Multicore-optimized wavefront diamond blocking for optimizing stencil updates cs.DC · 2014 · author #2
Comparing the Performance of Different x86 SIMD Instruction Sets for a Medical Imaging Application on Modern Multi- and Manycore Chips cs.DC · 2014 · author #3
Performance Engineering for a Medical Imaging Application on the Intel Xeon Phi Accelerator cs.DC · 2013 · author #3
A unified sparse matrix data format for efficient general sparse matrix-vector multiply on modern processors with wide SIMD units cs.MS · 2013 · author #2
Chip-level and multi-node analysis of energy-optimized lattice-Boltzmann CFD simulations cs.PF · 2013 · author #2
Optimization of FASTEST-3D for Modern Multicore Systems cs.PF · 2013 · author #2
Model-guided Performance Analysis of the Sparse Matrix-Matrix Multiplication cs.PF · 2013 · author #3
Asynchronous MPI for the Masses cs.DC · 2013 · author #2
Exploring performance and power properties of modern multicore chips via simple machine models cs.PF · 2012 · author #1
Best practices for HPM-assisted performance engineering on modern multicore processors cs.PF · 2012 · author #2
Sparse matrix-vector multiplication on GPGPU clusters: A new storage format and a scalable implementation cs.DC · 2011 · author #2
Performance engineering for the Lattice Boltzmann method on GPGPUs: Architectural requirements and performance results cs.PF · 2011 · author #4
Domain decomposition and locality optimization for large-scale lattice Boltzmann simulations cs.DC · 2011 · author #3
Comparison of different Propagation Steps for the Lattice Boltzmann Method cs.DC · 2011 · author #3
Hybrid-parallel sparse matrix-vector multiplication with explicit communication overlap on current multicore-based systems cs.DC · 2011 · author #3
Pushing the limits for medical image reconstruction on recent standard multicore processors cs.PF · 2011 · author #2
LIKWID: Lightweight Performance Tools cs.DC · 2011 · author #2
Expression Templates Revisited: A Performance Analysis of the Current ET Methodology cs.PF · 2011 · author #2
Optimizing ccNUMA locality for task-parallel execution under OpenMP and TBB on multicore-based systems cs.DC · 2010 · author #2
Parallel sparse matrix-vector multiplication as a test case for hybrid MPI+OpenMP programming cs.PF · 2010 · author #2
A Flexible Patch-Based Lattice Boltzmann Parallelization Approach for Heterogeneous GPU-CPU Clusters cs.DC · 2010 · author #4
Leveraging shared caches for parallel temporal blocking of stencil codes on multicore processors and clusters cs.DC · 2010 · author #2
LIKWID: A lightweight performance-oriented tool suite for x86 multicore environments cs.DC · 2010 · author #2
Efficient multicore-aware parallelization strategies for iterative stencil computations cs.PF · 2010 · author #3
Multicore-aware parallel temporal blocking of stencil codes for shared and distributed memory cs.PF · 2009 · author #2
Multi-core architectures: Complexities of performance prediction and the impact of cache topology cs.PF · 2009 · author #2
Performance limitations for sparse matrix-vector multiplications on current multicore environments cs.PF · 2009 · author #2
Introducing a Performance Model for Bandwidth-Limited Loop Kernels cs.PF · 2009 · author #2
A Proof of Concept for Optimizing Task Parallelism by Locality Queues cs.PF · 2009 · author #2
RZBENCH: Performance evaluation of current HPC architectures using low-level and application benchmarks cs.DC · 2007 · author #1
Data access optimizations for highly threaded multi-core CPUs with multiple memory controllers cs.DC · 2007 · author #1
Carrier-density effects in many-polaron systems cond-mat.str-el · 2006 · author #2
Phase diagram of the spin-Peierls chain with local coupling cond-mat.str-el · 2006 · author #2
The spin-Peierls chain revisited cond-mat.str-el · 2006 · author #1

Mentions

1511.03639 #4 · backfill · confidence 0.70 Georg Hager
1510.05218 #3 · backfill · confidence 0.70 Georg Hager
1510.04995 #2 · backfill · confidence 0.70 Georg Hager
1510.04895 #6 · backfill · confidence 0.70 Georg Hager
1509.03778 #2 · backfill · confidence 0.70 Georg Hager
1507.08101 #9 · backfill · confidence 0.70 Georg Hager
1506.03997 #3 · backfill · confidence 0.70 Georg Hager
1112.5588 #2 · arxiv_oai · confidence 0.70 Georg Hager
1505.04628 #6 · backfill · confidence 0.70 Georg Hager
1505.02586 #4 · backfill · confidence 0.70 Georg Hager
1503.05815 #2 · backfill · confidence 0.70 Georg Hager
1410.5561 #2 · backfill · confidence 0.70 Georg Hager
1410.5242 #2 · backfill · confidence 0.70 Georg Hager
1410.5010 #3 · backfill · confidence 0.70 Georg Hager
1410.3060 #2 · backfill · confidence 0.70 Georg Hager
1401.7494 #3 · backfill · confidence 0.70 Georg Hager
1401.3615 #3 · backfill · confidence 0.70 Georg Hager
1307.6209 #2 · backfill · confidence 0.70 Georg Hager
1304.7664 #2 · backfill · confidence 0.70 Georg Hager
1303.4538 #2 · backfill · confidence 0.70 Georg Hager
1303.1651 #3 · backfill · confidence 0.70 Georg Hager
1302.4280 #2 · backfill · confidence 0.70 Georg Hager
1208.2908 #1 · backfill · confidence 0.70 Georg Hager
1206.3738 #2 · backfill · confidence 0.70 Georg Hager
1112.5588 #2 · backfill · confidence 0.70 Georg Hager
1112.0850 #4 · backfill · confidence 0.70 Georg Hager
1111.1129 #3 · backfill · confidence 0.70 Georg Hager
1111.0922 #3 · backfill · confidence 0.70 Georg Hager
1106.5908 #3 · backfill · confidence 0.70 Georg Hager
1104.5243 #2 · backfill · confidence 0.70 Georg Hager
1104.4874 #2 · backfill · confidence 0.70 Georg Hager
1104.1729 #2 · backfill · confidence 0.70 Georg Hager
1101.0093 #2 · backfill · confidence 0.70 Georg Hager
1101.0091 #2 · backfill · confidence 0.70 Georg Hager
1007.1388 #4 · backfill · confidence 0.70 Georg Hager
1006.3148 #2 · backfill · confidence 0.70 Georg Hager
1004.4431 #2 · backfill · confidence 0.70 Georg Hager
1004.1741 #3 · backfill · confidence 0.70 Georg Hager
0912.4506 #2 · backfill · confidence 0.70 Georg Hager
0910.4865 #2 · backfill · confidence 0.70 Georg Hager
0910.4836 #2 · backfill · confidence 0.70 Georg Hager
0905.0792 #2 · backfill · confidence 0.70 Georg Hager
0902.1884 #2 · backfill · confidence 0.70 Georg Hager
0712.3389 #1 · backfill · confidence 0.70 Georg Hager
0712.2302 #1 · backfill · confidence 0.70 Georg Hager

Frequent Coauthors

Gerhard Wellein 47 shared papers
Holger Fehske 15 shared papers
Jan Treibig 15 shared papers
Markus Wittmann 9 shared papers
Thomas Zeiser 9 shared papers
Jan Eitzinger 8 shared papers
Moritz Kreutzer 8 shared papers
Johannes Hofmann 7 shared papers
Andreas Pieper 5 shared papers
Dietmar Fey 5 shared papers
Faisal Shahzad 4 shared papers
Hatem Ltaief 4 shared papers
Jonas Thies 4 shared papers
Achim Basermann 3 shared papers
Alan R. Bishop 3 shared papers
Andreas Alvermann 3 shared papers
David Keyes 3 shared papers
Gerald Schubert 3 shared papers
Holger Stengel 3 shared papers
Johannes Habich 3 shared papers