pith. sign in

Georg Hager

Identifiers

  • name variant Georg Hager 0.60 · backfill

Papers (59)

  1. Architectural Trade-offs in the Energy-Efficient Era: A Comparative Study of power-capping NVIDIA H100 and H200 cs.PF · 2026 · author #3
  2. Wattlytics: A Web Platform for Co-Optimizing Performance, Energy, and TCO in HPC Clusters cs.DC · 2026 · author #2
  3. Cache Blocking of Distributed-Memory Parallel Matrix Power Kernels cs.DC · 2024 · author #4
  4. Algebraic Temporal Blocking for Sparse Iterative Solvers on Multi-Core CPUs math.NA · 2023 · author #3
  5. Benefits from using mixed precision computations in the ELPA-AEO and ESSEX-II eigensolver projects physics.comp-ph · 2018 · author #9
  6. On the accuracy and usefulness of analytic energy models for contemporary multicore processors cs.PF · 2018 · author #2
  7. Validation of hardware events for successful performance pattern identification in High Performance Computing cs.DC · 2017 · author #3
  8. CRAFT: A library for easier application-level Checkpoint/Restart and Automatic Fault Tolerance cs.DC · 2017 · author #5
  9. LIKWID Monitoring Stack: A flexible framework enabling job specific performance monitoring for the masses cs.DC · 2017 · author #3
  10. An analysis of core- and chip-level architectural features in four generations of Intel server processors cs.PF · 2017 · author #2
  11. Kerncraft: A Tool for Analytic Performance Modeling of Loop Kernels cs.PF · 2017 · author #3
  12. Performance analysis of the Kahan-enhanced scalar product on current multi- and manycore processors cs.PF · 2016 · author #5
  13. Analysis of Intel's Haswell Microarchitecture Using The ECM Model and Microbenchmarks cs.DC · 2015 · author #4
  14. Optimization of an electromagnetics code with multicore wavefront diamond blocking and multi-dimensional intra-tile parallelization cs.CE · 2015 · author #3
  15. Multi-dimensional intra-tile parallelization for memory-starved stencil computations cs.DC · 2015 · author #2
  16. High-performance implementation of Chebyshev filter diagonalization for interior eigenvalue computations math.NA · 2015 · author #6
  17. Automatic Loop Kernel Analysis and Performance Modeling With Kerncraft cs.PF · 2015 · author #2
  18. GHOST: Building blocks for high performance sparse linear algebra on heterogeneous systems cs.DC · 2015 · author #9
  19. Short Note on Costs of Floating Point Operations on current x86-64 Architectures: Denormals, Overflow, Underflow, and Division by Zero cs.PF · 2015 · author #3
  20. Building a fault tolerant application using the GASPI communication layer cs.DC · 2015 · author #6
  21. Performance analysis of the Kahan-enhanced scalar product on current multicore processors cs.PF · 2015 · author #4
  22. Electron confinement in graphene with gate-defined quantum dots cond-mat.mes-hall · 2015 · author #2
  23. Towards energy efficiency and maximum computational intensity for stencil algorithms using wavefront diamond temporal blocking cs.PF · 2014 · author #2
  24. Performance Engineering of the Kernel Polynomial Method on Large-Scale CPU-GPU Systems cs.CE · 2014 · author #2
  25. Quantifying performance bottlenecks of stencil computations using the Execution-Cache-Memory model cs.PF · 2014 · author #3
  26. Multicore-optimized wavefront diamond blocking for optimizing stencil updates cs.DC · 2014 · author #2
  27. Comparing the Performance of Different x86 SIMD Instruction Sets for a Medical Imaging Application on Modern Multi- and Manycore Chips cs.DC · 2014 · author #3
  28. Performance Engineering for a Medical Imaging Application on the Intel Xeon Phi Accelerator cs.DC · 2013 · author #3
  29. A unified sparse matrix data format for efficient general sparse matrix-vector multiply on modern processors with wide SIMD units cs.MS · 2013 · author #2
  30. Chip-level and multi-node analysis of energy-optimized lattice-Boltzmann CFD simulations cs.PF · 2013 · author #2
  31. Optimization of FASTEST-3D for Modern Multicore Systems cs.PF · 2013 · author #2
  32. Model-guided Performance Analysis of the Sparse Matrix-Matrix Multiplication cs.PF · 2013 · author #3
  33. Asynchronous MPI for the Masses cs.DC · 2013 · author #2
  34. Exploring performance and power properties of modern multicore chips via simple machine models cs.PF · 2012 · author #1
  35. Best practices for HPM-assisted performance engineering on modern multicore processors cs.PF · 2012 · author #2
  36. Sparse matrix-vector multiplication on GPGPU clusters: A new storage format and a scalable implementation cs.DC · 2011 · author #2
  37. Performance engineering for the Lattice Boltzmann method on GPGPUs: Architectural requirements and performance results cs.PF · 2011 · author #4
  38. Domain decomposition and locality optimization for large-scale lattice Boltzmann simulations cs.DC · 2011 · author #3
  39. Comparison of different Propagation Steps for the Lattice Boltzmann Method cs.DC · 2011 · author #3
  40. Hybrid-parallel sparse matrix-vector multiplication with explicit communication overlap on current multicore-based systems cs.DC · 2011 · author #3
  41. Pushing the limits for medical image reconstruction on recent standard multicore processors cs.PF · 2011 · author #2
  42. LIKWID: Lightweight Performance Tools cs.DC · 2011 · author #2
  43. Expression Templates Revisited: A Performance Analysis of the Current ET Methodology cs.PF · 2011 · author #2
  44. Optimizing ccNUMA locality for task-parallel execution under OpenMP and TBB on multicore-based systems cs.DC · 2010 · author #2
  45. Parallel sparse matrix-vector multiplication as a test case for hybrid MPI+OpenMP programming cs.PF · 2010 · author #2
  46. A Flexible Patch-Based Lattice Boltzmann Parallelization Approach for Heterogeneous GPU-CPU Clusters cs.DC · 2010 · author #4
  47. Leveraging shared caches for parallel temporal blocking of stencil codes on multicore processors and clusters cs.DC · 2010 · author #2
  48. LIKWID: A lightweight performance-oriented tool suite for x86 multicore environments cs.DC · 2010 · author #2
  49. Efficient multicore-aware parallelization strategies for iterative stencil computations cs.PF · 2010 · author #3
  50. Multicore-aware parallel temporal blocking of stencil codes for shared and distributed memory cs.PF · 2009 · author #2
  51. Multi-core architectures: Complexities of performance prediction and the impact of cache topology cs.PF · 2009 · author #2
  52. Performance limitations for sparse matrix-vector multiplications on current multicore environments cs.PF · 2009 · author #2
  53. Introducing a Performance Model for Bandwidth-Limited Loop Kernels cs.PF · 2009 · author #2
  54. A Proof of Concept for Optimizing Task Parallelism by Locality Queues cs.PF · 2009 · author #2
  55. RZBENCH: Performance evaluation of current HPC architectures using low-level and application benchmarks cs.DC · 2007 · author #1
  56. Data access optimizations for highly threaded multi-core CPUs with multiple memory controllers cs.DC · 2007 · author #1
  57. Carrier-density effects in many-polaron systems cond-mat.str-el · 2006 · author #2
  58. Phase diagram of the spin-Peierls chain with local coupling cond-mat.str-el · 2006 · author #2
  59. The spin-Peierls chain revisited cond-mat.str-el · 2006 · author #1

Mentions

  • 1511.03639 #4 · backfill · confidence 0.70 Georg Hager
  • 1510.05218 #3 · backfill · confidence 0.70 Georg Hager
  • 1510.04995 #2 · backfill · confidence 0.70 Georg Hager
  • 1510.04895 #6 · backfill · confidence 0.70 Georg Hager
  • 1509.03778 #2 · backfill · confidence 0.70 Georg Hager
  • 1507.08101 #9 · backfill · confidence 0.70 Georg Hager
  • 1506.03997 #3 · backfill · confidence 0.70 Georg Hager
  • 1112.5588 #2 · arxiv_oai · confidence 0.70 Georg Hager
  • 1505.04628 #6 · backfill · confidence 0.70 Georg Hager
  • 1505.02586 #4 · backfill · confidence 0.70 Georg Hager
  • 1503.05815 #2 · backfill · confidence 0.70 Georg Hager
  • 1410.5561 #2 · backfill · confidence 0.70 Georg Hager
  • 1410.5242 #2 · backfill · confidence 0.70 Georg Hager
  • 1410.5010 #3 · backfill · confidence 0.70 Georg Hager
  • 1410.3060 #2 · backfill · confidence 0.70 Georg Hager
  • 1401.7494 #3 · backfill · confidence 0.70 Georg Hager
  • 1401.3615 #3 · backfill · confidence 0.70 Georg Hager
  • 1307.6209 #2 · backfill · confidence 0.70 Georg Hager
  • 1304.7664 #2 · backfill · confidence 0.70 Georg Hager
  • 1303.4538 #2 · backfill · confidence 0.70 Georg Hager
  • 1303.1651 #3 · backfill · confidence 0.70 Georg Hager
  • 1302.4280 #2 · backfill · confidence 0.70 Georg Hager
  • 1208.2908 #1 · backfill · confidence 0.70 Georg Hager
  • 1206.3738 #2 · backfill · confidence 0.70 Georg Hager
  • 1112.5588 #2 · backfill · confidence 0.70 Georg Hager
  • 1112.0850 #4 · backfill · confidence 0.70 Georg Hager
  • 1111.1129 #3 · backfill · confidence 0.70 Georg Hager
  • 1111.0922 #3 · backfill · confidence 0.70 Georg Hager
  • 1106.5908 #3 · backfill · confidence 0.70 Georg Hager
  • 1104.5243 #2 · backfill · confidence 0.70 Georg Hager
  • 1104.4874 #2 · backfill · confidence 0.70 Georg Hager
  • 1104.1729 #2 · backfill · confidence 0.70 Georg Hager
  • 1101.0093 #2 · backfill · confidence 0.70 Georg Hager
  • 1101.0091 #2 · backfill · confidence 0.70 Georg Hager
  • 1007.1388 #4 · backfill · confidence 0.70 Georg Hager
  • 1006.3148 #2 · backfill · confidence 0.70 Georg Hager
  • 1004.4431 #2 · backfill · confidence 0.70 Georg Hager
  • 1004.1741 #3 · backfill · confidence 0.70 Georg Hager
  • 0912.4506 #2 · backfill · confidence 0.70 Georg Hager
  • 0910.4865 #2 · backfill · confidence 0.70 Georg Hager
  • 0910.4836 #2 · backfill · confidence 0.70 Georg Hager
  • 0905.0792 #2 · backfill · confidence 0.70 Georg Hager
  • 0902.1884 #2 · backfill · confidence 0.70 Georg Hager
  • 0712.3389 #1 · backfill · confidence 0.70 Georg Hager
  • 0712.2302 #1 · backfill · confidence 0.70 Georg Hager

Frequent Coauthors