High-performance generation of the Hamiltonian and Overlap matrices in FLAPW methods

(2) AICES; (3) GRS; 4); (4) J\"ulich Aachen Research Alliance -- High-performance Computing); Edoardo Di Napoli (1; Elmar Peise (2); Markus Hrywniak (3); Paolo Bientinesi (2) ((1) J\"ulich Supercomputing Centre; RWTH Aachen University

arxiv: 1602.06589 · v3 · pith:XYSMBVACnew · submitted 2016-02-21 · 💻 cs.CE · cs.DS· cs.PF· physics.comp-ph

High-performance generation of the Hamiltonian and Overlap matrices in FLAPW methods

Edoardo Di Napoli (1 , 4) , Elmar Peise (2) , Markus Hrywniak (3) , Paolo Bientinesi (2) ((1) J\"ulich Supercomputing Centre , (2) AICES , RWTH Aachen University , (3) GRS

show 1 more author

(4) J\"ulich Aachen Research Alliance -- High-performance Computing)

This is my paper

classification 💻 cs.CE cs.DScs.PFphysics.comp-ph

keywords codesperformancecodekernelsmathematicalmodeloperationsoptimized

0 comments

read the original abstract

One of the greatest efforts of computational scientists is to translate the mathematical model describing a class of physical phenomena into large and complex codes. Many of these codes face the difficulty of implementing the mathematical operations in the model in terms of low level optimized kernels offering both performance and portability. Legacy codes suffer from the additional curse of rigid design choices based on outdated performance metrics (e.g. minimization of memory footprint). Using a representative code from the Materials Science community, we propose a methodology to restructure the most expensive operations in terms of an optimized combination of dense linear algebra kernels. The resulting algorithm guarantees an increased performance and an extended life span of this code enabling larger scale simulations.

This paper has not been read by Pith yet.

High-performance generation of the Hamiltonian and Overlap matrices in FLAPW methods

discussion (0)