Content

Node-Level Performance Engineering tutorial to be featured again at SC23

October 23, 2023

Our popular “Node-Level Performance Engineering” full-day tutorial has been accepted again (now the twelfth time in a row!) for presentation at SC23, the International Conference for High Performance Computing, Networking, Storage and Analysis. Together with Thomas Gruber and Gerhard Wellein I will teach the basics of node-level computer architecture, the LIKWID performance tools suite, analytic performance modeling (via the Roofline model), and model-guided optimization. Find the details in the official SC23 agenda.

Get the gist of it in our flashy promo video:

IACS Stony Brook seminar talk available

October 21, 2021

On October 14, 2021 I gave an invited online talk at Stony Brook University‘s Institute for Advanced Computational Science (IACS). I talked about white/gray-box approaches to performance modeling and how they can fail in interesting ways on highly parallel systems because of desynchronization effects. The slides and a video recording are now available:

Title: From numbers to insight via performance models

Abstract: High-performance parallel computers are complex systems. There seems to be a general consensus among developers that the performance of application programs is to be taken for granted, and that it cannot really be understood in terms of simple rules and models. This talk is about using analytic performance models to make sense of performance numbers. By means of examples from computational science, I will motivate that it makes a lot of sense to try and set up performance models even if their accuracy is sometimes limited. In fact, it is when a model yields false predictions that we learn more about the problem because our assumptions are challenged. I will start with a general categorization of performance models and then turn to ECM and Roofline models for loop-based code on multicore CPUs. Going beyond the compute node level and adding communication models to the mix, I will show how stacking models on top of each other may not work as intended but instead open new insights and a fresh view on how massively parallel code is executed.

SC20 tutorial “Node-Level Performance Engineering”

October 14, 2020

Our most popular tutorial was accepted again for the SC20 conference in Atlanta! SC is a 100% virtual event this year. The tutorial will be airing on November 9 and 10 as a number of pre-recorded presentations and live Q&A sessions. There’s still time to register: https://show.jspargo.com/sc20/

Node-Level Performance Engineering tutorial to be featured again at SC17

October 19, 2017

Our popular “Node-Level Performance Engineering” full-day tutorial has been accepted again (now the sixth time in a row!) for presentation at SC17, the International Conference for High Performance Computing, Networking, Storage and Analysis. We teach the basics of node-level computer architecture, analytic performance modeling (via the Roofline model), and model-guided optimization. Watch this cool video to whet your appetite:

When: November 12, 2017, 8:30am-5:00pm

Where: Colorado Convention Center, Denver, CO.

Georg Hager's Blog

Random thoughts on High Performance Computing

Content

Node-Level Performance Engineering tutorial to be featured again at SC23

IACS Stony Brook seminar talk available

SC20 tutorial “Node-Level Performance Engineering”

Node-Level Performance Engineering tutorial to be featured again at SC17