Figure 3: TAU display of the performance of a 16-process MPI program with multiple PAPI metrics, subroutine breakdown, and call graph display.

Back to Article