Detailed Application Categorization |
Detailed Function Times |
Scalability - Coverage per Category |
Scalability - Time per Category |
Scalability - Efficiency |
Function Based Profile |
Scalability - Coverage per Parallel Efficiency |
Scalability - Coverage per Parallel Speedup |
PrOMPT - Coverage per Parallel Efficiency |
PrOMPT - Coverage per Parallel Speedup |
Libraries |
Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | libqmckl.so (%) | libqmckl.so.0 (%) | libqmckl.so.0.0.0 (%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
▼run_0 | 52.47 | 0 | 0 | 0 | 0.81 | 0.06 | 0 | 0 | 0 | 5.54 | 0 | 0 | 93.6 | 0 |
▼Node skylake | 52.47 | 0 | 0 | 0 | 0.81 | 0.06 | 0 | 0 | 0 | 5.54 | 0 | 0 | 93.6 | 0 |
▼Process 2419280 | 52.47 | 0 | 0 | 0 | 0.81 | 0.06 | 0 | 0 | 0 | 5.54 | 0 | 0 | 93.6 | 0 |
○Thread 2419280 | 52.47 | 0 | 0 | 0 | 0.81 | 0.06 | 0 | 0 | 0 | 5.54 | 0 | 0 | 93.6 | 0 |
▼run_1 | 24.14 | 0 | 0 | 2.38 | 0.94 | 0.04 | 0 | 0 | 0 | 5.97 | 0 | 0 | 90.66 | 0.01 |
▼Node skylake | 24.14 | 0 | 0 | 2.38 | 0.94 | 0.04 | 0 | 0 | 0 | 5.97 | 0 | 0 | 90.66 | 0.01 |
▼Process 2419344 | 24.14 | 0 | 0 | 2.38 | 0.94 | 0.04 | 0 | 0 | 0 | 5.97 | 0 | 0 | 90.66 | 0.01 |
○Thread 2419344 | 24.14 | 0 | 0 | 2.96 | 0.93 | 0.06 | 0 | 0 | 0 | 7.42 | 0 | 0 | 88.61 | 0.02 |
○Thread 2419397 | 23.17 | 0 | 0 | 1.77 | 0.95 | 0.02 | 0 | 0 | 0 | 4.47 | 0 | 0 | 92.79 | 0 |
▼run_2 | 13.3 | 0 | 0 | 10.51 | 0.87 | 0.03 | 0 | 0 | 0 | 6.18 | 0 | 0 | 82.4 | 0.01 |
▼Node skylake | 13.3 | 0 | 0 | 10.51 | 0.87 | 0.03 | 0 | 0 | 0 | 6.18 | 0 | 0 | 82.4 | 0.01 |
▼Process 2419404 | 13.3 | 0 | 0 | 10.51 | 0.87 | 0.03 | 0 | 0 | 0 | 6.18 | 0 | 0 | 82.4 | 0.01 |
○Thread 2419404 | 13.3 | 0 | 0 | 10.19 | 1.02 | 0.08 | 0 | 0 | 0 | 10.08 | 0 | 0 | 78.61 | 0.04 |
○Thread 2419457 | 12.76 | 0 | 0 | 4.51 | 0.86 | 0.04 | 0 | 0 | 0 | 5.61 | 0 | 0 | 88.98 | 0 |
○Thread 2419458 | 12.32 | 0 | 0 | 12.7 | 0.69 | 0 | 0 | 0 | 0 | 4.99 | 0 | 0 | 81.62 | 0 |
○Thread 2419459 | 12.62 | 0 | 0 | 14.78 | 0.91 | 0 | 0 | 0 | 0 | 3.8 | 0 | 0 | 80.51 | 0 |
▼run_3 | 6.75 | 0 | 0 | 10.62 | 0.82 | 0.1 | 0 | 0 | 0 | 7.08 | 0 | 0 | 81.37 | 0.01 |
▼Node skylake | 6.75 | 0 | 0 | 10.62 | 0.82 | 0.1 | 0 | 0 | 0 | 7.08 | 0 | 0 | 81.37 | 0.01 |
▼Process 2419464 | 6.75 | 0 | 0 | 10.62 | 0.82 | 0.1 | 0 | 0 | 0 | 7.08 | 0 | 0 | 81.37 | 0.01 |
○Thread 2419464 | 6.75 | 0 | 0 | 6.08 | 0.59 | 0.22 | 0 | 0 | 0 | 15.34 | 0 | 0 | 77.69 | 0.07 |
○Thread 2419517 | 6.23 | 0 | 0 | 7.23 | 0.64 | 0.32 | 0 | 0 | 0 | 5.7 | 0 | 0 | 86.1 | 0 |
○Thread 2419518 | 6.22 | 0 | 0 | 10.46 | 0.8 | 0.08 | 0 | 0 | 0 | 6.68 | 0 | 0 | 81.98 | 0 |
○Thread 2419519 | 6.22 | 0 | 0 | 11.58 | 0.97 | 0 | 0 | 0 | 0 | 6.36 | 0 | 0 | 81.09 | 0 |
○Thread 2419520 | 6.21 | 0 | 0 | 13.05 | 1.21 | 0.16 | 0 | 0 | 0 | 5.32 | 0 | 0 | 80.26 | 0 |
○Thread 2419521 | 6.42 | 0 | 0 | 11.85 | 0.78 | 0 | 0 | 0 | 0 | 4.99 | 0 | 0 | 82.39 | 0 |
○Thread 2419522 | 6.41 | 0 | 0 | 14.12 | 0.7 | 0 | 0 | 0 | 0 | 5.46 | 0 | 0 | 79.72 | 0 |
○Thread 2419523 | 6.42 | 0 | 0 | 10.83 | 0.86 | 0 | 0 | 0 | 0 | 6.24 | 0 | 0 | 82.07 | 0 |
▼run_4 | 3.72 | 0 | 0 | 16.8 | 0.77 | 0.1 | 0 | 0 | 0 | 5.87 | 0 | 0 | 76.46 | 0.01 |
▼Node skylake | 3.72 | 0 | 0 | 16.8 | 0.77 | 0.1 | 0 | 0 | 0 | 5.87 | 0 | 0 | 76.46 | 0.01 |
▼Process 2419531 | 3.72 | 0 | 0 | 16.8 | 0.77 | 0.1 | 0 | 0 | 0 | 5.87 | 0 | 0 | 76.46 | 0.01 |
○Thread 2419531 | 3.72 | 0 | 0 | 7.12 | 0.4 | 0.54 | 0 | 0 | 0 | 22.85 | 0 | 0 | 68.95 | 0.13 |
○Thread 2419584 | 3.37 | 0 | 0 | 15.26 | 0.89 | 0.15 | 0 | 0 | 0 | 5.48 | 0 | 0 | 78.22 | 0 |
○Thread 2419585 | 3.38 | 0 | 0 | 16.72 | 0.74 | 0.15 | 0 | 0 | 0 | 5.33 | 0 | 0 | 77.07 | 0 |
○Thread 2419586 | 3.39 | 0 | 0 | 18.41 | 0.44 | 0 | 0 | 0 | 0 | 4.12 | 0 | 0 | 77.03 | 0 |
○Thread 2419587 | 3.44 | 0 | 0 | 16.57 | 1.16 | 0.15 | 0 | 0 | 0 | 6.69 | 0 | 0 | 75.44 | 0 |
○Thread 2419588 | 3.43 | 0 | 0 | 18.51 | 1.02 | 0 | 0 | 0 | 0 | 5.69 | 0 | 0 | 74.78 | 0 |
○Thread 2419589 | 3.43 | 0 | 0 | 17.9 | 1.02 | 0.15 | 0 | 0 | 0 | 3.64 | 0 | 0 | 77.29 | 0 |
○Thread 2419590 | 3.44 | 0 | 0 | 17.44 | 0.29 | 0 | 0 | 0 | 0 | 5.81 | 0 | 0 | 76.45 | 0 |
○Thread 2419591 | 3.44 | 0 | 0 | 16.86 | 1.02 | 0 | 0 | 0 | 0 | 3.05 | 0 | 0 | 79.07 | 0 |
○Thread 2419592 | 3.44 | 0 | 0 | 17.71 | 0.44 | 0.29 | 0 | 0 | 0 | 4.06 | 0 | 0 | 77.5 | 0 |
○Thread 2419593 | 3.44 | 0 | 0 | 17.42 | 0.73 | 0 | 0 | 0 | 0 | 5.08 | 0 | 0 | 76.78 | 0 |
○Thread 2419594 | 3.44 | 0 | 0 | 18.9 | 0.87 | 0.15 | 0 | 0 | 0 | 4.22 | 0 | 0 | 75.87 | 0 |
○Thread 2419595 | 3.44 | 0 | 0 | 18.9 | 1.02 | 0 | 0 | 0 | 0 | 3.63 | 0 | 0 | 76.45 | 0 |
○Thread 2419596 | 3.44 | 0 | 0 | 17.27 | 1.16 | 0 | 0 | 0 | 0 | 3.77 | 0 | 0 | 77.79 | 0 |
○Thread 2419597 | 3.44 | 0 | 0 | 16.4 | 0.87 | 0 | 0 | 0 | 0 | 4.5 | 0 | 0 | 78.23 | 0 |
○Thread 2419598 | 3.44 | 0 | 0 | 18.17 | 0.29 | 0 | 0 | 0 | 0 | 4.51 | 0 | 0 | 77.03 | 0 |
▼run_5 | 2.65 | 0 | 0 | 21.19 | 0.78 | 0.06 | 0 | 0 | 0 | 5.23 | 0 | 0 | 72.74 | 0 |
▼Node skylake | 2.65 | 0 | 0 | 21.19 | 0.78 | 0.06 | 0 | 0 | 0 | 5.23 | 0 | 0 | 72.74 | 0 |
▼Process 2419605 | 2.65 | 0 | 0 | 21.19 | 0.78 | 0.06 | 0 | 0 | 0 | 5.23 | 0 | 0 | 72.74 | 0 |
○Thread 2419605 | 2.65 | 0 | 0 | 3.21 | 0.19 | 0.38 | 0 | 0 | 0 | 31.57 | 0 | 0 | 64.65 | 0 |
○Thread 2419658 | 2.27 | 0 | 0 | 22.91 | 0.44 | 0 | 0 | 0 | 0 | 4.41 | 0 | 0 | 72.25 | 0 |
○Thread 2419659 | 2.3 | 0 | 0 | 22.17 | 0.65 | 0 | 0 | 0 | 0 | 6.52 | 0 | 0 | 70.65 | 0 |
○Thread 2419660 | 2.3 | 0 | 0 | 23.7 | 1.52 | 0.22 | 0 | 0 | 0 | 7.61 | 0 | 0 | 66.96 | 0 |
○Thread 2419661 | 2.3 | 0 | 0 | 24.13 | 0.22 | 0 | 0 | 0 | 0 | 5 | 0 | 0 | 70.65 | 0 |
○Thread 2419662 | 2.3 | 0 | 0 | 23.04 | 0.43 | 0 | 0 | 0 | 0 | 5.87 | 0 | 0 | 70.65 | 0 |
○Thread 2419663 | 2.3 | 0 | 0 | 24.35 | 0.87 | 0.22 | 0 | 0 | 0 | 4.35 | 0 | 0 | 70.22 | 0 |
○Thread 2419664 | 2.3 | 0 | 0 | 22.34 | 0.65 | 0 | 0 | 0 | 0 | 2.39 | 0 | 0 | 74.62 | 0 |
○Thread 2419665 | 2.31 | 0 | 0 | 21.69 | 0.22 | 0 | 0 | 0 | 0 | 4.56 | 0 | 0 | 73.54 | 0 |
○Thread 2419666 | 2.31 | 0 | 0 | 21.65 | 1.3 | 0 | 0 | 0 | 0 | 3.03 | 0 | 0 | 74.03 | 0 |
○Thread 2419667 | 2.31 | 0 | 0 | 21.43 | 0.65 | 0 | 0 | 0 | 0 | 4.33 | 0 | 0 | 73.59 | 0 |
○Thread 2419668 | 2.31 | 0 | 0 | 23.16 | 0.22 | 0.22 | 0 | 0 | 0 | 3.68 | 0 | 0 | 72.73 | 0 |
○Thread 2419669 | 2.31 | 0 | 0 | 20.35 | 0.43 | 0 | 0 | 0 | 0 | 4.11 | 0 | 0 | 75.11 | 0 |
○Thread 2419670 | 2.31 | 0 | 0 | 21.43 | 1.08 | 0 | 0 | 0 | 0 | 2.81 | 0 | 0 | 74.68 | 0 |
○Thread 2419671 | 2.31 | 0 | 0 | 21.21 | 1.3 | 0.22 | 0 | 0 | 0 | 3.9 | 0 | 0 | 73.38 | 0 |
○Thread 2419672 | 2.31 | 0 | 0 | 22.29 | 0.43 | 0 | 0 | 0 | 0 | 3.25 | 0 | 0 | 74.03 | 0 |
○Thread 2419673 | 2.31 | 0 | 0 | 22.08 | 0.65 | 0 | 0 | 0 | 0 | 4.33 | 0 | 0 | 72.94 | 0 |
○Thread 2419674 | 2.31 | 0 | 0 | 21.43 | 1.3 | 0 | 0 | 0 | 0 | 3.03 | 0 | 0 | 74.24 | 0 |
○Thread 2419675 | 2.31 | 0 | 0 | 21 | 1.3 | 0 | 0 | 0 | 0 | 2.38 | 0 | 0 | 75.32 | 0 |
○Thread 2419676 | 2.31 | 0 | 0 | 21.43 | 0.87 | 0 | 0 | 0 | 0 | 4.55 | 0 | 0 | 73.16 | 0 |
○Thread 2419677 | 2.31 | 0 | 0 | 21 | 1.08 | 0 | 0 | 0 | 0 | 3.25 | 0 | 0 | 74.68 | 0 |
○Thread 2419678 | 2.31 | 0 | 0 | 21.86 | 0.87 | 0 | 0 | 0 | 0 | 3.03 | 0 | 0 | 74.24 | 0 |
○Thread 2419679 | 2.31 | 0 | 0 | 22.51 | 0.22 | 0 | 0 | 0 | 0 | 2.6 | 0 | 0 | 74.68 | 0 |
○Thread 2419680 | 2.31 | 0 | 0 | 21.65 | 1.3 | 0 | 0 | 0 | 0 | 3.9 | 0 | 0 | 73.16 | 0 |
○Thread 2419681 | 2.31 | 0 | 0 | 20.56 | 1.73 | 0 | 0 | 0 | 0 | 3.25 | 0 | 0 | 74.46 | 0 |
○Thread 2419682 | 2.31 | 0 | 0 | 21 | 0.43 | 0.22 | 0 | 0 | 0 | 4.55 | 0 | 0 | 73.81 | 0 |
▼run_6 | 1.87 | 0 | 0 | 30.92 | 0.73 | 0.11 | 0 | 0 | 0 | 7.2 | 0 | 0 | 61.04 | 0.01 |
▼Node skylake | 1.87 | 0 | 0 | 30.92 | 0.73 | 0.11 | 0 | 0 | 0 | 7.2 | 0 | 0 | 61.04 | 0.01 |
▼Process 2419687 | 1.87 | 0 | 0 | 30.92 | 0.73 | 0.11 | 0 | 0 | 0 | 7.2 | 0 | 0 | 61.04 | 0.01 |
○Thread 2419687 | 1.87 | 0 | 0 | 4.28 | 1.07 | 0.8 | 0 | 0 | 0 | 44.12 | 0 | 0 | 49.47 | 0.27 |
○Thread 2419740 | 1.5 | 0 | 0 | 34.11 | 1 | 0 | 0 | 0 | 0 | 5.69 | 0 | 0 | 59.2 | 0 |
○Thread 2419741 | 1.51 | 0 | 0 | 33.11 | 0.99 | 0 | 0 | 0 | 0 | 6.29 | 0 | 0 | 59.6 | 0 |
○Thread 2419742 | 1.5 | 0 | 0 | 32.56 | 0 | 0 | 0 | 0 | 0 | 7.97 | 0 | 0 | 59.47 | 0 |
○Thread 2419743 | 1.53 | 0 | 0 | 32.68 | 0.65 | 0 | 0 | 0 | 0 | 4.9 | 0 | 0 | 61.76 | 0 |
○Thread 2419744 | 1.52 | 0 | 0 | 31.91 | 0.66 | 0.33 | 0 | 0 | 0 | 6.25 | 0 | 0 | 60.86 | 0 |
○Thread 2419745 | 1.52 | 0 | 0 | 31.91 | 0.33 | 0 | 0 | 0 | 0 | 5.59 | 0 | 0 | 62.17 | 0 |
○Thread 2419746 | 1.49 | 0 | 0 | 32.21 | 0.34 | 0 | 0 | 0 | 0 | 9.06 | 0 | 0 | 58.39 | 0 |
○Thread 2419747 | 1.49 | 0 | 0 | 32.55 | 1.01 | 0 | 0 | 0 | 0 | 6.71 | 0 | 0 | 59.73 | 0 |
○Thread 2419748 | 1.52 | 0 | 0 | 31.02 | 0.33 | 0.33 | 0 | 0 | 0 | 5.94 | 0 | 0 | 62.38 | 0 |
○Thread 2419749 | 1.51 | 0 | 0 | 29.9 | 0.66 | 0 | 0 | 0 | 0 | 4.98 | 0 | 0 | 64.45 | 0 |
○Thread 2419750 | 1.52 | 0 | 0 | 31.68 | 0.66 | 0 | 0 | 0 | 0 | 4.62 | 0 | 0 | 63.04 | 0 |
○Thread 2419751 | 1.5 | 0 | 0 | 31 | 0.33 | 0 | 0 | 0 | 0 | 9.33 | 0 | 0 | 59.33 | 0 |
○Thread 2419752 | 1.52 | 0 | 0 | 30.69 | 1.32 | 0 | 0 | 0 | 0 | 4.95 | 0 | 0 | 63.04 | 0 |
○Thread 2419753 | 1.51 | 0 | 0 | 31.56 | 1 | 0 | 0 | 0 | 0 | 4.32 | 0 | 0 | 63.12 | 0 |
○Thread 2419754 | 1.49 | 0 | 0 | 31.54 | 0.34 | 0 | 0 | 0 | 0 | 6.04 | 0 | 0 | 62.08 | 0 |
○Thread 2419755 | 1.51 | 0 | 0 | 32.45 | 1.32 | 0.33 | 0 | 0 | 0 | 3.31 | 0 | 0 | 62.58 | 0 |
○Thread 2419756 | 1.5 | 0 | 0 | 32.33 | 0.67 | 0 | 0 | 0 | 0 | 9 | 0 | 0 | 58 | 0 |
○Thread 2419757 | 1.54 | 0 | 0 | 33.12 | 0 | 0 | 0 | 0 | 0 | 8.77 | 0 | 0 | 58.12 | 0 |
○Thread 2419758 | 1.5 | 0 | 0 | 31.67 | 0 | 0 | 0 | 0 | 0 | 6 | 0 | 0 | 62.33 | 0 |
○Thread 2419759 | 1.54 | 0 | 0 | 32.57 | 0.33 | 0 | 0 | 0 | 0 | 6.84 | 0 | 0 | 60.26 | 0 |
○Thread 2419760 | 1.5 | 0 | 0 | 30.67 | 0 | 0 | 0 | 0 | 0 | 9 | 0 | 0 | 60.33 | 0 |
○Thread 2419761 | 1.5 | 0 | 0 | 31.1 | 1.67 | 0 | 0 | 0 | 0 | 7.02 | 0 | 0 | 60.2 | 0 |
○Thread 2419762 | 1.54 | 0 | 0 | 31.49 | 0.65 | 0 | 0 | 0 | 0 | 8.12 | 0 | 0 | 59.74 | 0 |
○Thread 2419763 | 1.5 | 0 | 0 | 30.67 | 0 | 0 | 0 | 0 | 0 | 4 | 0 | 0 | 65.33 | 0 |
○Thread 2419764 | 1.49 | 0 | 0 | 29.63 | 1.35 | 0 | 0 | 0 | 0 | 5.72 | 0 | 0 | 63.3 | 0 |
○Thread 2419765 | 1.54 | 0 | 0 | 32.57 | 0 | 0.33 | 0 | 0 | 0 | 6.19 | 0 | 0 | 60.91 | 0 |
○Thread 2419766 | 1.54 | 0 | 0 | 32.25 | 0.98 | 0.33 | 0 | 0 | 0 | 4.23 | 0 | 0 | 62.21 | 0 |
○Thread 2419767 | 1.54 | 0 | 0 | 32.14 | 0.97 | 0 | 0 | 0 | 0 | 7.14 | 0 | 0 | 59.74 | 0 |
○Thread 2419768 | 1.53 | 0 | 0 | 31.48 | 0.66 | 0 | 0 | 0 | 0 | 4.59 | 0 | 0 | 63.28 | 0 |
○Thread 2419769 | 1.52 | 0 | 0 | 31.02 | 0.66 | 0.33 | 0 | 0 | 0 | 6.6 | 0 | 0 | 61.39 | 0 |
○Thread 2419770 | 1.51 | 0 | 0 | 31.46 | 0.66 | 0.33 | 0 | 0 | 0 | 8.28 | 0 | 0 | 59.27 | 0 |
○Thread 2419771 | 1.55 | 0 | 0 | 32.69 | 0.32 | 0 | 0 | 0 | 0 | 7.44 | 0 | 0 | 59.55 | 0 |
○Thread 2419772 | 1.51 | 0 | 0 | 30.79 | 0.66 | 0 | 0 | 0 | 0 | 2.98 | 0 | 0 | 65.56 | 0 |
○Thread 2419773 | 1.51 | 0 | 0 | 31.13 | 0.99 | 0 | 0 | 0 | 0 | 5.63 | 0 | 0 | 62.25 | 0 |
○Thread 2419774 | 1.55 | 0 | 0 | 30.74 | 1.29 | 0.32 | 0 | 0 | 0 | 3.56 | 0 | 0 | 64.08 | 0 |
○Thread 2419775 | 1.5 | 0 | 0 | 29.67 | 2 | 0 | 0 | 0 | 0 | 4.67 | 0 | 0 | 63.67 | 0 |
○Thread 2419776 | 1.53 | 0 | 0 | 31.37 | 1.31 | 0.33 | 0 | 0 | 0 | 6.21 | 0 | 0 | 60.78 | 0 |
○Thread 2419777 | 1.49 | 0 | 0 | 29.19 | 1.34 | 1.34 | 0 | 0 | 0 | 5.7 | 0 | 0 | 62.42 | 0 |
○Thread 2419778 | 1.49 | 0 | 0 | 30.64 | 0.67 | 0.34 | 0 | 0 | 0 | 9.09 | 0 | 0 | 59.26 | 0 |
○Thread 2419779 | 1.5 | 0 | 0 | 30.67 | 1.33 | 0 | 0 | 0 | 0 | 5 | 0 | 0 | 63 | 0 |
○Thread 2419780 | 1.53 | 0 | 0 | 32.57 | 0.98 | 0 | 0 | 0 | 0 | 2.61 | 0 | 0 | 63.84 | 0 |
○Thread 2419781 | 1.5 | 0 | 0 | 32.89 | 0.33 | 0 | 0 | 0 | 0 | 8.97 | 0 | 0 | 57.81 | 0 |
○Thread 2419782 | 1.52 | 0 | 0 | 32.46 | 0.66 | 0 | 0 | 0 | 0 | 9.51 | 0 | 0 | 57.38 | 0 |
○Thread 2419783 | 1.55 | 0 | 0 | 33.01 | 0.97 | 0 | 0 | 0 | 0 | 6.47 | 0 | 0 | 59.55 | 0 |
○Thread 2419784 | 1.51 | 0 | 0 | 30.9 | 0.33 | 0 | 0 | 0 | 0 | 7.64 | 0 | 0 | 61.13 | 0 |
○Thread 2419785 | 1.51 | 0 | 0 | 30.46 | 0.99 | 0 | 0 | 0 | 0 | 5.3 | 0 | 0 | 63.25 | 0 |
○Thread 2419786 | 1.51 | 0 | 0 | 31.46 | 0.99 | 0 | 0 | 0 | 0 | 7.28 | 0 | 0 | 60.26 | 0 |
○Thread 2419787 | 1.5 | 0 | 0 | 30.33 | 1 | 0 | 0 | 0 | 0 | 9.33 | 0 | 0 | 59.33 | 0 |
○Thread 2419788 | 1.55 | 0 | 0 | 32.36 | 0 | 0 | 0 | 0 | 0 | 5.5 | 0 | 0 | 62.14 | 0 |
○Thread 2419789 | 1.52 | 0 | 0 | 30.69 | 0.66 | 0.33 | 0 | 0 | 0 | 6.93 | 0 | 0 | 61.39 | 0 |
○Thread 2419790 | 1.54 | 0 | 0 | 30.29 | 0.33 | 0 | 0 | 0 | 0 | 4.56 | 0 | 0 | 64.82 | 0 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | OMP (%) | Math (%) | System (%) | Memory (%) | libqmckl.so.0.0.0 (%) |
---|---|---|---|---|---|---|
run_0 | 1 | 0 | 0.81 | 0.06 | 5.54 | 93.6 |
run_1 | 2 | 2.38 | 0.94 | 0.04 | 5.97 | 90.66 |
run_2 | 4 | 10.51 | 0.87 | 0.03 | 6.18 | 82.4 |
run_3 | 8 | 10.62 | 0.82 | 0.1 | 7.08 | 81.37 |
run_4 | 16 | 16.8 | 0.77 | 0.1 | 5.87 | 76.46 |
run_5 | 26 | 21.19 | 0.78 | 0.06 | 5.23 | 72.74 |
run_6 | 52 | 30.92 | 0.73 | 0.11 | 7.2 | 61.04 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | OMP (s) | Math (s) | System (s) | Memory (s) | libqmckl.so.0.0.0 (s) |
---|---|---|---|---|---|---|---|
run_0 | 1 | 52.48 | 0 | 0.43 | 0.03 | 2.91 | 49.11 |
run_1 | 2 | 24.14 | 0.57 | 0.23 | 0.01 | 1.44 | 21.89 |
run_2 | 4 | 13.3 | 1.4 | 0.12 | 0 | 0.82 | 10.96 |
run_3 | 8 | 6.75 | 0.72 | 0.06 | 0.01 | 0.48 | 5.49 |
run_4 | 16 | 3.72 | 0.62 | 0.03 | 0 | 0.22 | 2.84 |
run_5 | 26 | 2.65 | 0.56 | 0.02 | 0 | 0.14 | 1.93 |
run_6 | 52 | 1.87 | 0.58 | 0.01 | 0 | 0.13 | 1.14 |
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
run_0 | 1 | 1 |
run_1 | 2 | 1.02 |
run_2 | 4 | 0.89 |
run_3 | 8 | 0.81 |
run_4 | 16 | 0.55 |
run_5 | 26 | 0.48 |
run_6 | 52 | 0.29 |
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_1 | 2 | 0 | 0 | 0 | 0 | 0.03 | 0 | 0 | 0.06 | 0 | 97.5 | 2.41 |
run_2 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.08 | 5.1 | 84.22 | 10.6 |
run_3 | 8 | 0 | 0 | 0.08 | 0.1 | 0 | 0 | 0 | 6.04 | 1.59 | 81.47 | 10.72 |
run_4 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4.9 | 78.05 | 17.05 |
run_5 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4.89 | 4.34 | 69.36 | 21.41 |
run_6 | 52 | 0 | 0 | 0 | 6.51 | 3.94 | 0 | 0 | 1.94 | 55.83 | 0.67 | 31.11 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2.44 | 97.56 | 0 |
run_2 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10.6 | 89.4 | 0 |
run_3 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10.73 | 89.28 | 0 |
run_4 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 17.05 | 82.95 | 0 |
run_5 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 21.42 | 78.59 | 0 |
run_6 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 31.16 | 68.89 | 0 |
PrOMPT - Coverage per Parallel Efficiency at OpenMP Region Level
Detailed Coverage per Parallel Efficiency
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 90.67 | 9.33 |
run_2 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6.29 | 0 | 72.92 | 20.78 |
run_3 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6.04 | 0 | 65.83 | 28.14 |
run_4 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 5.3 | 0 | 0 | 52.31 | 42.39 |
run_5 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 4.33 | 0 | 0 | 41.6 | 54.07 |
run_6 | 52 | 0 | 0 | 0 | 5.27 | 0 | 1 | 0 | 0 | 0 | 27.35 | 66.38 |
PrOMPT - Coverage per Parallel Speedup at OpenMP Region Level
Detailed Coverage per Parallel Speedup
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 90.67 | 9.33 |
run_2 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 79.22 | 20.78 |
run_3 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 71.86 | 28.14 |
run_4 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 57.61 | 42.39 |
run_5 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 45.93 | 54.07 |
run_6 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 33.62 | 66.38 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | run_0 | run_1 | run_2 | run_3 | run_4 | run_5 | run_6 |
---|---|---|---|---|---|---|---|
/home/kcamus/comparative/qmckl/qmckl_bench/build/libqmckl/__install/lib/libqmckl.so.0.0.0 | |||||||
/home/kcamus/comparative/qmckl/trexio/_install/lib/libtrexio.so.0.0.0 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libifcoremt.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libifport.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libimf.so | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libintlc.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libiomp5.so | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libsvml.so | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_core.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_intel_lp64.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_sequential.so.2 | |||||||
/opt/other/hdf5/gcc/seq/1.10.2/lib/libhdf5.so.101.1.0 | |||||||
/usr/lib/ld-linux-x86-64.so.2 | |||||||
/usr/lib/libc.so.6 | |||||||
/usr/lib/libdl.so.2 | |||||||
/usr/lib/libgcc_s.so.1 | |||||||
/usr/lib/libm.so.6 | |||||||
/usr/lib/libpthread.so.0 | |||||||
/usr/lib/librt.so.1 | |||||||
/usr/lib/libz.so.1.3 |