Detailed Application Categorization
ID | Time (s) | Binary (%) | MPI (%) | OMP (%) | Math (%) | System (%) | Pthread (%) | IO (%) | String (%) | Memory (%) | libqmckl.so (%) | libqmckl.so.0 (%) | libqmckl.so.0.0.0 (%) | Others (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
▼m1o1 | 42.69 | 0 | 0 | 0 | 0.78 | 0.02 | 0 | 0 | 0 | 4.38 | 0 | 0 | 94.81 | 0 |
▼Node skylake | 42.69 | 0 | 0 | 0 | 0.78 | 0.02 | 0 | 0 | 0 | 4.38 | 0 | 0 | 94.81 | 0 |
▼Process 2522308 | 42.69 | 0 | 0 | 0 | 0.78 | 0.02 | 0 | 0 | 0 | 4.38 | 0 | 0 | 94.81 | 0 |
○Thread 2522308 | 42.69 | 0 | 0 | 0 | 0.78 | 0.02 | 0 | 0 | 0 | 4.38 | 0 | 0 | 94.81 | 0 |
▼m1o2 | 18.63 | 0 | 0 | 0.92 | 1.03 | 0.05 | 0 | 0 | 0 | 4.49 | 0 | 0 | 93.51 | 0 |
▼Node skylake | 18.63 | 0 | 0 | 0.92 | 1.03 | 0.05 | 0 | 0 | 0 | 4.49 | 0 | 0 | 93.51 | 0 |
▼Process 2522369 | 18.63 | 0 | 0 | 0.92 | 1.03 | 0.05 | 0 | 0 | 0 | 4.49 | 0 | 0 | 93.51 | 0 |
○Thread 2522369 | 18.63 | 0 | 0 | 1.61 | 1.18 | 0.05 | 0 | 0 | 0 | 4.27 | 0 | 0 | 92.89 | 0 |
○Thread 2522423 | 18.42 | 0 | 0 | 0.22 | 0.87 | 0.05 | 0 | 0 | 0 | 4.72 | 0 | 0 | 94.14 | 0 |
▼m1o4 | 9.9 | 0 | 0 | 7.52 | 1.08 | 0.03 | 0 | 0 | 0 | 5.02 | 0 | 0 | 86.36 | 0 |
▼Node skylake | 9.9 | 0 | 0 | 7.52 | 1.08 | 0.03 | 0 | 0 | 0 | 5.02 | 0 | 0 | 86.36 | 0 |
▼Process 2522430 | 9.9 | 0 | 0 | 7.52 | 1.08 | 0.03 | 0 | 0 | 0 | 5.02 | 0 | 0 | 86.36 | 0 |
○Thread 2522430 | 9.9 | 0 | 0 | 7.17 | 1.31 | 0 | 0 | 0 | 0 | 4.9 | 0 | 0 | 86.62 | 0 |
○Thread 2522484 | 9.51 | 0 | 0 | 7.68 | 1 | 0.05 | 0 | 0 | 0 | 4.36 | 0 | 0 | 86.91 | 0 |
○Thread 2522485 | 9.89 | 0 | 0 | 8.64 | 1.06 | 0.05 | 0 | 0 | 0 | 5.31 | 0 | 0 | 84.94 | 0 |
○Thread 2522486 | 9.57 | 0 | 0 | 6.58 | 0.94 | 0 | 0 | 0 | 0 | 5.48 | 0 | 0 | 87 | 0 |
▼m1o8 | 4.93 | 0 | 0 | 5.83 | 0.89 | 0.03 | 0 | 0 | 0 | 6.05 | 0 | 0 | 87.21 | 0 |
▼Node skylake | 4.93 | 0 | 0 | 5.83 | 0.89 | 0.03 | 0 | 0 | 0 | 6.05 | 0 | 0 | 87.21 | 0 |
▼Process 2522493 | 4.93 | 0 | 0 | 5.83 | 0.89 | 0.03 | 0 | 0 | 0 | 6.05 | 0 | 0 | 87.21 | 0 |
○Thread 2522493 | 4.93 | 0 | 0 | 7.1 | 0.81 | 0.1 | 0 | 0 | 0 | 4.97 | 0 | 0 | 87.02 | 0 |
○Thread 2522547 | 4.81 | 0 | 0 | 1.04 | 1.04 | 0 | 0 | 0 | 0 | 7.59 | 0 | 0 | 90.33 | 0 |
○Thread 2522548 | 4.82 | 0 | 0 | 4.25 | 0.83 | 0.1 | 0 | 0 | 0 | 7.68 | 0 | 0 | 87.14 | 0 |
○Thread 2522549 | 4.82 | 0 | 0 | 6.02 | 0.52 | 0 | 0 | 0 | 0 | 7.37 | 0 | 0 | 86.09 | 0 |
○Thread 2522550 | 4.81 | 0 | 0 | 8.42 | 0.83 | 0 | 0 | 0 | 0 | 6.03 | 0 | 0 | 84.72 | 0 |
○Thread 2522551 | 4.92 | 0 | 0 | 6.2 | 0.71 | 0 | 0 | 0 | 0 | 5.18 | 0 | 0 | 87.91 | 0 |
○Thread 2522552 | 4.91 | 0 | 0 | 8.45 | 1.12 | 0 | 0 | 0 | 0 | 4.79 | 0 | 0 | 85.64 | 0 |
○Thread 2522553 | 4.92 | 0 | 0 | 5.08 | 1.22 | 0 | 0 | 0 | 0 | 4.88 | 0 | 0 | 88.82 | 0 |
▼m1o16 | 2.51 | 0 | 0 | 6.41 | 0.89 | 0.04 | 0 | 0 | 0 | 5.15 | 0 | 0 | 87.51 | 0 |
▼Node skylake | 2.51 | 0 | 0 | 6.41 | 0.89 | 0.04 | 0 | 0 | 0 | 5.15 | 0 | 0 | 87.51 | 0 |
▼Process 2522558 | 2.51 | 0 | 0 | 6.41 | 0.89 | 0.04 | 0 | 0 | 0 | 5.15 | 0 | 0 | 87.51 | 0 |
○Thread 2522558 | 2.51 | 0 | 0 | 6.18 | 0.6 | 0 | 0 | 0 | 0 | 4.78 | 0 | 0 | 88.45 | 0 |
○Thread 2522612 | 2.42 | 0 | 0 | 4.14 | 1.66 | 0 | 0 | 0 | 0 | 4.97 | 0 | 0 | 89.23 | 0 |
○Thread 2522613 | 2.42 | 0 | 0 | 5.17 | 0.83 | 0.21 | 0 | 0 | 0 | 8.06 | 0 | 0 | 85.74 | 0 |
○Thread 2522614 | 2.48 | 0 | 0 | 5.25 | 0.61 | 0 | 0 | 0 | 0 | 6.87 | 0 | 0 | 87.27 | 0 |
○Thread 2522615 | 2.47 | 0 | 0 | 7.29 | 0.81 | 0 | 0 | 0 | 0 | 4.86 | 0 | 0 | 87.04 | 0 |
○Thread 2522616 | 2.48 | 0 | 0 | 5.25 | 0.61 | 0 | 0 | 0 | 0 | 5.66 | 0 | 0 | 88.48 | 0 |
○Thread 2522617 | 2.48 | 0 | 0 | 5.44 | 0.81 | 0 | 0 | 0 | 0 | 5.44 | 0 | 0 | 88.31 | 0 |
○Thread 2522618 | 2.48 | 0 | 0 | 4.23 | 1.01 | 0 | 0 | 0 | 0 | 4.84 | 0 | 0 | 89.92 | 0 |
○Thread 2522619 | 2.49 | 0 | 0 | 7.63 | 0.4 | 0 | 0 | 0 | 0 | 4.22 | 0 | 0 | 87.75 | 0 |
○Thread 2522620 | 2.5 | 0 | 0 | 7.4 | 0.8 | 0 | 0 | 0 | 0 | 3.8 | 0 | 0 | 88 | 0 |
○Thread 2522621 | 2.5 | 0 | 0 | 6.8 | 0.6 | 0 | 0 | 0 | 0 | 5.2 | 0 | 0 | 87.4 | 0 |
○Thread 2522622 | 2.5 | 0 | 0 | 8.82 | 1.2 | 0 | 0 | 0 | 0 | 5.81 | 0 | 0 | 84.17 | 0 |
○Thread 2522623 | 2.5 | 0 | 0 | 8.62 | 1.6 | 0 | 0 | 0 | 0 | 4.21 | 0 | 0 | 85.57 | 0 |
○Thread 2522624 | 2.5 | 0 | 0 | 6.4 | 0.4 | 0.2 | 0 | 0 | 0 | 5.2 | 0 | 0 | 87.8 | 0 |
○Thread 2522625 | 2.5 | 0 | 0 | 5.6 | 1.8 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 89.6 | 0 |
○Thread 2522626 | 2.5 | 0 | 0 | 8.22 | 0.6 | 0.2 | 0 | 0 | 0 | 5.61 | 0 | 0 | 85.37 | 0 |
▼m1o26 | 1.59 | 0 | 0 | 7.07 | 1.1 | 0.06 | 0 | 0 | 0 | 4.59 | 0 | 0 | 87.19 | 0 |
▼Node skylake | 1.59 | 0 | 0 | 7.07 | 1.1 | 0.06 | 0 | 0 | 0 | 4.59 | 0 | 0 | 87.19 | 0 |
▼Process 2522633 | 1.59 | 0 | 0 | 7.07 | 1.1 | 0.06 | 0 | 0 | 0 | 4.59 | 0 | 0 | 87.19 | 0 |
○Thread 2522633 | 1.59 | 0 | 0 | 5.33 | 1.25 | 0 | 0 | 0 | 0 | 6.27 | 0 | 0 | 87.15 | 0 |
○Thread 2522687 | 1.54 | 0 | 0 | 8.41 | 1.29 | 0 | 0 | 0 | 0 | 4.53 | 0 | 0 | 85.76 | 0 |
○Thread 2522688 | 1.55 | 0 | 0 | 9.03 | 1.94 | 0 | 0 | 0 | 0 | 4.52 | 0 | 0 | 84.52 | 0 |
○Thread 2522689 | 1.58 | 0 | 0 | 13.65 | 1.27 | 0.32 | 0 | 0 | 0 | 5.71 | 0 | 0 | 79.05 | 0 |
○Thread 2522690 | 1.57 | 0 | 0 | 13.06 | 0.64 | 0 | 0 | 0 | 0 | 4.78 | 0 | 0 | 81.53 | 0 |
○Thread 2522691 | 1.58 | 0 | 0 | 8.86 | 2.53 | 0 | 0 | 0 | 0 | 5.7 | 0 | 0 | 82.91 | 0 |
○Thread 2522692 | 1.58 | 0 | 0 | 5.99 | 1.58 | 0 | 0 | 0 | 0 | 4.42 | 0 | 0 | 88.01 | 0 |
○Thread 2522693 | 1.58 | 0 | 0 | 5.99 | 0.95 | 0 | 0 | 0 | 0 | 1.89 | 0 | 0 | 91.17 | 0 |
○Thread 2522694 | 1.58 | 0 | 0 | 5.99 | 1.26 | 0 | 0 | 0 | 0 | 5.36 | 0 | 0 | 87.38 | 0 |
○Thread 2522695 | 1.59 | 0 | 0 | 6.6 | 1.89 | 0 | 0 | 0 | 0 | 3.46 | 0 | 0 | 88.05 | 0 |
○Thread 2522696 | 1.58 | 0 | 0 | 6.31 | 1.26 | 0.32 | 0 | 0 | 0 | 6.62 | 0 | 0 | 85.49 | 0 |
○Thread 2522697 | 1.58 | 0 | 0 | 5.68 | 1.26 | 0 | 0 | 0 | 0 | 3.15 | 0 | 0 | 89.91 | 0 |
○Thread 2522698 | 1.59 | 0 | 0 | 5.35 | 0.63 | 0 | 0 | 0 | 0 | 5.03 | 0 | 0 | 88.99 | 0 |
○Thread 2522699 | 1.58 | 0 | 0 | 5.99 | 1.26 | 0.32 | 0 | 0 | 0 | 5.36 | 0 | 0 | 87.07 | 0 |
○Thread 2522700 | 1.58 | 0 | 0 | 5.99 | 0.95 | 0 | 0 | 0 | 0 | 3.47 | 0 | 0 | 89.59 | 0 |
○Thread 2522701 | 1.58 | 0 | 0 | 7.57 | 0.63 | 0 | 0 | 0 | 0 | 4.73 | 0 | 0 | 87.07 | 0 |
○Thread 2522702 | 1.58 | 0 | 0 | 6.94 | 0.63 | 0 | 0 | 0 | 0 | 4.42 | 0 | 0 | 88.01 | 0 |
○Thread 2522703 | 1.59 | 0 | 0 | 6.6 | 1.26 | 0 | 0 | 0 | 0 | 2.83 | 0 | 0 | 89.31 | 0 |
○Thread 2522704 | 1.58 | 0 | 0 | 5.36 | 1.26 | 0 | 0 | 0 | 0 | 3.79 | 0 | 0 | 89.59 | 0 |
○Thread 2522705 | 1.59 | 0 | 0 | 5.97 | 0.63 | 0.31 | 0 | 0 | 0 | 3.77 | 0 | 0 | 89.31 | 0 |
○Thread 2522706 | 1.58 | 0 | 0 | 5.68 | 0.95 | 0 | 0 | 0 | 0 | 6.94 | 0 | 0 | 86.44 | 0 |
○Thread 2522707 | 1.58 | 0 | 0 | 6.31 | 0 | 0 | 0 | 0 | 0 | 4.1 | 0 | 0 | 89.59 | 0 |
○Thread 2522708 | 1.58 | 0 | 0 | 6.62 | 0.63 | 0.32 | 0 | 0 | 0 | 4.73 | 0 | 0 | 87.7 | 0 |
○Thread 2522709 | 1.58 | 0 | 0 | 5.99 | 1.26 | 0 | 0 | 0 | 0 | 4.42 | 0 | 0 | 88.33 | 0 |
○Thread 2522710 | 1.58 | 0 | 0 | 5.05 | 0.95 | 0 | 0 | 0 | 0 | 3.15 | 0 | 0 | 90.85 | 0 |
○Thread 2522711 | 1.54 | 0 | 0 | 9.71 | 0.32 | 0 | 0 | 0 | 0 | 6.15 | 0 | 0 | 83.82 | 0 |
▼m1o52 | 0.92 | 0 | 0 | 5.58 | 0.83 | 0.11 | 0 | 0 | 0 | 8.14 | 0 | 0 | 85.34 | 0 |
▼Node skylake | 0.92 | 0 | 0 | 5.58 | 0.83 | 0.11 | 0 | 0 | 0 | 8.14 | 0 | 0 | 85.34 | 0 |
▼Process 2522716 | 0.92 | 0 | 0 | 5.58 | 0.83 | 0.11 | 0 | 0 | 0 | 8.14 | 0 | 0 | 85.34 | 0 |
○Thread 2522716 | 0.92 | 0 | 0 | 2.72 | 1.63 | 0.54 | 0 | 0 | 0 | 7.07 | 0 | 0 | 88.04 | 0 |
○Thread 2522770 | 0.9 | 0 | 0 | 7.22 | 1.11 | 0 | 0 | 0 | 0 | 11.67 | 0 | 0 | 80 | 0 |
○Thread 2522771 | 0.92 | 0 | 0 | 6.01 | 1.64 | 0 | 0 | 0 | 0 | 4.37 | 0 | 0 | 87.98 | 0 |
○Thread 2522772 | 0.9 | 0 | 0 | 6.7 | 2.23 | 0 | 0 | 0 | 0 | 9.5 | 0 | 0 | 81.56 | 0 |
○Thread 2522773 | 0.9 | 0 | 0 | 2.78 | 0.56 | 0.56 | 0 | 0 | 0 | 10 | 0 | 0 | 86.11 | 0 |
○Thread 2522774 | 0.91 | 0 | 0 | 6.04 | 2.2 | 0 | 0 | 0 | 0 | 4.4 | 0 | 0 | 87.36 | 0 |
○Thread 2522775 | 0.9 | 0 | 0 | 5.59 | 0 | 0 | 0 | 0 | 0 | 10.06 | 0 | 0 | 84.36 | 0 |
○Thread 2522776 | 0.9 | 0 | 0 | 5.56 | 0.56 | 0 | 0 | 0 | 0 | 10.56 | 0 | 0 | 83.33 | 0 |
○Thread 2522777 | 0.92 | 0 | 0 | 6.01 | 1.09 | 0 | 0 | 0 | 0 | 3.83 | 0 | 0 | 89.07 | 0 |
○Thread 2522778 | 0.92 | 0 | 0 | 5.46 | 0.55 | 0 | 0 | 0 | 0 | 4.37 | 0 | 0 | 89.62 | 0 |
○Thread 2522779 | 0.91 | 0 | 0 | 7.73 | 0 | 0 | 0 | 0 | 0 | 12.71 | 0 | 0 | 79.56 | 0 |
○Thread 2522780 | 0.91 | 0 | 0 | 5.52 | 1.1 | 0 | 0 | 0 | 0 | 10.5 | 0 | 0 | 82.87 | 0 |
○Thread 2522781 | 0.92 | 0 | 0 | 4.92 | 1.09 | 0 | 0 | 0 | 0 | 4.37 | 0 | 0 | 89.62 | 0 |
○Thread 2522782 | 0.91 | 0 | 0 | 3.85 | 0.55 | 0 | 0 | 0 | 0 | 9.34 | 0 | 0 | 86.26 | 0 |
○Thread 2522783 | 0.91 | 0 | 0 | 6.04 | 0 | 0 | 0 | 0 | 0 | 8.24 | 0 | 0 | 85.71 | 0 |
○Thread 2522784 | 0.91 | 0 | 0 | 4.95 | 1.65 | 0 | 0 | 0 | 0 | 4.4 | 0 | 0 | 89.01 | 0 |
○Thread 2522785 | 0.91 | 0 | 0 | 8.24 | 0.55 | 0.55 | 0 | 0 | 0 | 10.44 | 0 | 0 | 80.22 | 0 |
○Thread 2522786 | 0.92 | 0 | 0 | 6.56 | 0 | 0.55 | 0 | 0 | 0 | 8.74 | 0 | 0 | 84.15 | 0 |
○Thread 2522787 | 0.92 | 0 | 0 | 6.01 | 1.09 | 0 | 0 | 0 | 0 | 6.01 | 0 | 0 | 86.89 | 0 |
○Thread 2522788 | 0.91 | 0 | 0 | 6.59 | 2.75 | 0 | 0 | 0 | 0 | 8.24 | 0 | 0 | 82.42 | 0 |
○Thread 2522789 | 0.92 | 0 | 0 | 6.56 | 0 | 0 | 0 | 0 | 0 | 10.93 | 0 | 0 | 82.51 | 0 |
○Thread 2522790 | 0.92 | 0 | 0 | 4.37 | 0.55 | 0 | 0 | 0 | 0 | 9.29 | 0 | 0 | 85.79 | 0 |
○Thread 2522791 | 0.91 | 0 | 0 | 3.85 | 1.1 | 0.55 | 0 | 0 | 0 | 4.4 | 0 | 0 | 90.11 | 0 |
○Thread 2522792 | 0.91 | 0 | 0 | 6.04 | 0.55 | 0 | 0 | 0 | 0 | 7.69 | 0 | 0 | 85.71 | 0 |
○Thread 2522793 | 0.92 | 0 | 0 | 5.46 | 1.09 | 0 | 0 | 0 | 0 | 10.93 | 0 | 0 | 82.51 | 0 |
○Thread 2522794 | 0.91 | 0 | 0 | 4.95 | 0.55 | 0 | 0 | 0 | 0 | 4.4 | 0 | 0 | 90.11 | 0 |
○Thread 2522795 | 0.91 | 0 | 0 | 4.42 | 0 | 0 | 0 | 0 | 0 | 9.39 | 0 | 0 | 86.19 | 0 |
○Thread 2522796 | 0.92 | 0 | 0 | 7.65 | 1.64 | 0 | 0 | 0 | 0 | 10.38 | 0 | 0 | 80.33 | 0 |
○Thread 2522797 | 0.92 | 0 | 0 | 6.56 | 1.09 | 0 | 0 | 0 | 0 | 7.65 | 0 | 0 | 84.7 | 0 |
○Thread 2522798 | 0.91 | 0 | 0 | 8.24 | 0 | 0 | 0 | 0 | 0 | 6.59 | 0 | 0 | 85.16 | 0 |
○Thread 2522799 | 0.92 | 0 | 0 | 4.92 | 0.55 | 0.55 | 0 | 0 | 0 | 6.01 | 0 | 0 | 87.98 | 0 |
○Thread 2522800 | 0.92 | 0 | 0 | 4.37 | 0 | 0 | 0 | 0 | 0 | 7.1 | 0 | 0 | 88.52 | 0 |
○Thread 2522801 | 0.91 | 0 | 0 | 5.49 | 1.65 | 0.55 | 0 | 0 | 0 | 6.04 | 0 | 0 | 86.26 | 0 |
○Thread 2522802 | 0.92 | 0 | 0 | 4.37 | 0 | 0 | 0 | 0 | 0 | 5.46 | 0 | 0 | 90.16 | 0 |
○Thread 2522803 | 0.92 | 0 | 0 | 6.01 | 0.55 | 0.55 | 0 | 0 | 0 | 12.02 | 0 | 0 | 80.87 | 0 |
○Thread 2522804 | 0.92 | 0 | 0 | 4.92 | 0 | 0 | 0 | 0 | 0 | 12.02 | 0 | 0 | 83.06 | 0 |
○Thread 2522805 | 0.91 | 0 | 0 | 4.95 | 1.65 | 0 | 0 | 0 | 0 | 8.24 | 0 | 0 | 85.16 | 0 |
○Thread 2522806 | 0.91 | 0 | 0 | 8.79 | 1.1 | 0 | 0 | 0 | 0 | 10.99 | 0 | 0 | 79.12 | 0 |
○Thread 2522807 | 0.9 | 0 | 0 | 2.79 | 0 | 0 | 0 | 0 | 0 | 8.38 | 0 | 0 | 88.83 | 0 |
○Thread 2522808 | 0.92 | 0 | 0 | 3.83 | 0 | 0 | 0 | 0 | 0 | 10.93 | 0 | 0 | 85.25 | 0 |
○Thread 2522809 | 0.91 | 0 | 0 | 4.95 | 1.65 | 0.55 | 0 | 0 | 0 | 8.79 | 0 | 0 | 84.07 | 0 |
○Thread 2522810 | 0.91 | 0 | 0 | 4.42 | 1.1 | 0 | 0 | 0 | 0 | 12.71 | 0 | 0 | 81.77 | 0 |
○Thread 2522811 | 0.92 | 0 | 0 | 8.2 | 0 | 0 | 0 | 0 | 0 | 7.1 | 0 | 0 | 84.7 | 0 |
○Thread 2522812 | 0.92 | 0 | 0 | 6.56 | 0 | 0.55 | 0 | 0 | 0 | 6.01 | 0 | 0 | 86.89 | 0 |
○Thread 2522813 | 0.91 | 0 | 0 | 6.59 | 0.55 | 0 | 0 | 0 | 0 | 6.04 | 0 | 0 | 86.81 | 0 |
○Thread 2522814 | 0.92 | 0 | 0 | 5.46 | 1.09 | 0 | 0 | 0 | 0 | 8.2 | 0 | 0 | 85.25 | 0 |
○Thread 2522815 | 0.92 | 0 | 0 | 5.46 | 0.55 | 0 | 0 | 0 | 0 | 4.92 | 0 | 0 | 89.07 | 0 |
○Thread 2522816 | 0.91 | 0 | 0 | 7.73 | 1.66 | 0 | 0 | 0 | 0 | 11.05 | 0 | 0 | 79.56 | 0 |
○Thread 2522817 | 0.92 | 0 | 0 | 4.37 | 0.55 | 0 | 0 | 0 | 0 | 3.83 | 0 | 0 | 91.26 | 0 |
○Thread 2522818 | 0.91 | 0 | 0 | 4.95 | 0.55 | 0 | 0 | 0 | 0 | 9.34 | 0 | 0 | 85.16 | 0 |
○Thread 2522819 | 0.91 | 0 | 0 | 4.37 | 1.09 | 0 | 0 | 0 | 0 | 9.29 | 0 | 0 | 85.25 | 0 |
○Thread 2522820 | 0.91 | 0 | 0 | 3.85 | 2.2 | 0 | 0 | 0 | 0 | 8.79 | 0 | 0 | 85.16 | 0 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | OMP (%) | Math (%) | System (%) | Memory (%) | libqmckl.so.0.0.0 (%) |
---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0.78 | 0.02 | 4.38 | 94.81 |
m1o2 | 2 | 0.92 | 1.03 | 0.05 | 4.49 | 93.51 |
m1o4 | 4 | 7.52 | 1.08 | 0.03 | 5.02 | 86.36 |
m1o8 | 8 | 5.83 | 0.89 | 0.03 | 6.05 | 87.21 |
m1o16 | 16 | 6.41 | 0.89 | 0.04 | 5.15 | 87.51 |
m1o26 | 26 | 7.07 | 1.1 | 0.06 | 4.59 | 87.19 |
m1o52 | 52 | 5.58 | 0.83 | 0.11 | 8.14 | 85.34 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | OMP (s) | Math (s) | Memory (s) | libqmckl.so.0.0.0 (s) |
---|---|---|---|---|---|---|
m1o1 | 1 | 42.69 | 0 | 0.33 | 1.87 | 40.47 |
m1o2 | 2 | 18.63 | 0.17 | 0.19 | 0.84 | 17.42 |
m1o4 | 4 | 9.9 | 0.74 | 0.11 | 0.5 | 8.55 |
m1o8 | 8 | 4.93 | 0.29 | 0.04 | 0.3 | 4.3 |
m1o16 | 16 | 2.51 | 0.16 | 0.02 | 0.13 | 2.2 |
m1o26 | 26 | 1.59 | 0.11 | 0.02 | 0.07 | 1.39 |
m1o52 | 52 | 0.92 | 0.05 | 0.01 | 0.07 | 0.79 |
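The per-category times in this table appear to be the run's total time multiplied by the matching coverage percentage from the Coverage per Category table. The sketch below checks that reading against the reported numbers. The dictionary layout and the `category_times` helper are illustrative names, not part of the profiler's output, and the values are copied by hand from the two tables.

```python
# Total time per run (s), copied from the "Total Time (s)" column.
total_time = {
    "m1o1": 42.69, "m1o2": 18.63, "m1o4": 9.9, "m1o8": 4.93,
    "m1o16": 2.51, "m1o26": 1.59, "m1o52": 0.92,
}

# Coverage per category (%), copied from "Detailed Coverage per Category".
# Order of each tuple: OMP, Math, Memory, libqmckl.so.0.0.0
categories = ("OMP", "Math", "Memory", "libqmckl.so.0.0.0")
coverage = {
    "m1o1": (0, 0.78, 4.38, 94.81),
    "m1o2": (0.92, 1.03, 4.49, 93.51),
    "m1o4": (7.52, 1.08, 5.02, 86.36),
    "m1o8": (5.83, 0.89, 6.05, 87.21),
    "m1o16": (6.41, 0.89, 5.15, 87.51),
    "m1o26": (7.07, 1.1, 4.59, 87.19),
    "m1o52": (5.58, 0.83, 8.14, 85.34),
}


def category_times(run):
    """Return seconds per category, assuming time = total_time * coverage / 100."""
    return {
        name: round(total_time[run] * pct / 100, 2)
        for name, pct in zip(categories, coverage[run])
    }


if __name__ == "__main__":
    for run in total_time:
        print(run, category_times(run))
```

Under this assumption the computed values agree with the reported columns to two decimals, for example 40.47 s in libqmckl.so.0.0.0 for m1o1 and 0.79 s for m1o52.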
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
m1o1 | 1 | 1 |
m1o2 | 2 | 1.07 |
m1o4 | 4 | 0.92 |
m1o8 | 8 | 0.8 |
m1o16 | 16 | 0.62 |
m1o26 | 26 | 0.48 |
m1o52 | 52 | 0.29 |
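Assuming the Efficiency column follows the conventional definition (efficiency = speedup / number of threads, so that 1 is ideal), the speedup implied by each row can be recovered by multiplying efficiency by the thread count. The report does not state which time basis it uses for this column, so the sketch below only rearranges the reported numbers; `implied_speedup` is an illustrative helper name.

```python
# Efficiency per thread count, copied from the "Detailed Efficiency" table.
efficiency = {1: 1.0, 2: 1.07, 4: 0.92, 8: 0.8, 16: 0.62, 26: 0.48, 52: 0.29}


def implied_speedup(threads):
    """Speedup implied by the reported efficiency, assuming
    efficiency = speedup / threads (the conventional definition)."""
    return efficiency[threads] * threads


if __name__ == "__main__":
    for n in efficiency:
        print(f"{n:>2} threads: efficiency {efficiency[n]:.2f}, "
              f"implied speedup {implied_speedup(n):.2f}")
```

Read this way, the 2-thread run is slightly superlinear (efficiency above 1), while the 52-thread run corresponds to a speedup of roughly 15 over the single-thread run.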
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0.15 | 0 | 0 | 0 | 2.19 | 96.73 | 0.93 |
m1o4 | 4 | 0 | 0 | 0.06 | 0 | 0.08 | 0 | 0 | 0 | 0.94 | 91.34 | 7.58 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6.01 | 1.95 | 86.09 | 5.95 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7.22 | 86.21 | 6.57 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0.99 | 0 | 0 | 0 | 91.7 | 7.31 |
m1o52 | 52 | 0 | 0 | 0 | 0 | 5.36 | 8.88 | 2.4 | 0 | 0 | 77.58 | 5.78 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1.07 | 98.92 | 0.01 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7.63 | 92.36 | 0.01 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5.95 | 94.05 | 0 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6.57 | 93.43 | 0 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7.3 | 92.69 | 0.01 |
m1o52 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5.77 | 94.22 | 0.01 |
PrOMPT - Coverage per Parallel Efficiency at OpenMP Region Level
Detailed Coverage per Parallel Efficiency
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 90.69 | 9.31 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5.53 | 79.84 | 14.63 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5.68 | 0 | 66.71 | 27.61 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 5.46 | 0 | 0 | 51.88 | 42.67 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 4.84 | 0 | 0 | 1.05 | 39.52 | 54.59 |
m1o52 | 52 | 0 | 0 | 0 | 5.01 | 0 | 1.02 | 0 | 0 | 0 | 27.99 | 65.98 |
PrOMPT - Coverage per Parallel Speedup at OpenMP Region Level
Detailed Coverage per Parallel Speedup
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 90.69 | 9.31 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 85.37 | 14.63 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 72.39 | 27.61 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 57.33 | 42.67 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 45.41 | 54.59 |
m1o52 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 34.02 | 65.98 |
Libraries
- green cell: the library was found while profiling the run
- red cell: the library does not appear in the run's profile
Library | m1o1 | m1o2 | m1o4 | m1o8 | m1o16 | m1o26 | m1o52 |
---|---|---|---|---|---|---|---|
/home/kcamus/comparative/qmckl/qmckl_bench/build/libqmckl/__install/lib/libqmckl.so.0.0.0 | |||||||
/home/kcamus/comparative/qmckl/trexio/_install/lib/libtrexio.so.0.0.0 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libifcoremt.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libifport.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libimf.so | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libintlc.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libiomp5.so | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libsvml.so | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_core.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_intel_lp64.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_sequential.so.2 | |||||||
/opt/other/hdf5/gcc/seq/1.10.2/lib/libhdf5.so.101.1.0 | |||||||
/usr/lib/ld-linux-x86-64.so.2 | |||||||
/usr/lib/libc.so.6 | |||||||
/usr/lib/libdl.so.2 | |||||||
/usr/lib/libgcc_s.so.1 | |||||||
/usr/lib/libm.so.6 | |||||||
/usr/lib/libpthread.so.0 | |||||||
/usr/lib/librt.so.1 | |||||||
/usr/lib/libz.so.1.3 |