Detailed Application Categorization |
Detailed Function Times |
Scalability - Coverage per Category |
Scalability - Time per Category |
Scalability - Efficiency |
Function Based Profile |
Scalability - Coverage per Parallel Efficiency |
Scalability - Coverage per Parallel Speedup |
PrOMPT - Coverage per Parallel Efficiency |
PrOMPT - Coverage per Parallel Speedup |
Libraries |
Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | libqmckl.so (%) | libqmckl.so.0 (%) | libqmckl.so.0.0.0 (%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
▼m1o1 | 19.25 | 0 | 0 | 0 | 2 | 0.03 | 0 | 0.03 | 0 | 4.91 | 0 | 0 | 93.04 | 0 |
▼Node skylake | 19.25 | 0 | 0 | 0 | 2 | 0.03 | 0 | 0.03 | 0 | 4.91 | 0 | 0 | 93.04 | 0 |
▼Process 236169 | 19.25 | 0 | 0 | 0 | 2 | 0.03 | 0 | 0.03 | 0 | 4.91 | 0 | 0 | 93.04 | 0 |
○Thread 236169 | 19.25 | 0 | 0 | 0 | 2 | 0.03 | 0 | 0.03 | 0 | 4.91 | 0 | 0 | 93.04 | 0 |
▼m1o2 | 9.41 | 0 | 0 | 0.93 | 2.29 | 0.05 | 0 | 0.03 | 0 | 5.54 | 0 | 0 | 91.13 | 0.03 |
▼Node skylake | 9.41 | 0 | 0 | 0.93 | 2.29 | 0.05 | 0 | 0.03 | 0 | 5.54 | 0 | 0 | 91.13 | 0.03 |
▼Process 236229 | 9.41 | 0 | 0 | 0.93 | 2.29 | 0.05 | 0 | 0.03 | 0 | 5.54 | 0 | 0 | 91.13 | 0.03 |
○Thread 236229 | 9.36 | 0 | 0 | 0.05 | 2.4 | 0.05 | 0 | 0.05 | 0 | 6.3 | 0 | 0 | 91.08 | 0.05 |
○Thread 236282 | 9.41 | 0 | 0 | 1.81 | 2.18 | 0.05 | 0 | 0 | 0 | 4.78 | 0 | 0 | 91.18 | 0 |
▼m1o4 | 5.02 | 0 | 0 | 6.08 | 2.33 | 0.1 | 0 | 0.05 | 0 | 5.08 | 0 | 0 | 86.35 | 0 |
▼Node skylake | 5.02 | 0 | 0 | 6.08 | 2.33 | 0.1 | 0 | 0.05 | 0 | 5.08 | 0 | 0 | 86.35 | 0 |
▼Process 236287 | 5.02 | 0 | 0 | 6.08 | 2.33 | 0.1 | 0 | 0.05 | 0 | 5.08 | 0 | 0 | 86.35 | 0 |
○Thread 236287 | 4.97 | 0 | 0 | 4.13 | 2.62 | 0.2 | 0 | 0.2 | 0 | 5.64 | 0 | 0 | 87.21 | 0 |
○Thread 236340 | 4.98 | 0 | 0 | 3.62 | 2.31 | 0.1 | 0 | 0 | 0 | 6.13 | 0 | 0 | 87.84 | 0 |
○Thread 236341 | 5.02 | 0 | 0 | 8.18 | 2.59 | 0 | 0 | 0 | 0 | 3.99 | 0 | 0 | 85.24 | 0 |
○Thread 236342 | 5.02 | 0 | 0 | 8.37 | 1.79 | 0.1 | 0 | 0 | 0 | 4.59 | 0 | 0 | 85.14 | 0 |
▼m1o8 | 2.57 | 0 | 0 | 8.47 | 1.78 | 0.1 | 0 | 0.02 | 0 | 5.32 | 0 | 0 | 84.3 | 0 |
▼Node skylake | 2.57 | 0 | 0 | 8.47 | 1.78 | 0.1 | 0 | 0.02 | 0 | 5.32 | 0 | 0 | 84.3 | 0 |
▼Process 236347 | 2.57 | 0 | 0 | 8.47 | 1.78 | 0.1 | 0 | 0.02 | 0 | 5.32 | 0 | 0 | 84.3 | 0 |
○Thread 236347 | 2.53 | 0 | 0 | 3.37 | 0.79 | 0.59 | 0 | 0.2 | 0 | 7.72 | 0 | 0 | 87.33 | 0 |
○Thread 236400 | 2.57 | 0 | 0 | 10.12 | 1.56 | 0 | 0 | 0 | 0 | 4.09 | 0 | 0 | 84.24 | 0 |
○Thread 236401 | 2.56 | 0 | 0 | 9.98 | 1.76 | 0 | 0 | 0 | 0 | 4.31 | 0 | 0 | 83.95 | 0 |
○Thread 236402 | 2.55 | 0 | 0 | 9 | 2.15 | 0 | 0 | 0 | 0 | 4.89 | 0 | 0 | 83.95 | 0 |
○Thread 236403 | 2.57 | 0 | 0 | 9.92 | 1.56 | 0 | 0 | 0 | 0 | 4.09 | 0 | 0 | 84.44 | 0 |
○Thread 236404 | 2.57 | 0 | 0 | 9.73 | 0.97 | 0.19 | 0 | 0 | 0 | 5.06 | 0 | 0 | 84.05 | 0 |
○Thread 236405 | 2.56 | 0 | 0 | 8.59 | 2.73 | 0 | 0 | 0 | 0 | 4.88 | 0 | 0 | 83.79 | 0 |
○Thread 236406 | 2.57 | 0 | 0 | 7 | 2.72 | 0 | 0 | 0 | 0 | 7.59 | 0 | 0 | 82.68 | 0 |
▼m1o16 | 1.35 | 0 | 0 | 12.88 | 2.06 | 0.07 | 0 | 0.02 | 0 | 5.59 | 0 | 0 | 79.35 | 0.02 |
▼Node skylake | 1.35 | 0 | 0 | 12.88 | 2.06 | 0.07 | 0 | 0.02 | 0 | 5.59 | 0 | 0 | 79.35 | 0.02 |
▼Process 236413 | 1.35 | 0 | 0 | 12.88 | 2.06 | 0.07 | 0 | 0.02 | 0 | 5.59 | 0 | 0 | 79.35 | 0.02 |
○Thread 236413 | 1.31 | 0 | 0 | 1.53 | 1.92 | 0.77 | 0 | 0.38 | 0 | 14.18 | 0 | 0 | 80.84 | 0.38 |
○Thread 236466 | 1.35 | 0 | 0 | 13.33 | 1.85 | 0 | 0 | 0 | 0 | 5.19 | 0 | 0 | 79.63 | 0 |
○Thread 236467 | 1.35 | 0 | 0 | 13.33 | 2.22 | 0 | 0 | 0 | 0 | 5.56 | 0 | 0 | 78.89 | 0 |
○Thread 236468 | 1.35 | 0 | 0 | 13.33 | 2.22 | 0 | 0 | 0 | 0 | 4.44 | 0 | 0 | 80 | 0 |
○Thread 236469 | 1.35 | 0 | 0 | 14.07 | 0.74 | 0 | 0 | 0 | 0 | 5.56 | 0 | 0 | 79.63 | 0 |
○Thread 236470 | 1.35 | 0 | 0 | 14.81 | 2.22 | 0 | 0 | 0 | 0 | 3.33 | 0 | 0 | 79.63 | 0 |
○Thread 236471 | 1.35 | 0 | 0 | 13.33 | 1.85 | 0 | 0 | 0 | 0 | 4.07 | 0 | 0 | 80.74 | 0 |
○Thread 236472 | 1.35 | 0 | 0 | 13.33 | 2.22 | 0 | 0 | 0 | 0 | 4.81 | 0 | 0 | 79.63 | 0 |
○Thread 236473 | 1.35 | 0 | 0 | 13.7 | 2.59 | 0 | 0 | 0 | 0 | 4.44 | 0 | 0 | 79.26 | 0 |
○Thread 236474 | 1.35 | 0 | 0 | 13.7 | 1.85 | 0 | 0 | 0 | 0 | 3.33 | 0 | 0 | 81.11 | 0 |
○Thread 236475 | 1.34 | 0 | 0 | 13.01 | 1.49 | 0 | 0 | 0 | 0 | 5.58 | 0 | 0 | 79.93 | 0 |
○Thread 236476 | 1.35 | 0 | 0 | 12.59 | 2.96 | 0 | 0 | 0 | 0 | 6.3 | 0 | 0 | 78.15 | 0 |
○Thread 236477 | 1.35 | 0 | 0 | 14.44 | 3.33 | 0 | 0 | 0 | 0 | 6.3 | 0 | 0 | 75.93 | 0 |
○Thread 236478 | 1.35 | 0 | 0 | 14.44 | 1.11 | 0.37 | 0 | 0 | 0 | 6.67 | 0 | 0 | 77.41 | 0 |
○Thread 236479 | 1.35 | 0 | 0 | 13.33 | 1.48 | 0 | 0 | 0 | 0 | 4.07 | 0 | 0 | 81.11 | 0 |
○Thread 236480 | 1.35 | 0 | 0 | 13.33 | 2.96 | 0 | 0 | 0 | 0 | 5.93 | 0 | 0 | 77.78 | 0 |
▼m1o26 | 0.91 | 0 | 0 | 19 | 1.83 | 0.08 | 0 | 0.02 | 0 | 5.31 | 0 | 0 | 73.76 | 0 |
▼Node skylake | 0.91 | 0 | 0 | 19 | 1.83 | 0.08 | 0 | 0.02 | 0 | 5.31 | 0 | 0 | 73.76 | 0 |
▼Process 236485 | 0.91 | 0 | 0 | 19 | 1.83 | 0.08 | 0 | 0.02 | 0 | 5.31 | 0 | 0 | 73.76 | 0 |
○Thread 236485 | 0.87 | 0 | 0 | 0.57 | 2.3 | 0.57 | 0 | 0.57 | 0 | 16.09 | 0 | 0 | 79.89 | 0 |
○Thread 236538 | 0.91 | 0 | 0 | 19.67 | 1.09 | 0 | 0 | 0 | 0 | 6.56 | 0 | 0 | 72.68 | 0 |
○Thread 236539 | 0.91 | 0 | 0 | 21.31 | 1.64 | 0 | 0 | 0 | 0 | 3.83 | 0 | 0 | 73.22 | 0 |
○Thread 236540 | 0.91 | 0 | 0 | 19.67 | 1.09 | 0 | 0 | 0 | 0 | 3.83 | 0 | 0 | 75.41 | 0 |
○Thread 236541 | 0.91 | 0 | 0 | 18.03 | 0.55 | 0 | 0 | 0 | 0 | 6.01 | 0 | 0 | 75.41 | 0 |
○Thread 236542 | 0.91 | 0 | 0 | 19.13 | 2.73 | 0 | 0 | 0 | 0 | 4.92 | 0 | 0 | 73.22 | 0 |
○Thread 236543 | 0.91 | 0 | 0 | 20.88 | 3.85 | 0 | 0 | 0 | 0 | 5.49 | 0 | 0 | 69.78 | 0 |
○Thread 236544 | 0.91 | 0 | 0 | 20.22 | 1.64 | 0 | 0 | 0 | 0 | 4.92 | 0 | 0 | 73.22 | 0 |
○Thread 236545 | 0.91 | 0 | 0 | 20.22 | 2.19 | 0 | 0 | 0 | 0 | 2.19 | 0 | 0 | 75.41 | 0 |
○Thread 236546 | 0.91 | 0 | 0 | 19.67 | 1.09 | 0 | 0 | 0 | 0 | 3.83 | 0 | 0 | 75.41 | 0 |
○Thread 236547 | 0.91 | 0 | 0 | 19.67 | 1.09 | 0.55 | 0 | 0 | 0 | 5.46 | 0 | 0 | 73.22 | 0 |
○Thread 236548 | 0.91 | 0 | 0 | 20.22 | 1.64 | 0 | 0 | 0 | 0 | 4.92 | 0 | 0 | 73.22 | 0 |
○Thread 236549 | 0.91 | 0 | 0 | 19.67 | 2.73 | 0 | 0 | 0 | 0 | 4.92 | 0 | 0 | 72.68 | 0 |
○Thread 236550 | 0.91 | 0 | 0 | 19.67 | 2.19 | 0.55 | 0 | 0 | 0 | 5.46 | 0 | 0 | 72.13 | 0 |
○Thread 236551 | 0.91 | 0 | 0 | 19.67 | 2.19 | 0 | 0 | 0 | 0 | 2.73 | 0 | 0 | 75.41 | 0 |
○Thread 236552 | 0.91 | 0 | 0 | 19.13 | 3.83 | 0 | 0 | 0 | 0 | 4.92 | 0 | 0 | 72.13 | 0 |
○Thread 236553 | 0.91 | 0 | 0 | 19.67 | 0 | 0 | 0 | 0 | 0 | 4.92 | 0 | 0 | 75.41 | 0 |
○Thread 236554 | 0.91 | 0 | 0 | 19.13 | 2.73 | 0 | 0 | 0 | 0 | 4.92 | 0 | 0 | 73.22 | 0 |
○Thread 236555 | 0.91 | 0 | 0 | 20.22 | 2.73 | 0 | 0 | 0 | 0 | 6.01 | 0 | 0 | 71.04 | 0 |
○Thread 236556 | 0.91 | 0 | 0 | 19.67 | 0.55 | 0.55 | 0 | 0 | 0 | 4.37 | 0 | 0 | 74.86 | 0 |
○Thread 236557 | 0.91 | 0 | 0 | 20.22 | 0 | 0 | 0 | 0 | 0 | 6.56 | 0 | 0 | 73.22 | 0 |
○Thread 236558 | 0.91 | 0 | 0 | 19.67 | 1.09 | 0 | 0 | 0 | 0 | 4.92 | 0 | 0 | 74.32 | 0 |
○Thread 236559 | 0.91 | 0 | 0 | 18.58 | 2.19 | 0 | 0 | 0 | 0 | 4.92 | 0 | 0 | 74.32 | 0 |
○Thread 236560 | 0.91 | 0 | 0 | 19.67 | 2.19 | 0 | 0 | 0 | 0 | 4.37 | 0 | 0 | 73.77 | 0 |
○Thread 236561 | 0.91 | 0 | 0 | 20.22 | 3.28 | 0 | 0 | 0 | 0 | 6.56 | 0 | 0 | 69.95 | 0 |
○Thread 236562 | 0.91 | 0 | 0 | 18.58 | 1.09 | 0 | 0 | 0 | 0 | 4.92 | 0 | 0 | 75.41 | 0 |
▼m1o52 | 0.63 | 0 | 0 | 30.77 | 1.3 | 0.07 | 0 | 0.02 | 0 | 7.6 | 0 | 0 | 60.25 | 0 |
▼Node skylake | 0.63 | 0 | 0 | 30.77 | 1.3 | 0.07 | 0 | 0.02 | 0 | 7.6 | 0 | 0 | 60.25 | 0 |
▼Process 236567 | 0.63 | 0 | 0 | 30.77 | 1.3 | 0.07 | 0 | 0.02 | 0 | 7.6 | 0 | 0 | 60.25 | 0 |
○Thread 236567 | 0.59 | 0 | 0 | 11.02 | 0.85 | 0.85 | 0 | 0.85 | 0 | 22.88 | 0 | 0 | 63.56 | 0 |
○Thread 236620 | 0.63 | 0 | 0 | 37.3 | 0 | 0 | 0 | 0 | 0 | 6.35 | 0 | 0 | 56.35 | 0 |
○Thread 236621 | 0.63 | 0 | 0 | 37.8 | 0.79 | 0 | 0 | 0 | 0 | 7.09 | 0 | 0 | 54.33 | 0 |
○Thread 236622 | 0.62 | 0 | 0 | 34.68 | 2.42 | 0.81 | 0 | 0 | 0 | 8.06 | 0 | 0 | 54.03 | 0 |
○Thread 236623 | 0.62 | 0 | 0 | 37.6 | 0.8 | 0 | 0 | 0 | 0 | 10.4 | 0 | 0 | 51.2 | 0 |
○Thread 236624 | 0.63 | 0 | 0 | 39.37 | 1.57 | 0 | 0 | 0 | 0 | 3.94 | 0 | 0 | 55.12 | 0 |
○Thread 236625 | 0.63 | 0 | 0 | 32.28 | 0.79 | 0.79 | 0 | 0 | 0 | 6.3 | 0 | 0 | 59.84 | 0 |
○Thread 236626 | 0.54 | 0 | 0 | 21.3 | 0 | 0 | 0 | 0 | 0 | 6.48 | 0 | 0 | 72.22 | 0 |
○Thread 236627 | 0.63 | 0 | 0 | 35.43 | 0.79 | 0 | 0 | 0 | 0 | 9.45 | 0 | 0 | 54.33 | 0 |
○Thread 236628 | 0.63 | 0 | 0 | 36.22 | 0.79 | 0 | 0 | 0 | 0 | 6.3 | 0 | 0 | 56.69 | 0 |
○Thread 236629 | 0.63 | 0 | 0 | 37.3 | 1.59 | 0 | 0 | 0 | 0 | 4.76 | 0 | 0 | 56.35 | 0 |
○Thread 236630 | 0.63 | 0 | 0 | 37.8 | 0 | 0 | 0 | 0 | 0 | 7.09 | 0 | 0 | 55.12 | 0 |
○Thread 236631 | 0.55 | 0 | 0 | 23.64 | 0.91 | 0 | 0 | 0 | 0 | 10 | 0 | 0 | 65.45 | 0 |
○Thread 236632 | 0.56 | 0 | 0 | 27.68 | 1.79 | 0 | 0 | 0 | 0 | 7.14 | 0 | 0 | 63.39 | 0 |
○Thread 236633 | 0.54 | 0 | 0 | 26.61 | 2.75 | 0 | 0 | 0 | 0 | 3.67 | 0 | 0 | 66.97 | 0 |
○Thread 236634 | 0.63 | 0 | 0 | 40.16 | 1.57 | 0 | 0 | 0 | 0 | 7.87 | 0 | 0 | 50.39 | 0 |
○Thread 236635 | 0.55 | 0 | 0 | 27.03 | 2.7 | 0 | 0 | 0 | 0 | 8.11 | 0 | 0 | 62.16 | 0 |
○Thread 236636 | 0.55 | 0 | 0 | 24.32 | 2.7 | 0 | 0 | 0 | 0 | 6.31 | 0 | 0 | 66.67 | 0 |
○Thread 236637 | 0.54 | 0 | 0 | 24.77 | 1.83 | 0 | 0 | 0 | 0 | 7.34 | 0 | 0 | 66.06 | 0 |
○Thread 236638 | 0.62 | 0 | 0 | 33.87 | 0.81 | 0 | 0 | 0 | 0 | 8.06 | 0 | 0 | 57.26 | 0 |
○Thread 236639 | 0.55 | 0 | 0 | 25.45 | 1.82 | 0 | 0 | 0 | 0 | 8.18 | 0 | 0 | 64.55 | 0 |
○Thread 236640 | 0.54 | 0 | 0 | 23.85 | 3.67 | 0 | 0 | 0 | 0 | 4.59 | 0 | 0 | 67.89 | 0 |
○Thread 236641 | 0.54 | 0 | 0 | 24.77 | 0.92 | 0 | 0 | 0 | 0 | 7.34 | 0 | 0 | 66.97 | 0 |
○Thread 236642 | 0.53 | 0 | 0 | 23.36 | 0 | 0 | 0 | 0 | 0 | 7.48 | 0 | 0 | 69.16 | 0 |
○Thread 236643 | 0.63 | 0 | 0 | 35.43 | 0 | 0 | 0 | 0 | 0 | 11.81 | 0 | 0 | 52.76 | 0 |
○Thread 236644 | 0.61 | 0 | 0 | 33.33 | 1.63 | 0 | 0 | 0 | 0 | 8.13 | 0 | 0 | 56.91 | 0 |
○Thread 236645 | 0.62 | 0 | 0 | 36.8 | 0 | 0 | 0 | 0 | 0 | 8 | 0 | 0 | 55.2 | 0 |
○Thread 236646 | 0.61 | 0 | 0 | 34.15 | 0.81 | 0 | 0 | 0 | 0 | 8.94 | 0 | 0 | 56.1 | 0 |
○Thread 236647 | 0.56 | 0 | 0 | 25.89 | 3.57 | 0 | 0 | 0 | 0 | 8.04 | 0 | 0 | 62.5 | 0 |
○Thread 236648 | 0.62 | 0 | 0 | 34.68 | 0 | 0 | 0 | 0 | 0 | 8.06 | 0 | 0 | 57.26 | 0 |
○Thread 236649 | 0.63 | 0 | 0 | 36.51 | 0.79 | 0 | 0 | 0 | 0 | 6.35 | 0 | 0 | 56.35 | 0 |
○Thread 236650 | 0.55 | 0 | 0 | 22.73 | 3.64 | 0 | 0 | 0 | 0 | 7.27 | 0 | 0 | 66.36 | 0 |
○Thread 236651 | 0.62 | 0 | 0 | 36 | 1.6 | 0 | 0 | 0 | 0 | 6.4 | 0 | 0 | 56 | 0 |
○Thread 236652 | 0.56 | 0 | 0 | 24.11 | 0 | 0 | 0 | 0 | 0 | 8.93 | 0 | 0 | 66.96 | 0 |
○Thread 236653 | 0.63 | 0 | 0 | 36.51 | 0.79 | 0 | 0 | 0 | 0 | 3.97 | 0 | 0 | 58.73 | 0 |
○Thread 236654 | 0.54 | 0 | 0 | 24.07 | 0.93 | 0 | 0 | 0 | 0 | 9.26 | 0 | 0 | 65.74 | 0 |
○Thread 236655 | 0.54 | 0 | 0 | 24.77 | 0.92 | 0 | 0 | 0 | 0 | 10.09 | 0 | 0 | 64.22 | 0 |
○Thread 236656 | 0.55 | 0 | 0 | 25.45 | 0.91 | 0 | 0 | 0 | 0 | 9.09 | 0 | 0 | 64.55 | 0 |
○Thread 236657 | 0.55 | 0 | 0 | 24.55 | 0.91 | 0 | 0 | 0 | 0 | 6.36 | 0 | 0 | 68.18 | 0 |
○Thread 236658 | 0.53 | 0 | 0 | 25.23 | 1.87 | 0 | 0 | 0 | 0 | 4.67 | 0 | 0 | 68.22 | 0 |
○Thread 236659 | 0.63 | 0 | 0 | 37.3 | 1.59 | 0 | 0 | 0 | 0 | 3.17 | 0 | 0 | 57.94 | 0 |
○Thread 236660 | 0.53 | 0 | 0 | 23.36 | 1.87 | 0 | 0 | 0 | 0 | 8.41 | 0 | 0 | 66.36 | 0 |
○Thread 236661 | 0.53 | 0 | 0 | 24.3 | 2.8 | 0 | 0 | 0 | 0 | 6.54 | 0 | 0 | 66.36 | 0 |
○Thread 236662 | 0.62 | 0 | 0 | 36.29 | 0.81 | 0 | 0 | 0 | 0 | 4.84 | 0 | 0 | 58.06 | 0 |
○Thread 236663 | 0.54 | 0 | 0 | 25.93 | 0 | 0 | 0 | 0 | 0 | 8.33 | 0 | 0 | 65.74 | 0 |
○Thread 236664 | 0.62 | 0 | 0 | 33.06 | 0.81 | 0 | 0 | 0 | 0 | 6.45 | 0 | 0 | 59.68 | 0 |
○Thread 236665 | 0.56 | 0 | 0 | 27.68 | 0.89 | 0 | 0 | 0 | 0 | 8.93 | 0 | 0 | 62.5 | 0 |
○Thread 236666 | 0.55 | 0 | 0 | 23.64 | 0.91 | 0 | 0 | 0 | 0 | 10.91 | 0 | 0 | 64.55 | 0 |
○Thread 236667 | 0.63 | 0 | 0 | 38.1 | 1.59 | 0 | 0 | 0 | 0 | 5.56 | 0 | 0 | 54.76 | 0 |
○Thread 236668 | 0.63 | 0 | 0 | 38.58 | 1.57 | 0.79 | 0 | 0 | 0 | 7.09 | 0 | 0 | 51.97 | 0 |
○Thread 236669 | 0.61 | 0 | 0 | 34.96 | 2.44 | 0 | 0 | 0 | 0 | 8.13 | 0 | 0 | 54.47 | 0 |
○Thread 236670 | 0.62 | 0 | 0 | 32.8 | 3.2 | 0 | 0 | 0 | 0 | 7.2 | 0 | 0 | 56.8 | 0 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | OMP (%) | Math (%) | System (%) | IO (%) | Memory (%) | libqmckl.so.0.0.0 (%) | Others (%) |
---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 2 | 0.03 | 0.03 | 4.91 | 93.04 | 0 |
m1o2 | 2 | 0.93 | 2.29 | 0.05 | 0.03 | 5.54 | 91.13 | 0.03 |
m1o4 | 4 | 6.08 | 2.33 | 0.1 | 0.05 | 5.08 | 86.35 | 0 |
m1o8 | 8 | 8.47 | 1.78 | 0.1 | 0.02 | 5.32 | 84.3 | 0 |
m1o16 | 16 | 12.88 | 2.06 | 0.07 | 0.02 | 5.59 | 79.35 | 0.02 |
m1o26 | 26 | 19 | 1.83 | 0.08 | 0.02 | 5.31 | 73.76 | 0 |
m1o52 | 52 | 30.77 | 1.3 | 0.07 | 0.02 | 7.6 | 60.25 | 0 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | OMP (s) | Math (s) | Memory (s) | libqmckl.so.0.0.0 (s) |
---|---|---|---|---|---|---|
m1o1 | 1 | 19.25 | 0 | 0.39 | 0.95 | 17.91 |
m1o2 | 2 | 9.41 | 0.09 | 0.22 | 0.52 | 8.58 |
m1o4 | 4 | 5.02 | 0.31 | 0.12 | 0.26 | 4.33 |
m1o8 | 8 | 2.57 | 0.22 | 0.05 | 0.14 | 2.17 |
m1o16 | 16 | 1.35 | 0.17 | 0.03 | 0.08 | 1.07 |
m1o26 | 26 | 0.91 | 0.17 | 0.02 | 0.05 | 0.67 |
m1o52 | 52 | 0.63 | 0.19 | 0.01 | 0.05 | 0.38 |
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
m1o1 | 1 | 1 |
m1o2 | 2 | 1.01 |
m1o4 | 4 | 0.93 |
m1o8 | 8 | 0.88 |
m1o16 | 16 | 0.78 |
m1o26 | 26 | 0.66 |
m1o52 | 52 | 0.43 |
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0.08 | 0 | 0 | 0 | 0.08 | 0 | 0 | 2.16 | 0.37 | 96.28 | 1.03 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2.13 | 0.35 | 91.17 | 6.35 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5.28 | 85.88 | 8.84 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9.26 | 0 | 77.03 | 13.71 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 1.64 | 0 | 4.89 | 73.77 | 19.7 |
m1o52 | 52 | 0 | 0 | 0 | 7.29 | 0 | 0 | 3.65 | 0 | 21.02 | 36.77 | 31.27 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0.08 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1.14 | 98.81 | 0 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6.39 | 93.65 | 0 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8.82 | 91.16 | 0.02 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13.71 | 86.29 | 0 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19.7 | 80.3 | 0 |
m1o52 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 31.31 | 68.73 | 0 |
PrOMPT - Coverage per Parallel Efficiency at OpenMP Region Level
Detailed Coverage per Parallel Efficiency
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 96.61 | 3.39 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 93.69 | 6.31 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 88.24 | 11.76 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 | 70.05 | 20.95 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9.05 | 0 | 60.75 | 30.2 |
m1o52 | 52 | 0 | 0 | 0 | 12.04 | 0 | 0 | 0 | 0 | 31.49 | 10.96 | 45.51 |
PrOMPT - Coverage per Parallel Speedup at OpenMP Region Level
Detailed Coverage per Parallel Speedup
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 96.61 | 3.39 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 93.69 | 6.31 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 88.24 | 11.76 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 79.05 | 20.95 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 69.8 | 30.2 |
m1o52 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 54.49 | 45.51 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | m1o1 | m1o2 | m1o4 | m1o8 | m1o16 | m1o26 | m1o52 |
---|---|---|---|---|---|---|---|
/home/kcamus/comparative/qmckl/qmckl_bench/build23feb/libqmckl/__install/lib/libqmckl.so.0.0.0 | |||||||
/home/kcamus/comparative/qmckl/qmckl_bench/build23feb/libtrexio/__install/lib/libtrexio.so.0.0.0 | |||||||
/home/kcamus/intel/oneapi/compiler/2022.2.0/linux/compiler/lib/intel64_lin/libifcoremt.so.5 | |||||||
/home/kcamus/intel/oneapi/compiler/2022.2.0/linux/compiler/lib/intel64_lin/libifport.so.5 | |||||||
/home/kcamus/intel/oneapi/compiler/2022.2.0/linux/compiler/lib/intel64_lin/libimf.so | |||||||
/home/kcamus/intel/oneapi/compiler/2022.2.0/linux/compiler/lib/intel64_lin/libintlc.so.5 | |||||||
/home/kcamus/intel/oneapi/compiler/2022.2.0/linux/compiler/lib/intel64_lin/libiomp5.so | |||||||
/home/kcamus/intel/oneapi/compiler/2022.2.0/linux/compiler/lib/intel64_lin/libirng.so | |||||||
/home/kcamus/intel/oneapi/compiler/2022.2.0/linux/compiler/lib/intel64_lin/libsvml.so | |||||||
/home/kcamus/intel/oneapi/mkl/2022.2.0/lib/intel64/libmkl_core.so.2 | |||||||
/home/kcamus/intel/oneapi/mkl/2022.2.0/lib/intel64/libmkl_intel_lp64.so.2 | |||||||
/home/kcamus/intel/oneapi/mkl/2022.2.0/lib/intel64/libmkl_sequential.so.2 | |||||||
/opt/other/hdf5/icc_2017.4/seq/1.10.2/lib/libhdf5.so.101.1.0 | |||||||
/usr/lib/ld-linux-x86-64.so.2 | |||||||
/usr/lib/libc.so.6 | |||||||
/usr/lib/libdl.so.2 | |||||||
/usr/lib/libgcc_s.so.1 | |||||||
/usr/lib/libm.so.6 | |||||||
/usr/lib/libpthread.so.0 | |||||||
/usr/lib/librt.so.1 | |||||||
/usr/lib/libz.so.1.3 |