Detailed Application Categorization
| ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | TBB(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | Others(%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ▼run_1_thread | 101.84 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Node otterfall | 101.84 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Process 1506383 | 101.84 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 0 (TID 1506383) | 101.84 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼run_2_threads | 55.86 | 95.61 | 0.00 | 4.37 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Node otterfall | 55.86 | 95.61 | 0.00 | 4.37 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Process 1506424 | 55.86 | 95.61 | 0.00 | 4.37 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 0 (TID 1506424) | 55.86 | 99.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 1 (TID 1506446) | 53.18 | 91.01 | 0.00 | 8.97 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼run_4_threads | 33.17 | 88.42 | 0.00 | 11.56 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Node otterfall | 33.17 | 88.42 | 0.00 | 11.56 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Process 1506467 | 33.17 | 88.42 | 0.00 | 11.56 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 0 (TID 1506467) | 33.17 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 1 (TID 1506489) | 30.25 | 84.18 | 0.00 | 15.78 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 2 (TID 1506490) | 30.25 | 84.25 | 0.00 | 15.75 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 3 (TID 1506491) | 30.25 | 84.13 | 0.00 | 15.83 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼run_8_threads | 21.27 | 76.44 | 0.00 | 23.51 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Node otterfall | 21.27 | 76.44 | 0.00 | 23.51 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Process 1506509 | 21.27 | 76.44 | 0.00 | 23.51 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 0 (TID 1506509) | 21.27 | 97.58 | 0.00 | 2.40 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 1 (TID 1506531) | 17.88 | 73.08 | 0.00 | 26.89 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 2 (TID 1506532) | 17.81 | 72.91 | 0.00 | 27.04 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 3 (TID 1506533) | 17.89 | 73.04 | 0.00 | 26.88 | 0.00 | 0.00 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 4 (TID 1506534) | 18.14 | 71.59 | 0.00 | 28.33 | 0.00 | 0.00 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 5 (TID 1506535) | 17.81 | 72.94 | 0.00 | 27.01 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 6 (TID 1506536) | 17.89 | 73.12 | 0.00 | 26.88 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 7 (TID 1506537) | 18.09 | 73.36 | 0.00 | 26.59 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼run_10_threads | 19.32 | 71.89 | 0.00 | 28.02 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Node otterfall | 19.32 | 71.89 | 0.00 | 28.02 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Process 1506555 | 19.32 | 71.89 | 0.00 | 28.02 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 0 (TID 1506555) | 19.32 | 95.60 | 0.00 | 4.37 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 1 (TID 1506577) | 15.40 | 68.72 | 0.00 | 31.05 | 0.00 | 0.00 | 0.23 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 2 (TID 1506578) | 15.37 | 68.67 | 0.00 | 31.26 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 3 (TID 1506579) | 15.58 | 69.04 | 0.00 | 30.83 | 0.00 | 0.00 | 0.13 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 4 (TID 1506580) | 15.97 | 67.87 | 0.00 | 32.04 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 5 (TID 1506581) | 15.33 | 68.59 | 0.00 | 31.38 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 6 (TID 1506582) | 15.36 | 68.70 | 0.00 | 31.27 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 7 (TID 1506583) | 15.69 | 69.16 | 0.00 | 30.71 | 0.00 | 0.00 | 0.13 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 8 (TID 1506584) | 15.26 | 68.32 | 0.00 | 31.59 | 0.00 | 0.00 | 0.10 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 9 (TID 1506585) | 15.21 | 68.32 | 0.00 | 31.61 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
| Run | Number of threads | Binary (%) | OMP (%) | System (%) |
|---|---|---|---|---|
| run_1_thread | 1 | 100 | 0 | 0 |
| run_2_threads | 2 | 95.61 | 4.37 | 0.01 |
| run_4_threads | 4 | 88.42 | 11.56 | 0.02 |
| run_8_threads | 8 | 76.44 | 23.51 | 0.05 |
| run_10_threads | 10 | 71.89 | 28.02 | 0.09 |
Scalability - Time per Category
Detailed Time per Category
| Run | Number of threads | Total Time (s) | Binary (s) | OMP (s) | System (s) |
|---|---|---|---|---|---|
| run_1_thread | 1 | 101.84 | 101.84 | 0 | 0 |
| run_2_threads | 2 | 55.86 | 53.41 | 2.44 | 0.01 |
| run_4_threads | 4 | 33.17 | 29.33 | 3.84 | 0.01 |
| run_8_threads | 8 | 21.27 | 16.26 | 5 | 0.01 |
| run_10_threads | 10 | 19.32 | 13.89 | 5.41 | 0.02 |
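The per-category seconds in this table appear to be the total time scaled by the coverage percentages from the previous section. The sketch below checks that relationship on the reported values; the helper name `category_seconds` is hypothetical, and the formula is an assumption inferred from the numbers, not something the report states.

```python
# Values copied from the tables above: run -> (total time in s, Binary %, OMP %).
runs = {
    "run_2_threads": (55.86, 95.61, 4.37),
    "run_4_threads": (33.17, 88.42, 11.56),
    "run_8_threads": (21.27, 76.44, 23.51),
    "run_10_threads": (19.32, 71.89, 28.02),
}


def category_seconds(total_s, coverage_pct):
    """Assumed relationship: time in a category = total time * coverage / 100."""
    return total_s * coverage_pct / 100.0


for name, (total_s, binary_pct, omp_pct) in runs.items():
    binary_s = category_seconds(total_s, binary_pct)
    omp_s = category_seconds(total_s, omp_pct)
    print(f"{name}: binary {binary_s:.2f} s, OMP {omp_s:.2f} s")
```

The results agree with the table to within about 0.01 s, which is consistent with the percentages having been rounded to two decimals before display.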
Scalability - Efficiency
Detailed Efficiency
| Run | Number of observed threads | Efficiency (ideal is 1) |
|---|---|---|
| run_1_thread | 1 | 1 |
| run_2_threads | 2 | 0.91 |
| run_4_threads | 4 | 0.76 |
| run_8_threads | 8 | 0.6 |
| run_10_threads | 10 | 0.53 |
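The efficiency column is consistent with the usual definition of parallel efficiency, speedup divided by thread count, with the single-thread run as the baseline. The sketch below recomputes it from the total times in the Time per Category table. That the report uses exactly this formula is an assumption; the function names are illustrative.

```python
# Wall-clock times (s) per thread count, copied from the "Total Time (s)" column.
total_time = {1: 101.84, 2: 55.86, 4: 33.17, 8: 21.27, 10: 19.32}


def speedup(threads):
    """Speedup relative to the single-thread run: T(1) / T(n)."""
    return total_time[1] / total_time[threads]


def efficiency(threads):
    """Parallel efficiency: speedup divided by the number of threads."""
    return speedup(threads) / threads


for n in sorted(total_time):
    print(f"{n:>2} threads: speedup {speedup(n):.2f}, efficiency {efficiency(n):.2f}")
```

The recomputed values match the table for 1, 2, 8 and 10 threads. For 4 threads the formula gives about 0.77 against the reported 0.76, which may reflect truncation or unrounded timings inside the tool.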
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
| Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
| run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 95.61 | 4.39 |
| run_4_threads | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 88.42 | 0 | 11.58 |
| run_8_threads | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 71.15 | 5.29 | 0 | 23.56 |
| run_10_threads | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 71.89 | 0 | 0 | 28.11 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
| Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
| run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4.39 | 95.61 | 0 |
| run_4_threads | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11.58 | 88.42 | 0 |
| run_8_threads | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23.56 | 76.44 | 0 |
| run_10_threads | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 28.11 | 71.89 | 0 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
| Library | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
|---|---|---|---|---|---|
| /opt/intel/oneapi/compiler/2024.2/lib/libarcher.so | | | | | |
| /opt/intel/oneapi/compiler/2024.2/lib/libimf.so | | | | | |
| /opt/intel/oneapi/compiler/2024.2/lib/libintlc.so.5 | | | | | |
| /opt/intel/oneapi/compiler/2024.2/lib/libiomp5.so | | | | | |
| /opt/intel/oneapi/compiler/2024.2/lib/libirng.so | | | | | |
| /opt/intel/oneapi/compiler/2024.2/lib/libsvml.so | | | | | |
| /usr/lib/ld-linux-x86-64.so.2 | | | | | |
| /usr/lib/libc.so.6 | | | | | |
| /usr/lib/libdl.so.2 | | | | | |
| /usr/lib/libgcc_s.so.1 | | | | | |
| /usr/lib/libm.so.6 | | | | | |
| /usr/lib/libpthread.so.0 | | | | | |
| /usr/lib/librt.so.1 | | | | | |
| /usr/lib/libstdc++.so.6.0.34 | | | | | |

