| Detailed Application Categorization |
| Detailed Function Times |
| Scalability - Coverage per Category |
| Scalability - Time per Category |
| Scalability - Efficiency |
| Function Based Profile |
| Scalability - Coverage per Parallel Efficiency |
| Scalability - Coverage per Parallel Speedup |
| Libraries |
Detailed Application Categorization
| ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | TBB(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | Others(%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ▼run_1_thread | 160.11 | 99.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Node otterfall | 160.11 | 99.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Process 7079 | 160.11 | 99.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 0 (TID 7079) | 160.11 | 99.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼run_2_threads | 84.38 | 97.03 | 0.00 | 2.95 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Node otterfall | 84.38 | 97.03 | 0.00 | 2.95 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Process 7110 | 84.38 | 97.03 | 0.00 | 2.95 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 0 (TID 7110) | 84.38 | 99.87 | 0.00 | 0.13 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 1 (TID 7125) | 81.94 | 94.11 | 0.00 | 5.86 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼run_4_threads | 48.57 | 92.13 | 0.00 | 7.83 | 0.00 | 0.00 | 0.01 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Node otterfall | 48.57 | 92.13 | 0.00 | 7.83 | 0.00 | 0.00 | 0.01 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Process 7142 | 48.57 | 92.13 | 0.00 | 7.83 | 0.00 | 0.00 | 0.01 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 0 (TID 7142) | 48.57 | 99.67 | 0.00 | 0.31 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 |
| ○OMP # 1 (TID 7157) | 45.82 | 89.49 | 0.00 | 10.44 | 0.00 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 2 (TID 7158) | 45.77 | 89.47 | 0.00 | 10.50 | 0.00 | 0.00 | 0.01 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 3 (TID 7159) | 45.77 | 89.45 | 0.00 | 10.52 | 0.00 | 0.00 | 0.01 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼run_8_threads | 28.96 | 83.90 | 0.00 | 16.06 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Node otterfall | 28.96 | 83.90 | 0.00 | 16.06 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Process 7176 | 28.96 | 83.90 | 0.00 | 16.06 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 0 (TID 7176) | 28.96 | 99.41 | 0.00 | 0.59 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 1 (TID 7191) | 26.07 | 81.45 | 0.00 | 18.49 | 0.00 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 2 (TID 7192) | 26.02 | 81.46 | 0.00 | 18.52 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 3 (TID 7193) | 26.02 | 81.46 | 0.00 | 18.49 | 0.00 | 0.00 | 0.02 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 4 (TID 7194) | 26.03 | 81.39 | 0.00 | 18.56 | 0.00 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 5 (TID 7195) | 26.02 | 81.46 | 0.00 | 18.50 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 6 (TID 7196) | 26.01 | 81.43 | 0.00 | 18.49 | 0.00 | 0.00 | 0.00 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 7 (TID 7197) | 26.01 | 81.40 | 0.00 | 18.57 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼run_10_threads | 24.74 | 80.23 | 0.00 | 19.69 | 0.00 | 0.00 | 0.01 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Node otterfall | 24.74 | 80.23 | 0.00 | 19.69 | 0.00 | 0.00 | 0.01 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 |
| ▼Process 7214 | 24.74 | 80.23 | 0.00 | 19.69 | 0.00 | 0.00 | 0.01 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 0 (TID 7214) | 24.74 | 99.27 | 0.00 | 0.71 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 |
| ○OMP # 1 (TID 7229) | 21.86 | 77.85 | 0.00 | 21.99 | 0.00 | 0.00 | 0.05 | 0.11 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 2 (TID 7230) | 21.77 | 77.79 | 0.00 | 22.21 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 3 (TID 7231) | 21.77 | 77.86 | 0.00 | 22.03 | 0.00 | 0.00 | 0.00 | 0.11 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 4 (TID 7232) | 21.78 | 77.84 | 0.00 | 22.14 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 5 (TID 7233) | 21.76 | 77.83 | 0.00 | 22.08 | 0.00 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 6 (TID 7234) | 21.76 | 77.88 | 0.00 | 22.05 | 0.00 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 7 (TID 7235) | 21.77 | 77.81 | 0.00 | 22.14 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 8 (TID 7236) | 21.79 | 77.79 | 0.00 | 22.07 | 0.00 | 0.00 | 0.00 | 0.14 | 0.00 | 0.00 | 0.00 | 0.00 |
| ○OMP # 9 (TID 7237) | 21.78 | 77.82 | 0.00 | 22.11 | 0.00 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
| Run | Number of threads | Binary (%) | OMP (%) | Pthread (%) |
|---|---|---|---|---|
| run_1_thread | 1 | 99.99 | 0 | 0 |
| run_2_threads | 2 | 97.03 | 2.95 | 0.01 |
| run_4_threads | 4 | 92.13 | 7.83 | 0.03 |
| run_8_threads | 8 | 83.9 | 16.06 | 0.04 |
| run_10_threads | 10 | 80.23 | 19.69 | 0.06 |
Scalability - Time per Category
Detailed Time per Category
| Run | Number of threads | Total Time (s) | Binary (s) | OMP (s) | System (s) | Pthread (s) |
|---|---|---|---|---|---|---|
| run_1_thread | 1 | 160.11 | 160.09 | 0 | 0.01 | 0 |
| run_2_threads | 2 | 84.38 | 81.87 | 2.49 | 0.01 | 0.01 |
| run_4_threads | 4 | 48.57 | 44.74 | 3.8 | 0 | 0.01 |
| run_8_threads | 8 | 28.96 | 24.29 | 4.65 | 0 | 0.01 |
| run_10_threads | 10 | 24.74 | 19.85 | 4.87 | 0 | 0.02 |
Scalability - Efficiency
Detailed Efficiency
| Run | Number of observed threads | Efficiency (ideal is 1) |
|---|---|---|
| run_1_thread | 1 | 1 |
| run_2_threads | 2 | 0.95 |
| run_4_threads | 4 | 0.82 |
| run_8_threads | 8 | 0.69 |
| run_10_threads | 10 | 0.65 |
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Columns Filter
| Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
| run_2_threads | 2 | 0 | 0 | 0 | 0.02 | 0 | 0 | 0 | 0 | 0 | 97.03 | 2.95 |
| run_4_threads | 4 | 0 | 0 | 0.02 | 0 | 0 | 0 | 0 | 0 | 88.05 | 4.08 | 7.84 |
| run_8_threads | 8 | 0 | 0.03 | 0 | 0 | 0 | 0 | 0 | 0 | 83.9 | 0 | 16.07 |
| run_10_threads | 10 | 0 | 0.04 | 0 | 0 | 0 | 0 | 0 | 0 | 80.24 | 0 | 19.73 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Columns Filter
| Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
| run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0 | 2.95 | 97.03 | 0 |
| run_4_threads | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7.86 | 92.14 | 0 |
| run_8_threads | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16.07 | 83.93 | 0 |
| run_10_threads | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19.73 | 80.27 | 0 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
| Library | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
|---|---|---|---|---|---|
| /opt/intel/oneapi/compiler/2024.2/lib/libarcher.so | |||||
| /opt/intel/oneapi/compiler/2024.2/lib/libimf.so | |||||
| /opt/intel/oneapi/compiler/2024.2/lib/libintlc.so.5 | |||||
| /opt/intel/oneapi/compiler/2024.2/lib/libirng.so | |||||
| /opt/intel/oneapi/compiler/2024.2/lib/libsvml.so | |||||
| /usr/lib/ld-linux-x86-64.so.2 | |||||
| /usr/lib/libc.so.6 | |||||
| /usr/lib/libdl.so.2 | |||||
| /usr/lib/libgcc_s.so.1 | |||||
| /usr/lib/libm.so.6 | |||||
| /usr/lib/libomp.so | |||||
| /usr/lib/libpthread.so.0 | |||||
| /usr/lib/libstdc++.so.6.0.34 |

