Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | TBB(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
▼run_1_thread | 96.57 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 96.57 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 5841 | 96.57 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 5841) | 96.57 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_2_threads | 49.16 | 94.83 | 0.00 | 5.14 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 49.16 | 94.83 | 0.00 | 5.14 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 5878 | 49.16 | 94.83 | 0.00 | 5.14 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 5878) | 49.16 | 99.73 | 0.00 | 0.23 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 5900) | 46.60 | 89.67 | 0.00 | 10.31 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_4_threads | 30.12 | 86.90 | 0.00 | 13.07 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 30.12 | 86.90 | 0.00 | 13.07 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 5917 | 30.12 | 86.90 | 0.00 | 13.07 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 5917) | 30.12 | 99.54 | 0.00 | 0.45 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 5939) | 27.20 | 82.28 | 0.00 | 17.68 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 5940) | 27.13 | 82.10 | 0.00 | 17.84 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 5941) | 27.13 | 82.31 | 0.00 | 17.67 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_8_threads | 19.54 | 74.83 | 0.00 | 25.10 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 19.54 | 74.83 | 0.00 | 25.10 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 5957 | 19.54 | 74.83 | 0.00 | 25.10 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 5957) | 19.54 | 99.18 | 0.00 | 0.79 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 5979) | 16.56 | 70.79 | 0.00 | 29.18 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 5980) | 16.48 | 70.76 | 0.00 | 29.15 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 5981) | 16.48 | 70.73 | 0.00 | 29.18 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 5982) | 16.51 | 70.61 | 0.00 | 29.17 | 0.00 | 0.00 | 0.21 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 5983) | 16.48 | 70.67 | 0.00 | 29.21 | 0.00 | 0.00 | 0.12 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 5984) | 16.48 | 70.70 | 0.00 | 29.30 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 5985) | 16.48 | 70.66 | 0.00 | 29.31 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_10_threads | 17.96 | 71.21 | 0.00 | 28.72 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 17.96 | 71.21 | 0.00 | 28.72 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 6002 | 17.96 | 71.21 | 0.00 | 28.72 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 6002) | 17.96 | 99.11 | 0.00 | 0.84 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 6024) | 14.87 | 67.72 | 0.00 | 32.21 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 6025) | 14.79 | 67.31 | 0.00 | 32.66 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 6026) | 14.83 | 67.43 | 0.00 | 32.43 | 0.00 | 0.00 | 0.13 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 6027) | 14.85 | 67.45 | 0.00 | 32.38 | 0.00 | 0.00 | 0.13 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 6028) | 14.85 | 67.44 | 0.00 | 32.53 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 6029) | 14.84 | 67.39 | 0.00 | 32.61 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 6030) | 14.81 | 67.47 | 0.00 | 32.40 | 0.00 | 0.00 | 0.13 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 8 (TID 6031) | 14.83 | 67.41 | 0.00 | 32.59 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 9 (TID 6032) | 14.81 | 67.52 | 0.00 | 32.48 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | Binary (%) | OMP (%) | System (%) |
---|---|---|---|---|
run_1_thread | 1 | 99.98 | 0.00 | 0.02 |
run_2_threads | 2 | 94.83 | 5.14 | 0.03 |
run_4_threads | 4 | 86.90 | 13.07 | 0.03 |
run_8_threads | 8 | 74.83 | 25.10 | 0.07 |
run_10_threads | 10 | 71.21 | 28.72 | 0.06 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | Binary (s) | OMP (s) | System (s) |
---|---|---|---|---|---|
run_1_thread | 1 | 96.57 | 96.56 | 0 | 0.01 |
run_2_threads | 2 | 49.16 | 46.62 | 2.53 | 0.02 |
run_4_threads | 4 | 30.12 | 26.17 | 3.94 | 0.01 |
run_8_threads | 8 | 19.54 | 14.62 | 4.91 | 0.01 |
run_10_threads | 10 | 17.96 | 12.79 | 5.16 | 0.01 |
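The per-category times above appear to be the run's total time multiplied by that category's coverage percentage. The following is a minimal sketch of that relationship, not the profiler's own code: the function name `time_per_category` and the `runs` dictionary are illustrative, and the figures are copied from the two scalability tables. Because the published percentages are already rounded, the results can differ from the tabulated seconds by about 0.01 s.

```python
# Total time (s) and coverage (%) per run, copied from the tables above.
runs = {
    "run_1_thread":   (96.57, {"Binary": 99.98, "OMP": 0.00,  "System": 0.02}),
    "run_2_threads":  (49.16, {"Binary": 94.83, "OMP": 5.14,  "System": 0.03}),
    "run_4_threads":  (30.12, {"Binary": 86.90, "OMP": 13.07, "System": 0.03}),
    "run_8_threads":  (19.54, {"Binary": 74.83, "OMP": 25.10, "System": 0.07}),
    "run_10_threads": (17.96, {"Binary": 71.21, "OMP": 28.72, "System": 0.06}),
}


def time_per_category(total_s, coverage_pct):
    """Assumed relation: seconds in a category = total time * coverage / 100."""
    return {cat: total_s * pct / 100.0 for cat, pct in coverage_pct.items()}


if __name__ == "__main__":
    for name, (total_s, coverage) in runs.items():
        times = time_per_category(total_s, coverage)
        # Round to two decimals to match the precision of the report.
        print(name, {cat: round(t, 2) for cat, t in times.items()})
```

Read this way, the time attributed to the OMP runtime grows from 2.53 s to 5.16 s as threads are added, while Binary time falls from 96.56 s to 12.79 s.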
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
run_1_thread | 1 | 1 |
run_2_threads | 2 | 0.98 |
run_4_threads | 4 | 0.8 |
run_8_threads | 8 | 0.62 |
run_10_threads | 10 | 0.54 |
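The efficiency figures above are consistent with the usual definition of parallel efficiency, speedup divided by thread count, where speedup is the single-thread time over the n-thread time. The sketch below assumes that definition (the report itself does not state the formula) and uses the total times from the Time per Category table. The names `speedup` and `efficiency` are illustrative.

```python
# Total time (s) by thread count, copied from the Time per Category table.
total_time_s = {1: 96.57, 2: 49.16, 4: 30.12, 8: 19.54, 10: 17.96}


def speedup(t1, tn):
    """Speedup relative to the single-thread run."""
    return t1 / tn


def efficiency(t1, tn, n_threads):
    """Assumed definition: speedup divided by thread count (ideal is 1)."""
    return speedup(t1, tn) / n_threads


if __name__ == "__main__":
    t1 = total_time_s[1]
    for n, tn in total_time_s.items():
        print(f"{n:>2} threads: speedup {speedup(t1, tn):.2f}, "
              f"efficiency {efficiency(t1, tn, n):.2f}")
```

Under this assumption, the computed efficiencies match the table to two decimals.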
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0 | 0 | 94.84 | 5.15 |
run_4_threads | 4 | 0 | 0 | 0 | 0.03 | 0 | 0 | 0 | 0 | 6.95 | 79.95 | 13.07 |
run_8_threads | 8 | 0 | 0.03 | 0 | 0 | 0 | 0 | 0 | 0 | 74.83 | 0 | 25.14 |
run_10_threads | 10 | 0 | 0 | 0.02 | 0 | 0 | 0 | 0 | 71.21 | 0 | 0 | 28.76 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5.15 | 94.85 | 0 |
run_4_threads | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13.07 | 86.93 | 0 |
run_8_threads | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25.14 | 74.86 | 0 |
run_10_threads | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 28.76 | 71.24 | 0 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
---|---|---|---|---|---|
/opt/intel/oneapi/compiler/2024.2/lib/libarcher.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libimf.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libintlc.so.5 | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libiomp5.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libirng.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libsvml.so | |||||
/usr/lib/ld-linux-x86-64.so.2 | |||||
/usr/lib/libc.so.6 | |||||
/usr/lib/libdl.so.2 | |||||
/usr/lib/libgcc_s.so.1 | |||||
/usr/lib/libm.so.6 | |||||
/usr/lib/libpthread.so.0 | |||||
/usr/lib/librt.so.1 | |||||
/usr/lib/libstdc++.so.6.0.34 |