Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | TBB(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
▼run_1_thread | 161.70 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 161.70 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 6479 | 161.70 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 6479) | 161.70 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_2_threads | 86.66 | 97.01 | 0.00 | 2.95 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 86.66 | 97.01 | 0.00 | 2.95 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 6517 | 86.66 | 97.01 | 0.00 | 2.95 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 6517) | 86.66 | 99.83 | 0.00 | 0.15 | 0.00 | 0.00 | 0.02 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 6539) | 80.07 | 93.97 | 0.00 | 5.99 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_4_threads | 51.82 | 92.12 | 0.00 | 7.84 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 51.82 | 92.12 | 0.00 | 7.84 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 6556 | 51.82 | 92.12 | 0.00 | 7.84 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 6556) | 51.82 | 99.68 | 0.00 | 0.30 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 6578) | 44.88 | 89.21 | 0.00 | 10.73 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 6579) | 44.81 | 89.21 | 0.00 | 10.75 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 6580) | 44.80 | 89.22 | 0.00 | 10.76 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_8_threads | 32.60 | 83.90 | 0.00 | 16.05 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 32.60 | 83.90 | 0.00 | 16.05 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 6596 | 32.60 | 83.90 | 0.00 | 16.05 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 6596) | 32.60 | 99.48 | 0.00 | 0.48 | 0.00 | 0.00 | 0.03 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 6618) | 25.55 | 81.17 | 0.00 | 18.81 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 6619) | 25.48 | 81.07 | 0.00 | 18.87 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 6620) | 25.49 | 80.97 | 0.00 | 18.97 | 0.00 | 0.00 | 0.04 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 6621) | 25.50 | 81.02 | 0.00 | 18.92 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 6622) | 25.49 | 81.07 | 0.00 | 18.93 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 6623) | 25.46 | 81.01 | 0.00 | 18.89 | 0.00 | 0.00 | 0.10 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 6624) | 25.47 | 81.09 | 0.00 | 18.87 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_10_threads | 28.50 | 80.25 | 0.00 | 19.70 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 28.50 | 80.25 | 0.00 | 19.70 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 6641 | 28.50 | 80.25 | 0.00 | 19.70 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 6641) | 28.50 | 99.42 | 0.00 | 0.56 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 6663) | 21.44 | 77.52 | 0.00 | 22.43 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 6664) | 21.40 | 77.52 | 0.00 | 22.41 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 6665) | 21.38 | 77.33 | 0.00 | 22.62 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 6666) | 21.39 | 77.38 | 0.00 | 22.58 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 6667) | 21.38 | 77.34 | 0.00 | 22.59 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 6668) | 21.38 | 77.52 | 0.00 | 22.43 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 6669) | 21.36 | 77.36 | 0.00 | 22.54 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 8 (TID 6670) | 21.42 | 77.27 | 0.00 | 22.73 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 9 (TID 6671) | 21.41 | 77.51 | 0.00 | 22.45 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | Binary (%) | OMP (%) | System (%) |
---|---|---|---|---|
run_1_thread | 1 | 99.98 | 0.00 | 0.02 |
run_2_threads | 2 | 97.01 | 2.95 | 0.03 |
run_4_threads | 4 | 92.12 | 7.84 | 0.04 |
run_8_threads | 8 | 83.90 | 16.05 | 0.04 |
run_10_threads | 10 | 80.25 | 19.70 | 0.05 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | Binary (s) | OMP (s) | System (s) |
---|---|---|---|---|---|
run_1_thread | 1 | 161.70 | 161.67 | 0.00 | 0.03 |
run_2_threads | 2 | 86.66 | 84.07 | 2.56 | 0.03 |
run_4_threads | 4 | 51.82 | 47.74 | 4.06 | 0.02 |
run_8_threads | 8 | 32.60 | 27.35 | 5.23 | 0.01 |
run_10_threads | 10 | 28.50 | 22.87 | 5.61 | 0.01 |
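The per-category times in this table appear to be each run's total time scaled by its coverage percentages from the Coverage per Category table. The report does not state that formula, so the Python sketch below is an assumption that happens to reproduce the listed values to two decimals; `category_times` is a hypothetical helper name, not part of the profiler.

```python
# Total time (s) and coverage (%) copied from the report tables.
RUNS = {
    "run_1_thread":   (161.70, {"Binary": 99.98, "OMP": 0.00,  "System": 0.02}),
    "run_2_threads":  (86.66,  {"Binary": 97.01, "OMP": 2.95,  "System": 0.03}),
    "run_4_threads":  (51.82,  {"Binary": 92.12, "OMP": 7.84,  "System": 0.04}),
    "run_8_threads":  (32.60,  {"Binary": 83.90, "OMP": 16.05, "System": 0.04}),
    "run_10_threads": (28.50,  {"Binary": 80.25, "OMP": 19.70, "System": 0.05}),
}


def category_times(total_s, coverage_pct):
    """Assumed relation: category time = total time * coverage / 100."""
    return {cat: round(total_s * pct / 100.0, 2)
            for cat, pct in coverage_pct.items()}


for run, (total_s, coverage) in RUNS.items():
    print(run, category_times(total_s, coverage))
```

Read this way, OMP runtime time grows from 2.56 s to 5.61 s between 2 and 10 threads while Binary time keeps shrinking, which is why the OMP share of each run rises.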
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
run_1_thread | 1 | 1 |
run_2_threads | 2 | 0.93 |
run_4_threads | 4 | 0.78 |
run_8_threads | 8 | 0.62 |
run_10_threads | 10 | 0.57 |
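The efficiency values above are consistent with the usual definition of parallel efficiency: speedup over the single-thread run divided by the thread count. The sketch below assumes that definition and takes `run_1_thread` as the baseline; the function names are illustrative and do not come from the profiler.

```python
# Total wall time (s) per thread count, from the Time per Category table.
TOTAL_TIME_S = {1: 161.70, 2: 86.66, 4: 51.82, 8: 32.60, 10: 28.50}


def speedup(threads, times=TOTAL_TIME_S):
    """Speedup relative to the 1-thread run: T(1) / T(n)."""
    return times[1] / times[threads]


def efficiency(threads, times=TOTAL_TIME_S):
    """Parallel efficiency: speedup divided by thread count (ideal is 1)."""
    return speedup(threads, times) / threads


for n in sorted(TOTAL_TIME_S):
    print(f"{n:>2} threads: speedup {speedup(n):.2f}, "
          f"efficiency {efficiency(n):.2f}")
```

Rounded to two decimals, these efficiencies match the table for every run.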
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0 | 0 | 97.02 | 2.96 |
run_4_threads | 4 | 0 | 0 | 0 | 0.03 | 0 | 0 | 0 | 0 | 92.12 | 0 | 7.84 |
run_8_threads | 8 | 0 | 0 | 0 | 0.03 | 0 | 0 | 0 | 83.9 | 0 | 0 | 16.07 |
run_10_threads | 10 | 0 | 0 | 0.03 | 0 | 0 | 0 | 0 | 80.26 | 0 | 0 | 19.71 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2.96 | 97.04 | 0 |
run_4_threads | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7.84 | 92.16 | 0 |
run_8_threads | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16.07 | 83.93 | 0 |
run_10_threads | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19.71 | 80.29 | 0 |
Libraries
- green cell: the library was found while profiling the run
- red cell: the library does not appear in the run's profile
Library | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
---|---|---|---|---|---|
/opt/intel/oneapi/compiler/2024.2/lib/libarcher.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libimf.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libintlc.so.5 | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libiomp5.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libirng.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libsvml.so | |||||
/usr/lib/ld-linux-x86-64.so.2 | |||||
/usr/lib/libc.so.6 | |||||
/usr/lib/libdl.so.2 | |||||
/usr/lib/libgcc_s.so.1 | |||||
/usr/lib/libm.so.6 | |||||
/usr/lib/libpthread.so.0 | |||||
/usr/lib/librt.so.1 | |||||
/usr/lib/libstdc++.so.6.0.34 |