Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | TBB(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
▼run_1_thread | 150.77 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 150.77 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 7432 | 150.77 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 7432) | 150.77 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_2_threads | 79.67 | 96.84 | 0.00 | 3.15 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 79.67 | 96.84 | 0.00 | 3.15 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 7464 | 79.67 | 96.84 | 0.00 | 3.15 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 7464) | 79.67 | 99.84 | 0.00 | 0.16 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 7479) | 77.12 | 93.74 | 0.00 | 6.24 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_4_threads | 46.21 | 91.71 | 0.00 | 8.26 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 46.21 | 91.71 | 0.00 | 8.26 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 7496 | 46.21 | 91.71 | 0.00 | 8.26 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 7496) | 46.21 | 99.72 | 0.00 | 0.27 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 7511) | 43.29 | 88.88 | 0.00 | 11.09 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 7512) | 43.24 | 88.84 | 0.00 | 11.08 | 0.00 | 0.00 | 0.00 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 7513) | 43.25 | 88.85 | 0.00 | 11.14 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_8_threads | 27.83 | 83.04 | 0.00 | 16.90 | 0.00 | 0.00 | 0.02 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 27.83 | 83.04 | 0.00 | 16.90 | 0.00 | 0.00 | 0.02 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 7529 | 27.83 | 83.04 | 0.00 | 16.90 | 0.00 | 0.00 | 0.02 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 7529) | 27.83 | 99.26 | 0.00 | 0.65 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 7544) | 24.75 | 80.44 | 0.00 | 19.54 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 7545) | 24.70 | 80.47 | 0.00 | 19.47 | 0.00 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 7546) | 24.69 | 80.46 | 0.00 | 19.50 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 7547) | 24.72 | 80.32 | 0.00 | 19.58 | 0.00 | 0.00 | 0.04 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 7548) | 24.70 | 80.43 | 0.00 | 19.55 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 7549) | 24.69 | 80.42 | 0.00 | 19.50 | 0.00 | 0.00 | 0.00 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 7550) | 24.70 | 80.50 | 0.00 | 19.50 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_10_threads | 23.87 | 79.24 | 0.00 | 20.71 | 0.00 | 0.00 | 0.01 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 23.87 | 79.24 | 0.00 | 20.71 | 0.00 | 0.00 | 0.01 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 7566 | 23.87 | 79.24 | 0.00 | 20.71 | 0.00 | 0.00 | 0.01 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 7566) | 23.87 | 99.18 | 0.00 | 0.80 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 7581) | 20.78 | 76.76 | 0.00 | 23.22 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 7582) | 20.71 | 76.70 | 0.00 | 23.20 | 0.00 | 0.00 | 0.00 | 0.10 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 7583) | 20.70 | 76.70 | 0.00 | 23.28 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 7584) | 20.73 | 76.68 | 0.00 | 23.28 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 7585) | 20.71 | 76.68 | 0.00 | 23.32 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 7586) | 20.70 | 76.70 | 0.00 | 23.23 | 0.00 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 7587) | 20.70 | 76.74 | 0.00 | 23.21 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 8 (TID 7588) | 20.75 | 76.60 | 0.00 | 23.33 | 0.00 | 0.00 | 0.02 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 9 (TID 7589) | 20.73 | 76.68 | 0.00 | 23.30 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
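The process-level percentages in the table above appear to be time-weighted averages of the per-thread percentages. The sketch below checks this for the OMP column of run_2_threads and run_4_threads, using values copied from the table. This is an observation about the numbers, not a documented description of how the profiler aggregates them; the function name `weighted_coverage` is illustrative only.

```python
# (thread wall time in s, OMP coverage in %) per thread, copied from the table above.
threads_run_2 = [(79.67, 0.16), (77.12, 6.24)]
threads_run_4 = [(46.21, 0.27), (43.29, 11.09), (43.24, 11.08), (43.25, 11.14)]


def weighted_coverage(threads):
    """Average the per-thread coverage, weighting each thread by its time."""
    total_time = sum(time_s for time_s, _ in threads)
    return sum(time_s * pct for time_s, pct in threads) / total_time


if __name__ == "__main__":
    for name, threads in (("run_2_threads", threads_run_2),
                          ("run_4_threads", threads_run_4)):
        print(f"{name}: OMP coverage ~ {weighted_coverage(threads):.2f} %")
```

Under this assumption the results land within 0.01 of the process-level OMP figures reported above (3.15 and 8.26).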
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | Binary (%) | OMP (%) | System (%) | Pthread (%) |
---|---|---|---|---|---|
run_1_thread | 1 | 99.98 | 0 | 0.02 | 0 |
run_2_threads | 2 | 96.84 | 3.15 | 0.01 | 0 |
run_4_threads | 4 | 91.71 | 8.26 | 0 | 0.03 |
run_8_threads | 8 | 83.04 | 16.9 | 0.02 | 0.03 |
run_10_threads | 10 | 79.24 | 20.71 | 0.01 | 0.03 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | Binary (s) | OMP (s) | System (s) | Pthread (s) |
---|---|---|---|---|---|---|
run_1_thread | 1 | 150.76 | 150.73 | 0 | 0.02 | 0 |
run_2_threads | 2 | 79.67 | 77.16 | 2.51 | 0.01 | 0 |
run_4_threads | 4 | 46.21 | 42.38 | 3.82 | 0 | 0.01 |
run_8_threads | 8 | 27.83 | 23.11 | 4.7 | 0 | 0.01 |
run_10_threads | 10 | 23.87 | 18.92 | 4.94 | 0 | 0.01 |
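The per-category times above are consistent with multiplying each run's total time by the matching coverage percentage from the Coverage per Category table. A minimal sketch, assuming that relationship holds (the report itself does not state it) and using the OMP column as the example:

```python
# run name -> (total time in s, OMP coverage in %), copied from the two tables.
runs = {
    "run_2_threads": (79.67, 3.15),
    "run_4_threads": (46.21, 8.26),
    "run_8_threads": (27.83, 16.9),
    "run_10_threads": (23.87, 20.71),
}


def category_time(total_s, coverage_pct):
    """Convert a coverage percentage into seconds of the run's total time."""
    return total_s * coverage_pct / 100.0


if __name__ == "__main__":
    for name, (total_s, omp_pct) in runs.items():
        print(f"{name}: OMP ~ {category_time(total_s, omp_pct):.2f} s")
```

The computed OMP times agree with the table to within 0.01 s. The Binary column agrees slightly less closely, which may be due to rounding in the published percentages.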
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
run_1_thread | 1 | 1 |
run_2_threads | 2 | 0.94 |
run_4_threads | 4 | 0.81 |
run_8_threads | 8 | 0.68 |
run_10_threads | 10 | 0.63 |
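For reference, the efficiency values above can be approximated with the usual strong-scaling definitions: speedup is T(1) / T(n) and efficiency is speedup / n. The sketch below applies them to the total times from the Time per Category table. It is an assumption that the report uses exactly this definition; the results match the table to within 0.01, so the tool's rounding or time base may differ slightly.

```python
# thread count -> total time in seconds, copied from the Time per Category table.
total_time_s = {1: 150.76, 2: 79.67, 4: 46.21, 8: 27.83, 10: 23.87}


def speedup_and_efficiency(times):
    """Return {threads: (speedup, efficiency)} relative to the 1-thread run."""
    base = times[1]
    return {n: (base / t, base / (n * t)) for n, t in times.items()}


if __name__ == "__main__":
    for n, (speedup, eff) in speedup_and_efficiency(total_time_s).items():
        print(f"{n:>2} threads: speedup {speedup:.2f}, efficiency {eff:.2f}")
```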
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 96.85 | 3.15 |
run_4_threads | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0 | 87.3 | 4.41 | 8.27 |
run_8_threads | 8 | 0 | 0 | 0 | 0.03 | 0 | 0 | 0 | 0 | 83.04 | 0 | 16.92 |
run_10_threads | 10 | 0 | 0 | 0 | 0.03 | 0 | 0 | 0 | 75.49 | 3.75 | 0 | 20.72 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3.15 | 96.85 | 0 |
run_4_threads | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8.27 | 91.73 | 0 |
run_8_threads | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16.92 | 83.08 | 0 |
run_10_threads | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20.72 | 79.28 | 0 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
---|---|---|---|---|---|
/opt/intel/oneapi/compiler/2024.2/lib/libarcher.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libimf.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libintlc.so.5 | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libirng.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libsvml.so | |||||
/usr/lib/ld-linux-x86-64.so.2 | |||||
/usr/lib/libc.so.6 | |||||
/usr/lib/libdl.so.2 | |||||
/usr/lib/libgcc_s.so.1 | |||||
/usr/lib/libm.so.6 | |||||
/usr/lib/libomp.so | |||||
/usr/lib/libpthread.so.0 | |||||
/usr/lib/libstdc++.so.6.0.34 | |||||