Detailed Application Categorization |
Detailed Function Times |
Scalability - Coverage per Category |
Scalability - Time per Category |
Scalability - Efficiency |
Function Based Profile |
Scalability - Coverage per Parallel Efficiency |
Scalability - Coverage per Parallel Speedup |
Libraries |
Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | TBB(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
▼run_1_thread | 243.44 | 99.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node skylake | 243.44 | 99.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 2160726 | 243.44 | 99.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 2160726) | 243.44 | 99.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_2_threads | 127.16 | 98.06 | 0.00 | 1.92 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node skylake | 127.16 | 98.06 | 0.00 | 1.92 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 2160815 | 127.16 | 98.06 | 0.00 | 1.92 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 2160815) | 127.16 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 2160872) | 121.01 | 96.03 | 0.00 | 3.93 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_4_threads | 69.05 | 94.41 | 0.00 | 5.58 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node skylake | 69.05 | 94.41 | 0.00 | 5.58 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 2160897 | 69.05 | 94.41 | 0.00 | 5.58 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 2160897) | 69.05 | 99.99 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 2160954) | 62.91 | 92.37 | 0.00 | 7.61 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 2160955) | 62.91 | 92.39 | 0.00 | 7.61 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 2160956) | 62.91 | 92.37 | 0.00 | 7.63 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_8_threads | 39.99 | 87.89 | 0.00 | 12.08 | 0.00 | 0.00 | 0.01 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node skylake | 39.99 | 87.89 | 0.00 | 12.08 | 0.00 | 0.00 | 0.01 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 2160977 | 39.99 | 87.89 | 0.00 | 12.08 | 0.00 | 0.00 | 0.01 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 2160977) | 39.99 | 99.95 | 0.00 | 0.01 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 |
○OMP # 1 (TID 2161034) | 33.84 | 85.87 | 0.00 | 14.10 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 2161035) | 33.84 | 85.86 | 0.00 | 14.10 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 2161036) | 33.85 | 85.85 | 0.00 | 14.15 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 2161037) | 33.86 | 85.85 | 0.00 | 14.15 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 2161038) | 33.84 | 85.85 | 0.00 | 14.11 | 0.00 | 0.00 | 0.01 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 2161039) | 33.84 | 85.86 | 0.00 | 14.11 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 2161040) | 33.84 | 85.87 | 0.00 | 14.11 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_16_threads | 25.47 | 77.20 | 0.00 | 22.78 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node skylake | 25.47 | 77.20 | 0.00 | 22.78 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 2161058 | 25.47 | 77.20 | 0.00 | 22.78 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 2161058) | 25.47 | 99.96 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 2161115) | 19.32 | 75.21 | 0.00 | 24.79 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 2161116) | 19.32 | 75.18 | 0.00 | 24.82 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 2161117) | 19.32 | 75.18 | 0.00 | 24.79 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 2161118) | 19.33 | 75.19 | 0.00 | 24.81 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 2161119) | 19.32 | 75.21 | 0.00 | 24.79 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 2161120) | 19.32 | 75.18 | 0.00 | 24.79 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 2161121) | 19.32 | 75.21 | 0.00 | 24.77 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 8 (TID 2161122) | 19.33 | 75.21 | 0.00 | 24.73 | 0.00 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 9 (TID 2161123) | 19.32 | 75.16 | 0.00 | 24.79 | 0.00 | 0.00 | 0.03 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 10 (TID 2161124) | 19.32 | 75.21 | 0.00 | 24.74 | 0.00 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 11 (TID 2161125) | 19.32 | 75.21 | 0.00 | 24.77 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 12 (TID 2161126) | 19.33 | 75.17 | 0.00 | 24.75 | 0.00 | 0.00 | 0.00 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 13 (TID 2161127) | 19.32 | 75.21 | 0.00 | 24.77 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 14 (TID 2161128) | 19.32 | 75.18 | 0.00 | 24.82 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 15 (TID 2161129) | 19.32 | 75.23 | 0.00 | 24.77 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_26_threads | 19.89 | 67.05 | 0.00 | 32.90 | 0.00 | 0.00 | 0.01 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node skylake | 19.89 | 67.05 | 0.00 | 32.90 | 0.00 | 0.00 | 0.01 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 2161148 | 19.89 | 67.05 | 0.00 | 32.90 | 0.00 | 0.00 | 0.01 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 2161148) | 19.89 | 99.87 | 0.00 | 0.05 | 0.00 | 0.00 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 2161205) | 13.72 | 65.16 | 0.00 | 34.80 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 2161206) | 13.72 | 65.16 | 0.00 | 34.73 | 0.00 | 0.00 | 0.00 | 0.11 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 2161207) | 13.72 | 65.16 | 0.00 | 34.84 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 2161208) | 13.73 | 65.14 | 0.00 | 34.75 | 0.00 | 0.00 | 0.00 | 0.11 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 2161209) | 13.72 | 65.16 | 0.00 | 34.77 | 0.00 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 2161210) | 13.72 | 65.12 | 0.00 | 34.77 | 0.00 | 0.00 | 0.04 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 2161211) | 13.72 | 65.16 | 0.00 | 34.73 | 0.00 | 0.00 | 0.00 | 0.11 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 8 (TID 2161212) | 13.73 | 65.21 | 0.00 | 34.72 | 0.00 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 9 (TID 2161213) | 13.72 | 65.12 | 0.00 | 34.84 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 10 (TID 2161214) | 13.72 | 65.12 | 0.00 | 34.88 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 11 (TID 2161215) | 13.72 | 65.12 | 0.00 | 34.88 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 12 (TID 2161216) | 13.72 | 65.17 | 0.00 | 34.79 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 13 (TID 2161217) | 13.72 | 65.12 | 0.00 | 34.88 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 14 (TID 2161218) | 13.72 | 65.16 | 0.00 | 34.84 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 15 (TID 2161219) | 13.72 | 65.16 | 0.00 | 34.80 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 16 (TID 2161220) | 13.73 | 65.15 | 0.00 | 34.81 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 17 (TID 2161221) | 13.72 | 65.12 | 0.00 | 34.88 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 18 (TID 2161222) | 13.72 | 65.16 | 0.00 | 34.84 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 19 (TID 2161223) | 13.72 | 65.16 | 0.00 | 34.80 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 20 (TID 2161224) | 13.73 | 65.21 | 0.00 | 34.72 | 0.00 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 21 (TID 2161225) | 13.72 | 65.10 | 0.00 | 34.90 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 22 (TID 2161226) | 13.72 | 65.12 | 0.00 | 34.84 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 23 (TID 2161227) | 13.72 | 65.12 | 0.00 | 34.73 | 0.00 | 0.00 | 0.04 | 0.11 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 24 (TID 2161228) | 13.72 | 65.17 | 0.00 | 34.79 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 25 (TID 2161229) | 13.72 | 65.17 | 0.00 | 34.79 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | Binary (%) | OMP (%) | System (%) | Pthread (%) |
---|---|---|---|---|---|
run_1_thread | 1 | 99.99 | 0 | 0.01 | 0 |
run_2_threads | 2 | 98.06 | 1.92 | 0.01 | 0.01 |
run_4_threads | 4 | 94.41 | 5.58 | 0 | 0 |
run_8_threads | 8 | 87.89 | 12.08 | 0.01 | 0.02 |
run_16_threads | 16 | 77.2 | 22.78 | 0 | 0.02 |
run_26_threads | 26 | 67.05 | 32.9 | 0.01 | 0.04 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | Binary (s) | OMP (s) | System (s) | Pthread (s) |
---|---|---|---|---|---|---|
run_1_thread | 1 | 243.44 | 243.42 | 0 | 0.01 | 0 |
run_2_threads | 2 | 127.16 | 124.69 | 2.44 | 0.01 | 0.01 |
run_4_threads | 4 | 69.05 | 65.2 | 3.85 | 0 | 0 |
run_8_threads | 8 | 39.99 | 35.15 | 4.83 | 0 | 0.01 |
run_16_threads | 16 | 25.47 | 19.66 | 5.8 | 0 | 0.01 |
run_26_threads | 26 | 19.88 | 13.33 | 6.54 | 0 | 0.01 |
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
run_1_thread | 1 | 1 |
run_2_threads | 2 | 0.96 |
run_4_threads | 4 | 0.88 |
run_8_threads | 8 | 0.76 |
run_16_threads | 16 | 0.6 |
run_26_threads | 26 | 0.47 |
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Columns Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0.2 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 98.06 | 1.72 |
run_4_threads | 4 | 0.57 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 94.42 | 5.01 |
run_8_threads | 8 | 1.16 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 87.89 | 0 | 10.94 |
run_16_threads | 16 | 2.31 | 0 | 0.01 | 0 | 0 | 0 | 0 | 77.2 | 0 | 0 | 20.49 |
run_26_threads | 26 | 3.21 | 0.02 | 0 | 0 | 0 | 0 | 0 | 67.05 | 0 | 0 | 29.72 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Columns Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0.2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 1.72 | 98.06 | 0 |
run_4_threads | 4 | 0.57 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5.01 | 94.42 | 0 |
run_8_threads | 8 | 1.16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10.94 | 87.91 | 0 |
run_16_threads | 16 | 2.31 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20.49 | 77.21 | 0 |
run_26_threads | 26 | 3.21 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 29.72 | 67.08 | 0 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_26_threads |
---|---|---|---|---|---|---|
/opt/intel/oneapi/compiler/2024.2/lib/libarcher.so | ||||||
/opt/intel/oneapi/compiler/2024.2/lib/libimf.so | ||||||
/opt/intel/oneapi/compiler/2024.2/lib/libintlc.so.5 | ||||||
/opt/intel/oneapi/compiler/2024.2/lib/libirng.so | ||||||
/opt/intel/oneapi/compiler/2024.2/lib/libsvml.so | ||||||
/usr/lib/ld-linux-x86-64.so.2 | ||||||
/usr/lib/libc.so.6 | ||||||
/usr/lib/libdl.so.2 | ||||||
/usr/lib/libgcc_s.so.1 | ||||||
/usr/lib/libm.so.6 | ||||||
/usr/lib/libomp.so | ||||||
/usr/lib/libpthread.so.0 | ||||||
/usr/lib/libstdc++.so.6.0.34 |