Detailed Application Categorization |
Detailed Function Times |
Scalability - Coverage per Category |
Scalability - Time per Category |
Scalability - Efficiency |
Function Based Profile |
Scalability - Coverage per Parallel Efficiency |
Scalability - Coverage per Parallel Speedup |
Libraries |
Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | TBB(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
▼run_1_thread | 158.25 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 158.25 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 5273 | 158.25 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 5273) | 158.25 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_2_threads | 89.68 | 99.85 | 0.00 | 0.13 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 89.68 | 99.85 | 0.00 | 0.13 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 5307 | 89.68 | 99.85 | 0.00 | 0.13 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 5307) | 89.68 | 99.86 | 0.00 | 0.12 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 |
○OMP # 1 (TID 5324) | 82.73 | 99.84 | 0.00 | 0.14 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_4_threads | 47.67 | 99.61 | 0.00 | 0.39 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 47.67 | 99.61 | 0.00 | 0.39 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 5340 | 47.67 | 99.61 | 0.00 | 0.39 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 5340) | 47.67 | 99.74 | 0.00 | 0.25 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 5357) | 40.71 | 99.69 | 0.00 | 0.31 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 5358) | 40.72 | 99.50 | 0.00 | 0.50 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 5359) | 40.71 | 99.50 | 0.00 | 0.50 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_8_threads | 29.81 | 99.22 | 0.00 | 0.76 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 29.81 | 99.22 | 0.00 | 0.76 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 5375 | 29.81 | 99.22 | 0.00 | 0.76 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 5375) | 29.81 | 99.58 | 0.00 | 0.39 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 |
○OMP # 1 (TID 5392) | 22.84 | 99.39 | 0.00 | 0.55 | 0.00 | 0.00 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 5393) | 22.83 | 99.08 | 0.00 | 0.92 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 5394) | 22.84 | 99.12 | 0.00 | 0.85 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 5395) | 22.83 | 99.23 | 0.00 | 0.77 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 5396) | 22.84 | 99.06 | 0.00 | 0.90 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 5397) | 22.83 | 99.12 | 0.00 | 0.88 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 5398) | 22.84 | 99.08 | 0.00 | 0.92 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_10_threads | 25.31 | 99.04 | 0.00 | 0.95 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 25.31 | 99.04 | 0.00 | 0.95 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 5415 | 25.31 | 99.04 | 0.00 | 0.95 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 5415) | 25.31 | 99.53 | 0.00 | 0.45 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 5432) | 18.34 | 99.16 | 0.00 | 0.84 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 5433) | 18.35 | 98.94 | 0.00 | 1.04 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 5434) | 18.35 | 98.86 | 0.00 | 1.14 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 5435) | 18.35 | 99.10 | 0.00 | 0.87 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 5436) | 18.35 | 98.86 | 0.00 | 1.14 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 5437) | 18.35 | 98.88 | 0.00 | 1.12 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 5438) | 18.35 | 98.77 | 0.00 | 1.17 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 8 (TID 5439) | 18.34 | 98.96 | 0.00 | 1.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 9 (TID 5440) | 18.34 | 99.18 | 0.00 | 0.82 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | Binary (%) | OMP (%) | System (%) |
---|---|---|---|---|
run_1_thread | 1 | 99.98 | 0 | 0.02 |
run_2_threads | 2 | 99.85 | 0.13 | 0.02 |
run_4_threads | 4 | 99.61 | 0.39 | 0 |
run_8_threads | 8 | 99.22 | 0.76 | 0.02 |
run_10_threads | 10 | 99.04 | 0.95 | 0.01 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | Binary (s) | OMP (s) | System (s) |
---|---|---|---|---|---|
run_1_thread | 1 | 158.25 | 158.21 | 0 | 0.03 |
run_2_threads | 2 | 89.68 | 89.55 | 0.12 | 0.02 |
run_4_threads | 4 | 47.67 | 47.49 | 0.18 | 0 |
run_8_threads | 8 | 29.81 | 29.57 | 0.23 | 0.01 |
run_10_threads | 10 | 25.31 | 25.07 | 0.24 | 0 |
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
run_1_thread | 1 | 1 |
run_2_threads | 2 | 0.88 |
run_4_threads | 4 | 0.83 |
run_8_threads | 8 | 0.66 |
run_10_threads | 10 | 0.62 |
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Columns Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 95.71 | 4.15 | 0.13 |
run_4_threads | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 99.61 | 0 | 0.39 |
run_8_threads | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 95.47 | 3.77 | 0 | 0 | 0.76 |
run_10_threads | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 95.28 | 3.76 | 0 | 0.01 | 0.95 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Columns Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.13 | 99.87 | 0 |
run_4_threads | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.39 | 99.61 | 0 |
run_8_threads | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.76 | 99.24 | 0 |
run_10_threads | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.95 | 99.05 | 0 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
---|---|---|---|---|---|
/usr/lib/ld-linux-x86-64.so.2 | |||||
/usr/lib/libc.so.6 | |||||
/usr/lib/libgcc_s.so.1 | |||||
/usr/lib/libgomp.so.1.0.0 | |||||
/usr/lib/libm.so.6 | |||||
/usr/lib/libstdc++.so.6.0.34 |