Run G++ O3 + Funroll | Run Clang++ O3 + Ffastmath | Run ICPX O3 + More Aggressive Flags |
| - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 58-67
| | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 58-67
| | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 58-70
|
ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
7 | 86.03 | 14.39 | 5.86 | 26 | 11.08 | 0.01 | 43.41 | 8 | 57.68 | 10.39 | 6.86 | 26 | 4.10 | 0.01 | 60.12 | 10 | 51.15 | 8.41 | 5.31 | 26 | 3.99 | 0.01 | 74.26 |
Run G++ O3 + Funroll | Run Clang++ O3 + Ffastmath | Run ICPX O3 + More Aggressive Flags |
| | | | | |
ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
299 | 7.56 | 1.27 | 0.56 | 25 | 0.18 | 0.01 | 0.00 | 1164 | 0.01 | 0.00 | 0.01 | 8 | 0.02 | 0.00 | 0.00 | 1164 | 0.01 | 0.00 | 0.01 | 8 | 0.00 | 0.00 | 0.00 |
303 | 0.24 | 0.04 | 0.04 | 26 | 0.16 | 0.01 | 0.00 | 1866 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00 | 1106 | 0.56 | 0.09 | 0.08 | 25 | 0.13 | 0.01 | 0.00 |
-1 | 0.01 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | 0.00 | 596 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00 | 1095 | 0.00 | 0.00 | 0.01 | 1 | 0.00 | 0.00 | 0.00 |
| 700 | 0.01 | 0.00 | 0.01 | 4 | 0.02 | 0.00 | 0.00 | 2813 | 0.00 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | 0.00 |
| 651 | 34.95 | 6.29 | 4.42 | 26 | 7.29 | 0.85 | 0.00 | 1102 | 44.06 | 7.24 | 4.76 | 25 | 0.20 | 0.02 | 0.00 |
| 1055 | 3.71 | 0.67 | 0.55 | 25 | 0.43 | 0.05 | 0.00 | -1 | 0.04 | 0.01 | 0.01 | 16 | 0.03 | 0.00 | 0.00 |
| -1 | 0.03 | 0.00 | 0.01 | 12 | 0.02 | 0.00 | 0.00 | |
Run G++ O3 + Funroll | Run Clang++ O3 + Ffastmath | Run ICPX O3 + More Aggressive Flags |
| - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 55-58
- /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 73-96
| | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 55-58
- /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 73-96
| | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 55-59
- /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 73-96
|
ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
11 | 6.16 | 1.03 | 10.88 | 1 | 0.00 | 0.00 | 4.85 | 7 | 3.61 | 0.65 | 11.15 | 1 | 0.00 | 0.00 | 7.69 | 8 | 4.17 | 0.69 | 11.16 | 1 | 0.00 | 0.00 | 7.30 |
Name | Module | Coverage (%) | Inclusive Time w.r.t. Wall Time(s) | Max Inc. Time over Threads(s) | Nb Threads | GFLOP/s | Deviation (coverage) | Deviation (time) |
G++ O3 + Funroll | Clang++ O3 + Ffastmath | ICPX O3 + More Aggressive Flags | G++ O3 + Funroll | Clang++ O3 + Ffastmath | ICPX O3 + More Aggressive Flags | G++ O3 + Funroll | Clang++ O3 + Ffastmath | ICPX O3 + More Aggressive Flags | G++ O3 + Funroll | Clang++ O3 + Ffastmath | ICPX O3 + More Aggressive Flags | G++ O3 + Funroll | Clang++ O3 + Ffastmath | ICPX O3 + More Aggressive Flags | G++ O3 + Funroll | Clang++ O3 + Ffastmath | ICPX O3 + More Aggressive Flags | G++ O3 + Funroll | Clang++ O3 + Ffastmath | ICPX O3 + More Aggressive Flags |
k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone ._omp_fn.0] | binary | 86.03 | NA | NA | 14.39 | NA | NA | 5.86 | NA | NA | 26 | NA | NA | 43.41 | NA | NA | 11.08 | NA | NA | 0.01 | NA | NA |
k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone .omp_outlined] | binary | NA | 57.68 | NA | NA | 10.39 | NA | NA | 6.86 | NA | NA | 26 | NA | NA | 60.12 | NA | NA | 4.10 | NA | NA | 0.01 | NA |
k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone .extracted] | binary | NA | NA | 51.15 | NA | NA | 8.41 | NA | NA | 5.31 | NA | NA | 26 | NA | NA | 74.26 | NA | NA | 3.99 | NA | NA | 0.01 |
kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libiomp5.so | NA | NA | 44.06 | NA | NA | 7.24 | NA | NA | 4.76 | NA | NA | 25 | NA | NA | 0.00 | NA | NA | 0.20 | NA | NA | 0.02 |
__kmpc_threadprivate_register_vec | libomp.so | NA | 34.95 | NA | NA | 6.29 | NA | NA | 4.42 | NA | NA | 26 | NA | NA | 0.00 | NA | NA | 7.29 | NA | NA | 0.85 | NA |
k_means(int, point_t*, point_t*, int*, point_t*, int, int) | binary | 6.16 | 3.61 | 4.17 | 1.03 | 0.65 | 0.69 | 10.88 | 11.15 | 11.16 | 1 | 1 | 1 | 4.85 | 7.69 | 7.30 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
gomp_barrier_wait_end | libgomp.so.1.0.0 | 7.56 | NA | NA | 1.27 | NA | NA | 0.56 | NA | NA | 25 | NA | NA | 0.00 | NA | NA | 0.18 | NA | NA | 0.01 | NA | NA |
__kmp_invoke_microtask | libomp.so | NA | 3.71 | NA | NA | 0.67 | NA | NA | 0.55 | NA | NA | 25 | NA | NA | 0.00 | NA | NA | 0.43 | NA | NA | 0.05 | NA |
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libiomp5.so | NA | NA | 0.56 | NA | NA | 0.09 | NA | NA | 0.08 | NA | NA | 25 | NA | NA | 0.00 | NA | NA | 0.13 | NA | NA | 0.01 |
gomp_team_barrier_wait_end | libgomp.so.1.0.0 | 0.24 | NA | NA | 0.04 | NA | NA | 0.04 | NA | NA | 26 | NA | NA | 0.00 | NA | NA | 0.16 | NA | NA | 0.01 | NA | NA |
unknown_kernel_region | kernel | 0.01 | 0.03 | 0.04 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.01 | 2 | 12 | 16 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.03 | 0.00 | 0.00 | 0.00 |
__sched_yield | libc.so.6 | NA | 0.01 | 0.01 | NA | 0.00 | 0.00 | NA | 0.01 | 0.01 | NA | 8 | 8 | NA | 0.00 | 0.00 | NA | 0.02 | 0.00 | NA | 0.00 | 0.00 |
__kmp_reap_worker | libomp.so | NA | 0.01 | NA | NA | 0.00 | NA | NA | 0.01 | NA | NA | 4 | NA | NA | 0.00 | NA | NA | 0.02 | NA | NA | 0.00 | NA |
__kmp_yield | libiomp5.so | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 2 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 |
__kmp_fork_barrier(int, int) | libiomp5.so | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.01 | NA | NA | 1 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 |
__libc_alloca_cutoff | libc.so.6 | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 1 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA |
__resolv_context_put | libc.so.6 | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 1 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA |