| Run ICPX O3 + More Aggressive Flags | Run ICPX O3 + More Aggressive Flags | Run ACFL O3 + Vectorize + Funroll + Ffastmath | Run G++ O3 + Funroll |
| | | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 58-70
| | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 58-67
| | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 58-67
|
| ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
| 10 | 75.41 | 21.34 | 17.13 | 8 | 6.24 | 0.00 | 29.28 | 13 | 70.82 | 14.12 | 10.85 | 8 | 6.91 | 0.00 | 0.03 | 9 | 91.72 | 16.24 | 10.32 | 8 | 14.75 | 0.01 | 24.96 |
| Run ICPX O3 + More Aggressive Flags | Run ICPX O3 + More Aggressive Flags | Run ACFL O3 + Vectorize + Funroll + Ffastmath | Run G++ O3 + Funroll |
| - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 70-82
| | | | | | |
| ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
| 11 | 70.35 | 14.43 | 12.53 | 8 | 4.12 | 0.07 | 43.31 | | | |
| Run ICPX O3 + More Aggressive Flags | Run ICPX O3 + More Aggressive Flags | Run ACFL O3 + Vectorize + Funroll + Ffastmath | Run G++ O3 + Funroll |
| | | | | | | |
| ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
| 1164 | 0.03 | 0.01 | 0.01 | 4 | 0.03 | 0.01 | 0.00 | 1164 | 0.01 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | 0.00 | 1 | 0.05 | 0.01 | 0.01 | 7 | 0.02 | 0.00 | 0.00 | 280 | 0.04 | 0.01 | 0.01 | 5 | 0.04 | 0.00 | 0.00 |
| 1106 | 0.60 | 0.12 | 0.15 | 8 | 0.26 | 0.04 | 0.00 | 1106 | 0.19 | 0.05 | 0.08 | 7 | 0.06 | 0.01 | 0.00 | 528 | 1.69 | 0.34 | 0.33 | 7 | 0.29 | 0.04 | 0.00 | 276 | 0.01 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | 0.00 |
| 2813 | 0.01 | 0.00 | 0.01 | 3 | 0.00 | 0.00 | 0.00 | 2813 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00 | 1241 | 0.02 | 0.00 | 0.01 | 7 | 0.01 | 0.00 | 0.00 | -1 | 0.00 | 0.00 | 0.00 | 8 | 0.00 | 0.00 | NA |
| 1102 | 23.45 | 4.81 | 4.74 | 8 | 9.10 | 1.55 | 0.00 | 1102 | 18.22 | 5.16 | 4.75 | 7 | 0.05 | 0.01 | 0.00 | 854 | 18.98 | 3.78 | 3.46 | 8 | 8.02 | 1.18 | 0.00 | |
| -1 | 0.03 | 0.01 | 0.02 | 3 | 0.05 | 0.01 | 0.00 | -1 | 0.02 | 0.01 | 0.02 | 4 | 0.03 | 0.01 | 0.00 | 1281 | 0.02 | 0.00 | 0.01 | 7 | 0.02 | 0.00 | 0.00 | |
| | -1 | 0.15 | 0.03 | 0.03 | 7 | 0.04 | 0.01 | 0.00 | |
| | 10 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00 | |
| | 1137 | 0.87 | 0.17 | 0.16 | 7 | 0.08 | 0.01 | 0.00 | |
| | 789 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00 | |
| | 3038 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00 | |
| | -1 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | NA | |
| Run ICPX O3 + More Aggressive Flags | Run ICPX O3 + More Aggressive Flags | Run ACFL O3 + Vectorize + Funroll + Ffastmath | Run G++ O3 + Funroll |
| | | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 55-59
- /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 73-96
| | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 55-58
- /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 73-96
| | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 55-58
- /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 73-96
|
| ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
| 8 | 6.15 | 1.74 | 11.17 | 1 | 0.00 | 0.00 | 2.87 | 12 | 7.40 | 1.48 | 9.07 | 1 | 0.00 | 0.00 | 0.88 | 13 | 8.23 | 1.46 | 7.39 | 1 | 0.00 | 0.00 | 3.42 |
| Run ICPX O3 + More Aggressive Flags | Run ICPX O3 + More Aggressive Flags | Run ACFL O3 + Vectorize + Funroll + Ffastmath | Run G++ O3 + Funroll |
| - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 67-71
- /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 85-108
| | | | | | |
| ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
| 9 | 5.52 | 1.13 | 7.81 | 1 | 0.00 | 0.00 | 4.41 | | | |
| Name | Module | Coverage (%) | Inclusive Time w.r.t. Wall Time(s) | Max Inc. Time over Threads(s) | Nb Threads | GFLOP/s | Deviation (coverage) | Deviation (time) |
| ICPX O3 + More Aggressive Flags | ICPX O3 + More Aggressive Flags | ACFL O3 + Vectorize + Funroll + Ffastmath | G++ O3 + Funroll | ICPX O3 + More Aggressive Flags | ICPX O3 + More Aggressive Flags | ACFL O3 + Vectorize + Funroll + Ffastmath | G++ O3 + Funroll | ICPX O3 + More Aggressive Flags | ICPX O3 + More Aggressive Flags | ACFL O3 + Vectorize + Funroll + Ffastmath | G++ O3 + Funroll | ICPX O3 + More Aggressive Flags | ICPX O3 + More Aggressive Flags | ACFL O3 + Vectorize + Funroll + Ffastmath | G++ O3 + Funroll | ICPX O3 + More Aggressive Flags | ICPX O3 + More Aggressive Flags | ACFL O3 + Vectorize + Funroll + Ffastmath | G++ O3 + Funroll | ICPX O3 + More Aggressive Flags | ICPX O3 + More Aggressive Flags | ACFL O3 + Vectorize + Funroll + Ffastmath | G++ O3 + Funroll | ICPX O3 + More Aggressive Flags | ICPX O3 + More Aggressive Flags | ACFL O3 + Vectorize + Funroll + Ffastmath | G++ O3 + Funroll |
| k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone .extracted] | binary | 70.35 | 75.41 | NA | NA | 14.43 | 21.34 | NA | NA | 12.53 | 17.13 | NA | NA | 8 | 8 | NA | NA | 43.31 | 29.28 | NA | NA | 4.12 | 6.24 | NA | NA | 0.07 | 0.00 | NA | NA |
| k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone ._omp_fn.0] | binary | NA | NA | NA | 91.72 | NA | NA | NA | 16.24 | NA | NA | NA | 10.32 | NA | NA | NA | 8 | NA | NA | NA | 24.96 | NA | NA | NA | 14.75 | NA | NA | NA | 0.01 |
| k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone .omp_outlined] | binary | NA | NA | 70.82 | NA | NA | NA | 14.12 | NA | NA | NA | 10.85 | NA | NA | NA | 8 | NA | NA | NA | 0.03 | NA | NA | NA | 6.91 | NA | NA | NA | 0.00 | NA |
| kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libiomp5.so | 23.45 | 18.22 | NA | NA | 4.81 | 5.16 | NA | NA | 4.74 | 4.75 | NA | NA | 8 | 7 | NA | NA | 0.00 | 0.00 | NA | NA | 9.10 | 0.05 | NA | NA | 1.55 | 0.01 | NA | NA |
| k_means(int, point_t*, point_t*, int*, point_t*, int, int) | binary | 5.52 | 6.15 | 7.40 | 8.23 | 1.13 | 1.74 | 1.48 | 1.46 | 7.81 | 11.17 | 9.07 | 7.39 | 1 | 1 | 1 | 1 | 4.41 | 2.87 | 0.88 | 3.42 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libomp.so | NA | NA | 18.98 | NA | NA | NA | 3.78 | NA | NA | NA | 3.46 | NA | NA | NA | 8 | NA | NA | NA | 0.00 | NA | NA | NA | 8.02 | NA | NA | NA | 1.18 | NA |
| kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libomp.so | NA | NA | 1.69 | NA | NA | NA | 0.34 | NA | NA | NA | 0.33 | NA | NA | NA | 7 | NA | NA | NA | 0.00 | NA | NA | NA | 0.29 | NA | NA | NA | 0.04 | NA |
| __sched_yield | libc.so.6 | 0.03 | 0.01 | 0.87 | NA | 0.01 | 0.00 | 0.17 | NA | 0.01 | 0.00 | 0.16 | NA | 4 | 2 | 7 | NA | 0.00 | 0.00 | 0.00 | NA | 0.03 | 0.00 | 0.08 | NA | 0.01 | 0.00 | 0.01 | NA |
| kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libiomp5.so | 0.60 | 0.19 | NA | NA | 0.12 | 0.05 | NA | NA | 0.15 | 0.08 | NA | NA | 8 | 7 | NA | NA | 0.00 | 0.00 | NA | NA | 0.26 | 0.06 | NA | NA | 0.04 | 0.01 | NA | NA |
| unknown_function | [vdso] | NA | NA | 0.15 | NA | NA | NA | 0.03 | NA | NA | NA | 0.03 | NA | NA | NA | 7 | NA | NA | NA | 0.00 | NA | NA | NA | 0.04 | NA | NA | NA | 0.01 | NA |
| @plt_start@ | libomp.so | NA | NA | 0.05 | NA | NA | NA | 0.01 | NA | NA | NA | 0.01 | NA | NA | NA | 7 | NA | NA | NA | 0.00 | NA | NA | NA | 0.02 | NA | NA | NA | 0.00 | NA |
| unknown_kernel_region | kernel | 0.03 | 0.02 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 | 0.00 | 3 | 4 | 1 | 8 | 0.00 | 0.00 | NA | NA | 0.05 | 0.03 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 |
| gomp_team_barrier_wait_end | libgomp.so.1.0.0 | NA | NA | NA | 0.04 | NA | NA | NA | 0.01 | NA | NA | NA | 0.01 | NA | NA | NA | 5 | NA | NA | NA | 0.00 | NA | NA | NA | 0.04 | NA | NA | NA | 0.00 |
| __kmp_now_nsec | libomp.so | NA | NA | 0.02 | NA | NA | NA | 0.00 | NA | NA | NA | 0.01 | NA | NA | NA | 7 | NA | NA | NA | 0.00 | NA | NA | NA | 0.02 | NA | NA | NA | 0.00 | NA |
| __kmp_yield | libomp.so | NA | NA | 0.02 | NA | NA | NA | 0.00 | NA | NA | NA | 0.01 | NA | NA | NA | 7 | NA | NA | NA | 0.00 | NA | NA | NA | 0.01 | NA | NA | NA | 0.00 | NA |
| __kmp_yield | libiomp5.so | 0.01 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA | 0.01 | 0.00 | NA | NA | 3 | 1 | NA | NA | 0.00 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA |
| gomp_barrier_wait_end | libgomp.so.1.0.0 | NA | NA | NA | 0.01 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 2 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 |
| std::ostream& std::ostream::_M_insert<double>(double) | libstdc++.so.6.0.33 | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 1 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA |
| _dl_rtld_di_serinfo | ld-linux-aarch64.so.1 | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 1 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA |
| __default_morecore | libc.so.6 | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 1 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA |