Run | Compiler / Flags | Source Location | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads (s) | Nb Threads | Deviation (coverage) | Deviation (time) | GFLOP/s |
SKYLAKE | ICPX O3 + More Aggressive Flags | /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 58-70 | 10 | 75.41 | 21.34 | 17.13 | 8 | 6.24 | 0.00 | 29.28 |
NEOVERSE V1 | ACFL O3 + Funroll + Ffastmath | /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 58-67 | 13 | 70.84 | 14.01 | 10.84 | 8 | 6.86 | 0.02 | 0.29 |
NEOVERSE V2 | G++ O3 + Funroll | /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 58-67 | 9 | 91.72 | 16.24 | 10.32 | 8 | 14.75 | 0.01 | 24.96 |
Run | Compiler / Flags | Source Location | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads (s) | Nb Threads | Deviation (coverage) | Deviation (time) | GFLOP/s |
CASCADE LAKE | ICPX O3 + More Aggressive Flags | /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 70-82 | 11 | 70.35 | 14.43 | 12.53 | 8 | 4.12 | 0.07 | 43.31 |
Run | Compiler / Flags | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads (s) | Nb Threads | Deviation (coverage) | Deviation (time) | GFLOP/s |
CASCADE LAKE | ICPX O3 + More Aggressive Flags | 1164 | 0.03 | 0.01 | 0.01 | 4 | 0.03 | 0.01 | 0.00 |
CASCADE LAKE | ICPX O3 + More Aggressive Flags | 1106 | 0.60 | 0.12 | 0.15 | 8 | 0.26 | 0.04 | 0.00 |
CASCADE LAKE | ICPX O3 + More Aggressive Flags | 2813 | 0.01 | 0.00 | 0.01 | 3 | 0.00 | 0.00 | 0.00 |
CASCADE LAKE | ICPX O3 + More Aggressive Flags | 1102 | 23.45 | 4.81 | 4.74 | 8 | 9.10 | 1.55 | 0.00 |
CASCADE LAKE | ICPX O3 + More Aggressive Flags | -1 | 0.03 | 0.01 | 0.02 | 3 | 0.05 | 0.01 | 0.00 |
SKYLAKE | ICPX O3 + More Aggressive Flags | 1164 | 0.01 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | 0.00 |
SKYLAKE | ICPX O3 + More Aggressive Flags | 1106 | 0.19 | 0.05 | 0.08 | 7 | 0.06 | 0.01 | 0.00 |
SKYLAKE | ICPX O3 + More Aggressive Flags | 2813 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00 |
SKYLAKE | ICPX O3 + More Aggressive Flags | 1102 | 18.22 | 5.16 | 4.75 | 7 | 0.05 | 0.01 | 0.00 |
SKYLAKE | ICPX O3 + More Aggressive Flags | -1 | 0.02 | 0.01 | 0.02 | 4 | 0.03 | 0.01 | 0.00 |
NEOVERSE V1 | ACFL O3 + Funroll + Ffastmath | 1 | 0.07 | 0.01 | 0.02 | 7 | 0.03 | 0.00 | 0.00 |
NEOVERSE V1 | ACFL O3 + Funroll + Ffastmath | 528 | 1.69 | 0.33 | 0.39 | 8 | 0.75 | 0.11 | 0.00 |
NEOVERSE V1 | ACFL O3 + Funroll + Ffastmath | 1241 | 0.04 | 0.01 | 0.01 | 5 | 0.03 | 0.00 | 0.00 |
NEOVERSE V1 | ACFL O3 + Funroll + Ffastmath | 854 | 19.09 | 3.78 | 3.42 | 8 | 8.00 | 1.17 | 0.00 |
NEOVERSE V1 | ACFL O3 + Funroll + Ffastmath | 1281 | 0.01 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | 0.00 |
NEOVERSE V1 | ACFL O3 + Funroll + Ffastmath | -1 | 0.12 | 0.02 | 0.03 | 7 | 0.06 | 0.01 | 0.00 |
NEOVERSE V1 | ACFL O3 + Funroll + Ffastmath | 1137 | 0.82 | 0.16 | 0.19 | 7 | 0.16 | 0.02 | 0.00 |
NEOVERSE V2 | G++ O3 + Funroll | 280 | 0.04 | 0.01 | 0.01 | 5 | 0.04 | 0.00 | 0.00 |
NEOVERSE V2 | G++ O3 + Funroll | 276 | 0.01 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | 0.00 |
NEOVERSE V2 | G++ O3 + Funroll | -1 | 0.00 | 0.00 | 0.00 | 8 | 0.00 | 0.00 | NA |
Run | Compiler / Flags | Source Location | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads (s) | Nb Threads | Deviation (coverage) | Deviation (time) | GFLOP/s |
SKYLAKE | ICPX O3 + More Aggressive Flags | /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 55-59; /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 73-96 | 8 | 6.15 | 1.74 | 11.17 | 1 | 0.00 | 0.00 | 2.87 |
NEOVERSE V1 | ACFL O3 + Funroll + Ffastmath | /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 55-58; /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 73-96 | 12 | 7.33 | 1.45 | 8.94 | 1 | 0.00 | 0.00 | 1.30 |
NEOVERSE V2 | G++ O3 + Funroll | /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 55-58; /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 73-96 | 13 | 8.23 | 1.46 | 7.39 | 1 | 0.00 | 0.00 | 3.42 |
Run | Compiler / Flags | Source Location | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads (s) | Nb Threads | Deviation (coverage) | Deviation (time) | GFLOP/s |
CASCADE LAKE | ICPX O3 + More Aggressive Flags | /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 67-71; /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 85-108 | 9 | 5.52 | 1.13 | 7.81 | 1 | 0.00 | 0.00 | 4.41 |
Run configurations: CASCADE LAKE = ICPX O3 + More Aggressive Flags; SKYLAKE = ICPX O3 + More Aggressive Flags; NEOVERSE V1 = ACFL O3 + Funroll + Ffastmath; NEOVERSE V2 = G++ O3 + Funroll
Name | Module | Coverage (%): CASCADE LAKE | Coverage (%): SKYLAKE | Coverage (%): NEOVERSE V1 | Coverage (%): NEOVERSE V2 | Inclusive Time w.r.t. Wall Time (s): CASCADE LAKE | Inclusive Time w.r.t. Wall Time (s): SKYLAKE | Inclusive Time w.r.t. Wall Time (s): NEOVERSE V1 | Inclusive Time w.r.t. Wall Time (s): NEOVERSE V2 | Max Inc. Time over Threads (s): CASCADE LAKE | Max Inc. Time over Threads (s): SKYLAKE | Max Inc. Time over Threads (s): NEOVERSE V1 | Max Inc. Time over Threads (s): NEOVERSE V2 | Nb Threads: CASCADE LAKE | Nb Threads: SKYLAKE | Nb Threads: NEOVERSE V1 | Nb Threads: NEOVERSE V2 | GFLOP/s: CASCADE LAKE | GFLOP/s: SKYLAKE | GFLOP/s: NEOVERSE V1 | GFLOP/s: NEOVERSE V2 | Deviation (coverage): CASCADE LAKE | Deviation (coverage): SKYLAKE | Deviation (coverage): NEOVERSE V1 | Deviation (coverage): NEOVERSE V2 | Deviation (time): CASCADE LAKE | Deviation (time): SKYLAKE | Deviation (time): NEOVERSE V1 | Deviation (time): NEOVERSE V2 |
k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone .extracted] | binary | 70.35 | 75.41 | NA | NA | 14.43 | 21.34 | NA | NA | 12.53 | 17.13 | NA | NA | 8 | 8 | NA | NA | 43.31 | 29.28 | NA | NA | 4.12 | 6.24 | NA | NA | 0.07 | 0.00 | NA | NA |
k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone ._omp_fn.0] | binary | NA | NA | NA | 91.72 | NA | NA | NA | 16.24 | NA | NA | NA | 10.32 | NA | NA | NA | 8 | NA | NA | NA | 24.96 | NA | NA | NA | 14.75 | NA | NA | NA | 0.01 |
k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone .omp_outlined] | binary | NA | NA | 70.84 | NA | NA | NA | 14.01 | NA | NA | NA | 10.84 | NA | NA | NA | 8 | NA | NA | NA | 0.29 | NA | NA | NA | 6.86 | NA | NA | NA | 0.02 | NA |
kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libiomp5.so | 23.45 | 18.22 | NA | NA | 4.81 | 5.16 | NA | NA | 4.74 | 4.75 | NA | NA | 8 | 7 | NA | NA | 0.00 | 0.00 | NA | NA | 9.10 | 0.05 | NA | NA | 1.55 | 0.01 | NA | NA |
k_means(int, point_t*, point_t*, int*, point_t*, int, int) | binary | 5.52 | 6.15 | 7.33 | 8.23 | 1.13 | 1.74 | 1.45 | 1.46 | 7.81 | 11.17 | 8.94 | 7.39 | 1 | 1 | 1 | 1 | 4.41 | 2.87 | 1.30 | 3.42 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libomp.so | NA | NA | 19.09 | NA | NA | NA | 3.78 | NA | NA | NA | 3.42 | NA | NA | NA | 8 | NA | NA | NA | 0.00 | NA | NA | NA | 8.00 | NA | NA | NA | 1.17 | NA |
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libomp.so | NA | NA | 1.69 | NA | NA | NA | 0.33 | NA | NA | NA | 0.39 | NA | NA | NA | 8 | NA | NA | NA | 0.00 | NA | NA | NA | 0.75 | NA | NA | NA | 0.11 | NA |
__sched_yield | libc.so.6 | 0.03 | 0.01 | 0.82 | NA | 0.01 | 0.00 | 0.16 | NA | 0.01 | 0.00 | 0.19 | NA | 4 | 2 | 7 | NA | 0.00 | 0.00 | 0.00 | NA | 0.03 | 0.00 | 0.16 | NA | 0.01 | 0.00 | 0.02 | NA |
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libiomp5.so | 0.60 | 0.19 | NA | NA | 0.12 | 0.05 | NA | NA | 0.15 | 0.08 | NA | NA | 8 | 7 | NA | NA | 0.00 | 0.00 | NA | NA | 0.26 | 0.06 | NA | NA | 0.04 | 0.01 | NA | NA |
unknown_function | [vdso] | NA | NA | 0.12 | NA | NA | NA | 0.02 | NA | NA | NA | 0.03 | NA | NA | NA | 7 | NA | NA | NA | 0.00 | NA | NA | NA | 0.06 | NA | NA | NA | 0.01 | NA |
@plt_start@ | libomp.so | NA | NA | 0.07 | NA | NA | NA | 0.01 | NA | NA | NA | 0.02 | NA | NA | NA | 7 | NA | NA | NA | 0.00 | NA | NA | NA | 0.03 | NA | NA | NA | 0.00 | NA |
unknown_kernel_region | kernel | 0.03 | 0.02 | NA | 0.00 | 0.01 | 0.01 | NA | 0.00 | 0.02 | 0.02 | NA | 0.00 | 3 | 4 | NA | 8 | 0.00 | 0.00 | NA | NA | 0.05 | 0.03 | NA | 0.00 | 0.01 | 0.01 | NA | 0.00 |
gomp_team_barrier_wait_end | libgomp.so.1.0.0 | NA | NA | NA | 0.04 | NA | NA | NA | 0.01 | NA | NA | NA | 0.01 | NA | NA | NA | 5 | NA | NA | NA | 0.00 | NA | NA | NA | 0.04 | NA | NA | NA | 0.00 |
__kmp_yield | libomp.so | NA | NA | 0.04 | NA | NA | NA | 0.01 | NA | NA | NA | 0.01 | NA | NA | NA | 5 | NA | NA | NA | 0.00 | NA | NA | NA | 0.03 | NA | NA | NA | 0.00 | NA |
__kmp_yield | libiomp5.so | 0.01 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA | 0.01 | 0.00 | NA | NA | 3 | 1 | NA | NA | 0.00 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA |
gomp_barrier_wait_end | libgomp.so.1.0.0 | NA | NA | NA | 0.01 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 2 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 |
__kmp_now_nsec | libomp.so | NA | NA | 0.01 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 2 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA |