| Run Neoverse V1 ACFL Ofast | Run Neoverse V2 ACFL Ofast |
| - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 114-123
| | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 113-122
|
| ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
| 15 | 70.87 | 1.49 | 1.75 | 64 | 8.16 | 0.18 | 17.04 | 14 | 78.71 | 1.83 | 2.18 | 64 | 6.98 | 0.18 | 460.38 |
| Run Neoverse V1 ACFL Ofast | Run Neoverse V2 ACFL Ofast |
| | | |
| ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
| 1 | 0.05 | 0.00 | 0.01 | 12 | 0.02 | 0.00 | 0.00 | 1 | 0.09 | 0.00 | 0.01 | 23 | 0.08 | 0.00 | 0.00 |
| 528 | 0.99 | 0.02 | 0.05 | 62 | 0.56 | 0.01 | 0.00 | 529 | 0.97 | 0.02 | 0.05 | 62 | 0.60 | 0.01 | 0.00 |
| 1241 | 0.04 | 0.00 | 0.01 | 8 | 0.11 | 0.00 | 0.00 | 1902 | 0.00 | 0.00 | 0.01 | 1 | 0.00 | 0.00 | 0.00 |
| 854 | 19.33 | 0.41 | 0.63 | 64 | 6.21 | 0.11 | 0.00 | 1242 | 0.03 | 0.00 | 0.01 | 8 | 0.08 | 0.00 | 0.00 |
| 437 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00 | 855 | 15.08 | 0.35 | 0.75 | 64 | 6.07 | 0.13 | 0.00 |
| 1137 | 0.83 | 0.02 | 0.04 | 59 | 0.53 | 0.01 | 0.00 | 492 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | NA |
| -1 | 0.00 | 0.00 | 0.00 | 3 | 0.00 | 0.00 | NA | 71 | 0.46 | 0.01 | 0.03 | 54 | 0.32 | 0.01 | 0.00 |
| 16 | 4.65 | 0.11 | 0.13 | 64 | 1.16 | 0.03 | 41.56 |
| -1 | 0.00 | 0.00 | 0.00 | 21 | 0.00 | 0.00 | NA |
| Run Neoverse V1 ACFL Ofast | Run Neoverse V2 ACFL Ofast |
| - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 139-144
| | |
| ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
| 17 | 7.87 | 0.17 | 0.19 | 64 | 2.42 | 0.05 | 1.20 | |
| Name | Module | Coverage (%) | Inclusive Time w.r.t. Wall Time(s) | Max Inc. Time over Threads(s) | Nb Threads | GFLOP/s | Deviation (coverage) | Deviation (time) |
| Neoverse V1 ACFL Ofast | Neoverse V2 ACFL Ofast | Neoverse V1 ACFL Ofast | Neoverse V2 ACFL Ofast | Neoverse V1 ACFL Ofast | Neoverse V2 ACFL Ofast | Neoverse V1 ACFL Ofast | Neoverse V2 ACFL Ofast | Neoverse V1 ACFL Ofast | Neoverse V2 ACFL Ofast | Neoverse V1 ACFL Ofast | Neoverse V2 ACFL Ofast | Neoverse V1 ACFL Ofast | Neoverse V2 ACFL Ofast |
| k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined] | binary | 70.87 | 78.71 | 1.49 | 1.83 | 1.75 | 2.18 | 64 | 64 | 17.04 | 460.38 | 8.16 | 6.98 | 0.18 | 0.18 |
| kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libomp.so | 19.33 | 15.08 | 0.41 | 0.35 | 0.63 | 0.75 | 64 | 64 | 0.00 | 0.00 | 6.21 | 6.07 | 0.11 | 0.13 |
| k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined.3] | binary | 7.87 | 4.65 | 0.17 | 0.11 | 0.19 | 0.13 | 64 | 64 | 1.20 | 41.56 | 2.42 | 1.16 | 0.05 | 0.03 |
| kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libomp.so | 0.99 | 0.97 | 0.02 | 0.02 | 0.05 | 0.05 | 62 | 62 | 0.00 | 0.00 | 0.56 | 0.60 | 0.01 | 0.01 |
| __sched_yield | libc.so.6 | 0.83 | NA | 0.02 | NA | 0.04 | NA | 59 | NA | 0.00 | NA | 0.53 | NA | 0.01 | NA |
| __sched_yield | libc.so.6 | NA | 0.46 | NA | 0.01 | NA | 0.03 | NA | 54 | NA | 0.00 | NA | 0.32 | NA | 0.01 |
| @plt_start@ | libomp.so | 0.05 | 0.09 | 0.00 | 0.00 | 0.01 | 0.01 | 12 | 23 | 0.00 | 0.00 | 0.02 | 0.08 | 0.00 | 0.00 |
| __kmp_yield | libomp.so | 0.04 | 0.03 | 0.00 | 0.00 | 0.01 | 0.01 | 8 | 8 | 0.00 | 0.00 | 0.11 | 0.08 | 0.00 | 0.00 |
| __kmp_resume_if_soft_paused | libomp.so | 0.00 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA |
| __aarch64_ldadd8_acq_rel | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.01 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
| unknown_kernel_region | kernel | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3 | 21 | NA | NA | 0.00 | 0.00 | 0.00 | 0.00 |
| __kmp_invoke_task_func | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | NA | NA | 0.00 | NA | 0.00 |