options

Functions and Loops

Columns Filter

Coverage run_1_thread (%) Coverage run_2_threads (%) Coverage run_4_threads (%) Coverage run_8_threads (%) Coverage run_16_threads (%) Coverage run_32_threads (%) Coverage run_48_threads (%) Coverage run_64_threads (%) Coverage Excluding Loops run_1_thread (%) Coverage Excluding Loops run_2_threads (%) Coverage Excluding Loops run_4_threads (%) Coverage Excluding Loops run_8_threads (%) Coverage Excluding Loops run_16_threads (%) Coverage Excluding Loops run_32_threads (%) Coverage Excluding Loops run_48_threads (%) Coverage Excluding Loops run_64_threads (%) Max Inclusive Time Over Threads run_1_thread (s) Max Inclusive Time Over Threads run_2_threads (s) Max Inclusive Time Over Threads run_4_threads (s) Max Inclusive Time Over Threads run_8_threads (s) Max Inclusive Time Over Threads run_16_threads (s) Max Inclusive Time Over Threads run_32_threads (s) Max Inclusive Time Over Threads run_48_threads (s) Max Inclusive Time Over Threads run_64_threads (s) Max Exclusive Time Over Threads run_1_thread (s) Max Exclusive Time Over Threads run_2_threads (s) Max Exclusive Time Over Threads run_4_threads (s) Max Exclusive Time Over Threads run_8_threads (s) Max Exclusive Time Over Threads run_16_threads (s) Max Exclusive Time Over Threads run_32_threads (s) Max Exclusive Time Over Threads run_48_threads (s) Max Exclusive Time Over Threads run_64_threads (s) Inclusive Time w.r.t. Wall Time run_1_thread (s) Inclusive Time w.r.t. Wall Time run_2_threads (s) Inclusive Time w.r.t. Wall Time run_4_threads (s) Inclusive Time w.r.t. Wall Time run_8_threads (s) Inclusive Time w.r.t. Wall Time run_16_threads (s) Inclusive Time w.r.t. Wall Time run_32_threads (s) Inclusive Time w.r.t. Wall Time run_48_threads (s) Inclusive Time w.r.t. Wall Time run_64_threads (s) Exclusive Time w.r.t. Wall Time run_1_thread (s) Exclusive Time w.r.t. Wall Time run_2_threads (s) Exclusive Time w.r.t. Wall Time run_4_threads (s) Exclusive Time w.r.t. Wall Time run_8_threads (s) Exclusive Time w.r.t. Wall Time run_16_threads (s) Exclusive Time w.r.t. Wall Time run_32_threads (s) Exclusive Time w.r.t. Wall Time run_48_threads (s) Exclusive Time w.r.t. Wall Time run_64_threads (s) Nb Threads run_1_thread Nb Threads run_2_threads Nb Threads run_4_threads Nb Threads run_8_threads Nb Threads run_16_threads Nb Threads run_32_threads Nb Threads run_48_threads Nb Threads run_64_threads Deviation (coverage) run_1_thread Deviation (coverage) run_2_threads Deviation (coverage) run_4_threads Deviation (coverage) run_8_threads Deviation (coverage) run_16_threads Deviation (coverage) run_32_threads Deviation (coverage) run_48_threads Deviation (coverage) run_64_threads Deviation (walltime) run_1_thread Deviation (walltime) run_2_threads Deviation (walltime) run_4_threads Deviation (walltime) run_8_threads Deviation (walltime) run_16_threads Deviation (walltime) run_32_threads Deviation (walltime) run_48_threads Deviation (walltime) run_64_threads Categories run_1_thread Categories run_2_threads Categories run_4_threads Categories run_8_threads Categories run_16_threads Categories run_32_threads Categories run_48_threads Categories run_64_threads GFLOPS run_1_thread GFLOPS run_2_threads GFLOPS run_4_threads GFLOPS run_8_threads GFLOPS run_16_threads GFLOPS run_32_threads GFLOPS run_48_threads GFLOPS run_64_threads Compilation Options (run_1_thread) Efficiency (run_1_thread) Potential Speed-Up (%) (run_2_threads) Efficiency (run_2_threads) Potential Speed-Up (%) (run_4_threads) Efficiency (run_4_threads) Potential Speed-Up (%) (run_8_threads) Efficiency (run_8_threads) Potential Speed-Up (%) (run_16_threads) Efficiency (run_16_threads) Potential Speed-Up (%) (run_32_threads) Efficiency (run_32_threads) Potential Speed-Up (%) (run_48_threads) Efficiency (run_48_threads) Potential Speed-Up (%) (run_64_threads) Efficiency (run_64_threads) Potential Speed-Up (%)
NameModuleCoverage run_1_thread (%)Coverage run_2_threads (%)Coverage run_4_threads (%)Coverage run_8_threads (%)Coverage run_16_threads (%)Coverage run_32_threads (%)Coverage run_48_threads (%)Coverage run_64_threads (%)Coverage Excluding Loops run_1_thread (%)Coverage Excluding Loops run_2_threads (%)Coverage Excluding Loops run_4_threads (%)Coverage Excluding Loops run_8_threads (%)Coverage Excluding Loops run_16_threads (%)Coverage Excluding Loops run_32_threads (%)Coverage Excluding Loops run_48_threads (%)Coverage Excluding Loops run_64_threads (%)Max Inclusive Time Over Threads run_1_thread (s)Max Inclusive Time Over Threads run_2_threads (s)Max Inclusive Time Over Threads run_4_threads (s)Max Inclusive Time Over Threads run_8_threads (s)Max Inclusive Time Over Threads run_16_threads (s)Max Inclusive Time Over Threads run_32_threads (s)Max Inclusive Time Over Threads run_48_threads (s)Max Inclusive Time Over Threads run_64_threads (s)Max Exclusive Time Over Threads run_1_thread (s)Max Exclusive Time Over Threads run_2_threads (s)Max Exclusive Time Over Threads run_4_threads (s)Max Exclusive Time Over Threads run_8_threads (s)Max Exclusive Time Over Threads run_16_threads (s)Max Exclusive Time Over Threads run_32_threads (s)Max Exclusive Time Over Threads run_48_threads (s)Max Exclusive Time Over Threads run_64_threads (s)Inclusive Time w.r.t. Wall Time run_1_thread (s)Inclusive Time w.r.t. Wall Time run_2_threads (s)Inclusive Time w.r.t. Wall Time run_4_threads (s)Inclusive Time w.r.t. Wall Time run_8_threads (s)Inclusive Time w.r.t. Wall Time run_16_threads (s)Inclusive Time w.r.t. Wall Time run_32_threads (s)Inclusive Time w.r.t. Wall Time run_48_threads (s)Inclusive Time w.r.t. Wall Time run_64_threads (s)Exclusive Time w.r.t. Wall Time run_1_thread (s)Exclusive Time w.r.t. Wall Time run_2_threads (s)Exclusive Time w.r.t. Wall Time run_4_threads (s)Exclusive Time w.r.t. Wall Time run_8_threads (s)Exclusive Time w.r.t. Wall Time run_16_threads (s)Exclusive Time w.r.t. Wall Time run_32_threads (s)Exclusive Time w.r.t. Wall Time run_48_threads (s)Exclusive Time w.r.t. Wall Time run_64_threads (s)Nb Threads run_1_threadNb Threads run_2_threadsNb Threads run_4_threadsNb Threads run_8_threadsNb Threads run_16_threadsNb Threads run_32_threadsNb Threads run_48_threadsNb Threads run_64_threadsDeviation (coverage) run_1_threadDeviation (coverage) run_2_threadsDeviation (coverage) run_4_threadsDeviation (coverage) run_8_threadsDeviation (coverage) run_16_threadsDeviation (coverage) run_32_threadsDeviation (coverage) run_48_threadsDeviation (coverage) run_64_threadsDeviation (walltime) run_1_threadDeviation (walltime) run_2_threadsDeviation (walltime) run_4_threadsDeviation (walltime) run_8_threadsDeviation (walltime) run_16_threadsDeviation (walltime) run_32_threadsDeviation (walltime) run_48_threadsDeviation (walltime) run_64_threadsCategories run_1_threadCategories run_2_threadsCategories run_4_threadsCategories run_8_threadsCategories run_16_threadsCategories run_32_threadsCategories run_48_threadsCategories run_64_threadsGFLOPS run_1_threadGFLOPS run_2_threadsGFLOPS run_4_threadsGFLOPS run_8_threadsGFLOPS run_16_threadsGFLOPS run_32_threadsGFLOPS run_48_threadsGFLOPS run_64_threadsCompilation Options(run_1_thread) Efficiency(run_1_thread) Potential Speed-Up (%)(run_2_threads) Efficiency(run_2_threads) Potential Speed-Up (%)(run_4_threads) Efficiency(run_4_threads) Potential Speed-Up (%)(run_8_threads) Efficiency(run_8_threads) Potential Speed-Up (%)(run_16_threads) Efficiency(run_16_threads) Potential Speed-Up (%)(run_32_threads) Efficiency(run_32_threads) Potential Speed-Up (%)(run_48_threads) Efficiency(run_48_threads) Potential Speed-Up (%)(run_64_threads) Efficiency(run_64_threads) Potential Speed-Up (%)
k_means(int, point_t*, point_t*, int*, point_t*, int, int)+kmeans-acfl-Ofast99.8970.8344.3225.2413.847.224.944.730.000.000.000.000.000.000.000.009.049.039.048.868.928.748.588.990.000.000.000.000.000.000.000.009.046.414.012.251.240.640.430.430.000.000.000.000.000.000.000.00111111110.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.000.550.781.242.183.977.5711.0011.57Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fopenmp -Ofast -greco...100.7120.860.5619.340.512.580.467.540.444.020.442.780.333.17
Loop 4 - main.cpp:21-100 - kmeans-acfl-Ofast [...]+99.8970.8344.3225.2413.847.224.944.730.000.000.000.000.000.000.000.009.049.039.048.868.928.748.588.990.000.000.000.000.000.000.000.009.046.414.012.251.240.640.430.430.000.000.000.000.000.000.000.00000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 6 - main.cpp:89-92 - kmeans-acfl-Ofast99.8970.8344.3225.2413.847.224.944.7399.8970.8344.3225.2413.847.224.944.739.049.039.048.868.928.748.588.999.049.039.048.868.928.748.588.999.046.414.012.251.240.640.430.439.046.414.012.251.240.640.430.43111111110.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.550.781.242.183.977.5711.0011.57100.7120.860.5619.340.512.580.467.540.444.020.442.780.333.17
Loop 5 - main.cpp:94-100 - kmeans-acfl-Ofast0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone .omp_outlined]+kmeans-acfl-Ofast0.110.270.560.870.912.363.6110.870.000.000.000.000.000.000.000.000.010.020.040.060.070.150.240.640.000.000.000.000.000.000.000.000.010.020.050.080.080.210.320.980.000.000.000.000.000.000.000.001248153248630.000.400.420.490.591.312.198.820.000.010.010.020.020.040.070.18Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.000.507.6513.7422.0042.5349.4761.7037.69Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fopenmp -Ofast -greco...100.20.220.050.540.020.850.010.902.3603.61010.87
Loop 7 - main.cpp:66-74 - kmeans-acfl-Ofast+0.110.270.560.870.912.363.6110.870.000.000.000.000.010.020.010.120.010.020.040.060.080.150.250.640.000.000.000.000.000.000.000.020.010.020.050.080.080.210.320.980.000.000.000.000.000.000.000.010236112844630.000.000.000.000.040.050.050.210.000.000.000.000.000.000.000.000.000.000.000.0055.6663.00156.4037.94
Loop 8 - main.cpp:68-74 - kmeans-acfl-Ofast0.110.270.560.870.902.343.6010.750.110.270.560.870.902.343.6010.750.010.020.040.060.070.150.240.620.010.020.040.060.070.150.240.620.010.020.050.080.080.210.310.970.010.020.050.080.080.210.310.971248153248630.000.400.420.490.591.302.198.710.000.010.010.020.020.040.070.170.507.4013.6221.6842.4149.3561.3237.69100.20.220.050.540.020.850.010.8902.3403.6010.75
unknown_function[vdso]0.000.120.340.610.570.570.610.560.000.000.000.000.000.000.000.000.000.010.030.050.040.050.040.050.000.000.000.000.000.000.000.000.000.010.030.050.050.050.050.050.000.000.000.000.000.000.000.000137153146610.000.000.300.400.260.300.260.320.000.000.010.010.010.010.010.01NAPthread (%): 100.00Pthread (%): 100.00Pthread (%): 100.00
Others (%): 0.00
Pthread (%): 100.00Pthread (%): 100.00
Others (%): 0.00
Pthread (%): 100.00
Others (%): 0.00
Pthread (%): 100.00
Others (%): 0.00
0.000.000.000.000.000.000.000.001010101010101010
void __kmp_resume_template<kmp_flag_64<false, true> >(int, kmp_flag_64<false, true>*)libomp.so0.000.000.000.000.010.000.000.000.000.000.000.000.010.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000010000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANAOMP (%): 100.00NANANA0.000.000.000.000.000.000.000.00
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check()libomp.so0.002.083.825.316.456.856.916.490.002.083.825.316.456.856.916.490.000.260.290.370.330.350.350.320.000.260.290.370.330.350.350.320.000.190.350.470.580.610.600.590.000.190.350.470.580.610.600.590137153147630.000.001.191.381.071.291.091.520.000.000.040.050.040.050.040.06NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.000.001010101010101010
__pthread_mutex_locklibc.so.60.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000001000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANANAPthread (%): 100.00NANA0.000.000.000.000.000.000.000.00
__sched_yieldlibc.so.60.000.901.642.893.463.493.443.240.000.901.642.893.463.493.443.240.000.110.140.170.190.200.170.150.000.110.140.170.190.200.170.150.000.080.150.260.310.310.300.290.000.080.150.260.310.310.300.290137153147630.000.000.710.620.650.820.710.690.000.000.030.020.020.030.020.03NAPthread (%): 100.00Pthread (%): 100.00Pthread (%): 100.00
System (%): 0.00
Pthread (%): 100.00
System (%): 0.00
Pthread (%): 100.00
System (%): 0.00
Pthread (%): 100.00
System (%): 0.00
Pthread (%): 100.00
System (%): 0.00
0.000.000.000.000.000.000.000.001010101010101010
unknown_functionlibc.so.60.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000010.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANANANANAOMP (%): 100.000.000.000.000.000.000.000.000.00
void __kmp_suspend_64<false, true>(int, kmp_flag_64<false, true>*)libomp.so0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000001000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANANAOMP (%): 100.00NANA0.000.000.000.000.000.000.000.00
__kmp_yieldlibomp.so0.000.000.070.060.090.130.110.120.000.000.070.060.090.130.110.120.000.000.010.010.020.020.010.020.000.000.010.010.020.020.010.020.000.000.010.010.010.010.010.010.000.000.010.010.010.010.010.01001371628300.000.000.000.080.150.150.090.130.000.000.000.000.010.010.000.00NANAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.000.00
@plt_start@libomp.so0.000.080.150.110.190.260.200.200.000.080.150.110.190.260.200.200.000.010.010.010.030.020.020.020.000.010.010.010.030.020.020.020.000.010.010.010.020.020.020.020.000.010.010.010.020.020.020.020124122639390.000.000.000.110.230.140.140.190.000.000.000.000.010.000.010.01NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.000.001010101010101010
__aarch64_ldadd4_acq_rellibomp.so0.000.040.000.000.000.000.000.000.000.040.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00010000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NAOMP (%): 100.00NANANANANANA0.000.000.000.000.000.000.000.00
kmp_flag_64<false, true>::wait(kmp_info*, int, void*)libomp.so0.0025.6849.0764.8974.4279.0580.0773.700.0025.6849.0764.8974.4279.0580.0773.700.003.283.403.413.403.373.403.120.003.283.403.413.403.373.403.120.002.324.445.796.686.996.996.660.002.324.445.796.686.996.996.660137153147630.000.001.212.191.401.932.338.330.000.000.060.120.090.170.240.59NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.000.001010101010101010
__kmp_now_nseclibomp.so0.000.000.020.010.060.060.100.090.000.000.020.010.060.060.100.090.000.000.000.000.010.010.010.010.000.000.000.000.010.010.010.010.000.000.000.000.010.010.010.010.000.000.000.000.010.010.010.01001171125240.000.000.000.000.050.060.070.100.000.000.000.000.000.000.000.00NANAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.000.00
×