Functions and Loops

Columns: Name, Module, then the following metrics, each reported for the eight runs in the order run_1_thread, run_2_threads, run_4_threads, run_8_threads, run_16_threads, run_32_threads, run_48_threads, run_64_threads:
Coverage (%)
Coverage Excluding Loops (%)
Max Inclusive Time Over Threads (s)
Max Exclusive Time Over Threads (s)
Inclusive Time w.r.t. Wall Time (s)
Exclusive Time w.r.t. Wall Time (s)
Nb Threads
Deviation (coverage)
Deviation (walltime)
Categories
GFLOPS
Compilation Options (one value per entry)
Efficiency
Potential Speed-Up (%)

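Name: k_means(int, point_t*, point_t*, int*, point_t*, int, int)
Module: kmeans-acfl-O3-all
Coverage (%): 99.56 | 70.92 | 44.35 | 25.37 | 13.89 | 6.98 | 4.87 | 4.61
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s): 8.99 | 9.02 | 8.99 | 8.92 | 8.94 | 8.39 | 8.41 | 8.77
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Inclusive Time w.r.t. Wall Time (s): 8.99 | 6.41 | 4.00 | 2.28 | 1.25 | 0.60 | 0.42 | 0.41
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Categories: Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00
GFLOPS: 0.56 | 0.78 | 1.24 | 2.17 | 3.95 | 7.78 | 11.12 | 11.74
Compilation Options: Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fopenmp -O3 -funroll-...
Efficiency: 1 | 0.72 | 0.56 | 0.49 | 0.45 | 0.47 | 0.45 | 0.34
Potential Speed-Up (%): 0 | 1.17 | 19.42 | 12.84 | 7.63 | 3.69 | 2.68 | 3.03
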
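Name: Loop 4 - main.cpp:21-100 - kmeans-acfl-O3-all [...]
Coverage (%): 99.56 | 70.92 | 44.35 | 25.37 | 13.89 | 6.98 | 4.87 | 4.61
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s): 8.99 | 9.02 | 8.99 | 8.92 | 8.94 | 8.39 | 8.41 | 8.77
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Inclusive Time w.r.t. Wall Time (s): 8.99 | 6.41 | 4.00 | 2.28 | 1.25 | 0.60 | 0.42 | 0.41
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
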
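Name: Loop 6 - main.cpp:89-92 - kmeans-acfl-O3-all
Coverage (%): 99.56 | 70.92 | 44.35 | 25.37 | 13.89 | 6.98 | 4.87 | 4.61
Coverage Excluding Loops (%): 99.56 | 70.92 | 44.35 | 25.37 | 13.89 | 6.98 | 4.87 | 4.61
Max Inclusive Time Over Threads (s): 8.99 | 9.02 | 8.99 | 8.92 | 8.94 | 8.39 | 8.41 | 8.77
Max Exclusive Time Over Threads (s): 8.99 | 9.02 | 8.99 | 8.92 | 8.94 | 8.39 | 8.41 | 8.77
Inclusive Time w.r.t. Wall Time (s): 8.99 | 6.41 | 4.00 | 2.28 | 1.25 | 0.60 | 0.42 | 0.41
Exclusive Time w.r.t. Wall Time (s): 8.99 | 6.41 | 4.00 | 2.28 | 1.25 | 0.60 | 0.42 | 0.41
Nb Threads: 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
GFLOPS: 0.56 | 0.78 | 1.24 | 2.17 | 3.95 | 7.78 | 11.12 | 11.74
Efficiency: 1 | 0.72 | 0.56 | 0.49 | 0.45 | 0.47 | 0.45 | 0.34
Potential Speed-Up (%): 0 | 1.17 | 19.42 | 12.84 | 7.63 | 3.69 | 2.68 | 3.03
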
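Name: Loop 5 - main.cpp:94-100 - kmeans-acfl-O3-all
Coverage (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
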
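Name: k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone .omp_outlined]
Module: kmeans-acfl-O3-all
Coverage (%): 0.33 | 0.51 | 0.37 | 1.05 | 1.12 | 2.45 | 3.84 | 11.20
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s): 0.03 | 0.05 | 0.04 | 0.10 | 0.09 | 0.17 | 0.24 | 0.68
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Inclusive Time w.r.t. Wall Time (s): 0.03 | 0.05 | 0.03 | 0.09 | 0.10 | 0.21 | 0.33 | 1.00
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 1 | 2 | 4 | 8 | 16 | 31 | 48 | 64
Deviation (coverage): 0.00 | 0.84 | 0.37 | 0.85 | 0.79 | 1.42 | 2.19 | 9.20
Deviation (walltime): 0.00 | 0.02 | 0.02 | 0.03 | 0.03 | 0.05 | 0.07 | 0.19
Categories: Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00
GFLOPS: 0.46 | 4.90 | 14.20 | 20.45 | 35.13 | 50.94 | 60.11 | 37.11
Compilation Options: Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fopenmp -O3 -funroll-...
Efficiency: 1 | 0.32 | 0.22 | 0.04 | 0.02 | 0 | 0 | 0
Potential Speed-Up (%): 0 | 0.34 | 0.29 | 1.01 | 1.1 | 2.44 | 3.84 | 11.19
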
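Name: Loop 7 - main.cpp:66-74 - kmeans-acfl-O3-all
Coverage (%): 0.33 | 0.51 | 0.37 | 1.05 | 1.12 | 2.45 | 3.84 | 11.20
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.00 | 0.01 | 0.03 | 0.03 | 0.03 | 0.08
Max Inclusive Time Over Threads (s): 0.03 | 0.05 | 0.04 | 0.10 | 0.09 | 0.18 | 0.25 | 0.69
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01
Inclusive Time w.r.t. Wall Time (s): 0.03 | 0.05 | 0.03 | 0.09 | 0.10 | 0.21 | 0.33 | 1.00
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01
Nb Threads: 0 | 1 | 2 | 8 | 13 | 26 | 45 | 62
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.05 | 0.06 | 0.08 | 0.06 | 0.17
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
GFLOPS: 0.00 | 0.00 | 0.00 | 14.69 | 12.53 | 42.63 | 104.59 | 52.38
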
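Name: Loop 8 - main.cpp:68-74 - kmeans-acfl-O3-all
Coverage (%): 0.33 | 0.51 | 0.37 | 1.04 | 1.09 | 2.42 | 3.82 | 11.12
Coverage Excluding Loops (%): 0.33 | 0.51 | 0.37 | 1.04 | 1.09 | 2.42 | 3.82 | 11.12
Max Inclusive Time Over Threads (s): 0.03 | 0.05 | 0.04 | 0.10 | 0.08 | 0.17 | 0.24 | 0.68
Max Exclusive Time Over Threads (s): 0.03 | 0.05 | 0.04 | 0.10 | 0.08 | 0.17 | 0.24 | 0.68
Inclusive Time w.r.t. Wall Time (s): 0.03 | 0.05 | 0.03 | 0.09 | 0.10 | 0.21 | 0.33 | 0.99
Exclusive Time w.r.t. Wall Time (s): 0.03 | 0.05 | 0.03 | 0.09 | 0.10 | 0.21 | 0.33 | 0.99
Nb Threads: 1 | 2 | 4 | 8 | 16 | 31 | 48 | 64
Deviation (coverage): 0.00 | 0.84 | 0.37 | 0.85 | 0.76 | 1.42 | 2.17 | 9.14
Deviation (walltime): 0.00 | 0.02 | 0.02 | 0.03 | 0.03 | 0.05 | 0.07 | 0.18
GFLOPS: 0.46 | 4.79 | 13.97 | 20.53 | 35.78 | 51.04 | 59.81 | 37.00
Efficiency: 1 | 0.32 | 0.22 | 0.04 | 0.02 | 0 | 0 | 0
Potential Speed-Up (%): 0 | 0.34 | 0.29 | 1 | 1.07 | 2.41 | 3.81 | 11.11
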
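Name: __kmp_get_global_thread_id_reg
Module: libomp.so
Coverage (%): 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Coverage Excluding Loops (%): 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Categories: OMP (%): 100.00 | NA | NA | NA | NA | NA | NA | NA
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
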
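Name: std::ostreambuf_iterator<char, std::char_traits<char> > std::num_put<char, std::ostreambuf_iterator<char, std::char_traits<char> > >::_M_insert_float<double>(std::ostreambuf_iterator<char, std::char_traits<char>...
Module: libstdc++.so.6.0.33
Coverage (%): 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Coverage Excluding Loops (%): 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Categories: Others (%): 100.00 | NA | NA | NA | NA | NA | NA | NA
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
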
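Name: @plt_start@
Module: libomp.so
Coverage (%): 0.00 | 0.04 | 0.22 | 0.33 | 0.23 | 0.20 | 0.21 | 0.24
Coverage Excluding Loops (%): 0.00 | 0.04 | 0.22 | 0.33 | 0.23 | 0.20 | 0.21 | 0.24
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.03 | 0.02 | 0.02 | 0.02 | 0.03 | 0.03
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.03 | 0.02 | 0.02 | 0.02 | 0.03 | 0.03
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.02 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.02 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02
Nb Threads: 0 | 1 | 3 | 7 | 13 | 26 | 36 | 47
Deviation (coverage): 0.00 | 0.00 | 0.27 | 0.14 | 0.19 | 0.14 | 0.15 | 0.17
Deviation (walltime): 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01
Categories: NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Efficiency: 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1
Potential Speed-Up (%): 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0
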
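Name: __sched_yield
Module: libc.so.6
Coverage (%): 0.00 | 1.18 | 1.85 | 2.66 | 3.16 | 3.31 | 3.58 | 3.35
Coverage Excluding Loops (%): 0.00 | 1.18 | 1.85 | 2.66 | 3.16 | 3.31 | 3.58 | 3.35
Max Inclusive Time Over Threads (s): 0.00 | 0.15 | 0.14 | 0.16 | 0.18 | 0.18 | 0.21 | 0.21
Max Exclusive Time Over Threads (s): 0.00 | 0.15 | 0.14 | 0.16 | 0.18 | 0.18 | 0.21 | 0.21
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.11 | 0.17 | 0.24 | 0.28 | 0.28 | 0.31 | 0.30
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.11 | 0.17 | 0.24 | 0.28 | 0.28 | 0.31 | 0.30
Nb Threads: 0 | 1 | 3 | 7 | 15 | 31 | 47 | 63
Deviation (coverage): 0.00 | 0.00 | 0.37 | 0.65 | 0.58 | 0.76 | 0.81 | 1.04
Deviation (walltime): 0.00 | 0.00 | 0.01 | 0.02 | 0.02 | 0.03 | 0.03 | 0.04
Categories: NA | Pthread (%): 100.00, System (%): 0.00 | Pthread (%): 100.00 | Pthread (%): 100.00, System (%): 0.00 | Pthread (%): 100.00, System (%): 0.00 | Pthread (%): 100.00, System (%): 0.00 | Pthread (%): 100.00, System (%): 0.00 | Pthread (%): 100.00, System (%): 0.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Efficiency: 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1
Potential Speed-Up (%): 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0
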
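Name: unknown_function
Module: [vdso]
Coverage (%): 0.00 | 0.24 | 0.37 | 0.57 | 0.51 | 0.68 | 0.64 | 0.53
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s): 0.00 | 0.03 | 0.03 | 0.05 | 0.05 | 0.06 | 0.04 | 0.04
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.02 | 0.03 | 0.05 | 0.05 | 0.06 | 0.06 | 0.05
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 0 | 1 | 3 | 7 | 15 | 31 | 47 | 61
Deviation (coverage): 0.00 | 0.00 | 0.23 | 0.29 | 0.33 | 0.38 | 0.28 | 0.29
Deviation (walltime): 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01
Categories: NA | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00, Others (%): 0.00 | Pthread (%): 100.00, Others (%): 0.00 | Pthread (%): 100.00, Others (%): 0.00 | Pthread (%): 100.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Efficiency: 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1
Potential Speed-Up (%): 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0
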
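Name: kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check()
Module: libomp.so
Coverage (%): 0.00 | 2.24 | 3.77 | 6.33 | 6.51 | 7.11 | 6.91 | 6.52
Coverage Excluding Loops (%): 0.00 | 2.24 | 3.77 | 6.33 | 6.51 | 7.11 | 6.91 | 6.52
Max Inclusive Time Over Threads (s): 0.00 | 0.28 | 0.28 | 0.43 | 0.34 | 0.35 | 0.34 | 0.33
Max Exclusive Time Over Threads (s): 0.00 | 0.28 | 0.28 | 0.43 | 0.34 | 0.35 | 0.34 | 0.33
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.20 | 0.34 | 0.57 | 0.58 | 0.61 | 0.59 | 0.58
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.20 | 0.34 | 0.57 | 0.58 | 0.61 | 0.59 | 0.58
Nb Threads: 0 | 1 | 3 | 7 | 15 | 31 | 47 | 63
Deviation (coverage): 0.00 | 0.00 | 0.38 | 1.57 | 0.74 | 1.08 | 1.09 | 1.53
Deviation (walltime): 0.00 | 0.00 | 0.02 | 0.06 | 0.03 | 0.04 | 0.04 | 0.07
Categories: NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Efficiency: 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1
Potential Speed-Up (%): 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0
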
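Name: _dl_allocate_tls_init
Module: ld-linux-aarch64.so.1
Coverage (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Categories: NA | NA | NA | NA | NA | Pthread (%): 100.00 | NA | NA
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
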
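Name: __kmp_yield
Module: libomp.so
Coverage (%): 0.00 | 0.00 | 0.05 | 0.10 | 0.08 | 0.11 | 0.08 | 0.09
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.05 | 0.10 | 0.08 | 0.11 | 0.08 | 0.09
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.01 | 0.02
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.01 | 0.02
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01
Nb Threads: 0 | 0 | 2 | 4 | 8 | 18 | 22 | 28
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.13 | 0.06 | 0.12 | 0.08 | 0.12
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Categories: NA | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
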
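Name: kmp_flag_64<false, true>::wait(kmp_info*, int, void*)
Module: libomp.so
Coverage (%): 0.00 | 24.83 | 48.96 | 63.58 | 74.38 | 79.06 | 79.79 | 73.42
Coverage Excluding Loops (%): 0.00 | 24.83 | 48.96 | 63.58 | 74.38 | 79.06 | 79.79 | 73.42
Max Inclusive Time Over Threads (s): 0.00 | 3.16 | 3.35 | 3.41 | 3.35 | 3.43 | 3.41 | 3.22
Max Exclusive Time Over Threads (s): 0.00 | 3.16 | 3.35 | 3.41 | 3.35 | 3.43 | 3.41 | 3.22
Inclusive Time w.r.t. Wall Time (s): 0.00 | 2.24 | 4.42 | 5.71 | 6.68 | 6.76 | 6.85 | 6.54
Exclusive Time w.r.t. Wall Time (s): 0.00 | 2.24 | 4.42 | 5.71 | 6.68 | 6.76 | 6.85 | 6.54
Nb Threads: 0 | 1 | 3 | 7 | 15 | 31 | 47 | 63
Deviation (coverage): 0.00 | 0.00 | 0.59 | 1.88 | 1.35 | 1.94 | 2.17 | 7.94
Deviation (walltime): 0.00 | 0.00 | 0.03 | 0.11 | 0.10 | 0.16 | 0.23 | 0.59
Categories: NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Efficiency: 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1
Potential Speed-Up (%): 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0
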
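Name: std::ostream& std::ostream::_M_insert<double>(double)
Module: libstdc++.so.6.0.33
Coverage (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Categories: NA | NA | NA | NA | NA | Others (%): 100.00 | NA | NA
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
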
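Name: __kmp_now_nsec
Module: libomp.so
Coverage (%): 0.00 | 0.04 | 0.05 | 0.01 | 0.12 | 0.09 | 0.07 | 0.05
Coverage Excluding Loops (%): 0.00 | 0.04 | 0.05 | 0.01 | 0.12 | 0.09 | 0.07 | 0.05
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00
Nb Threads: 0 | 1 | 2 | 1 | 10 | 16 | 20 | 16
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.00 | 0.10 | 0.09 | 0.07 | 0.07
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Categories: NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00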