options

Functions and Loops

Columns Filter

Coverage run_1_thread (%) Coverage run_2_threads (%) Coverage run_4_threads (%) Coverage run_8_threads (%) Coverage run_16_threads (%) Coverage run_32_threads (%) Coverage run_48_threads (%) Coverage run_64_threads (%) Coverage Excluding Loops run_1_thread (%) Coverage Excluding Loops run_2_threads (%) Coverage Excluding Loops run_4_threads (%) Coverage Excluding Loops run_8_threads (%) Coverage Excluding Loops run_16_threads (%) Coverage Excluding Loops run_32_threads (%) Coverage Excluding Loops run_48_threads (%) Coverage Excluding Loops run_64_threads (%) Max Inclusive Time Over Threads run_1_thread (s) Max Inclusive Time Over Threads run_2_threads (s) Max Inclusive Time Over Threads run_4_threads (s) Max Inclusive Time Over Threads run_8_threads (s) Max Inclusive Time Over Threads run_16_threads (s) Max Inclusive Time Over Threads run_32_threads (s) Max Inclusive Time Over Threads run_48_threads (s) Max Inclusive Time Over Threads run_64_threads (s) Max Exclusive Time Over Threads run_1_thread (s) Max Exclusive Time Over Threads run_2_threads (s) Max Exclusive Time Over Threads run_4_threads (s) Max Exclusive Time Over Threads run_8_threads (s) Max Exclusive Time Over Threads run_16_threads (s) Max Exclusive Time Over Threads run_32_threads (s) Max Exclusive Time Over Threads run_48_threads (s) Max Exclusive Time Over Threads run_64_threads (s) Inclusive Time w.r.t. Wall Time run_1_thread (s) Inclusive Time w.r.t. Wall Time run_2_threads (s) Inclusive Time w.r.t. Wall Time run_4_threads (s) Inclusive Time w.r.t. Wall Time run_8_threads (s) Inclusive Time w.r.t. Wall Time run_16_threads (s) Inclusive Time w.r.t. Wall Time run_32_threads (s) Inclusive Time w.r.t. Wall Time run_48_threads (s) Inclusive Time w.r.t. Wall Time run_64_threads (s) Exclusive Time w.r.t. Wall Time run_1_thread (s) Exclusive Time w.r.t. Wall Time run_2_threads (s) Exclusive Time w.r.t. Wall Time run_4_threads (s) Exclusive Time w.r.t. Wall Time run_8_threads (s) Exclusive Time w.r.t. Wall Time run_16_threads (s) Exclusive Time w.r.t. Wall Time run_32_threads (s) Exclusive Time w.r.t. Wall Time run_48_threads (s) Exclusive Time w.r.t. Wall Time run_64_threads (s) Nb Threads run_1_thread Nb Threads run_2_threads Nb Threads run_4_threads Nb Threads run_8_threads Nb Threads run_16_threads Nb Threads run_32_threads Nb Threads run_48_threads Nb Threads run_64_threads Deviation (coverage) run_1_thread Deviation (coverage) run_2_threads Deviation (coverage) run_4_threads Deviation (coverage) run_8_threads Deviation (coverage) run_16_threads Deviation (coverage) run_32_threads Deviation (coverage) run_48_threads Deviation (coverage) run_64_threads Deviation (walltime) run_1_thread Deviation (walltime) run_2_threads Deviation (walltime) run_4_threads Deviation (walltime) run_8_threads Deviation (walltime) run_16_threads Deviation (walltime) run_32_threads Deviation (walltime) run_48_threads Deviation (walltime) run_64_threads Categories run_1_thread Categories run_2_threads Categories run_4_threads Categories run_8_threads Categories run_16_threads Categories run_32_threads Categories run_48_threads Categories run_64_threads GFLOPS run_1_thread GFLOPS run_2_threads GFLOPS run_4_threads GFLOPS run_8_threads GFLOPS run_16_threads GFLOPS run_32_threads GFLOPS run_48_threads GFLOPS run_64_threads Compilation Options (run_1_thread) Efficiency (run_1_thread) Potential Speed-Up (%) (run_2_threads) Efficiency (run_2_threads) Potential Speed-Up (%) (run_4_threads) Efficiency (run_4_threads) Potential Speed-Up (%) (run_8_threads) Efficiency (run_8_threads) Potential Speed-Up (%) (run_16_threads) Efficiency (run_16_threads) Potential Speed-Up (%) (run_32_threads) Efficiency (run_32_threads) Potential Speed-Up (%) (run_48_threads) Efficiency (run_48_threads) Potential Speed-Up (%) (run_64_threads) Efficiency (run_64_threads) Potential Speed-Up (%)
NameModuleCoverage run_1_thread (%)Coverage run_2_threads (%)Coverage run_4_threads (%)Coverage run_8_threads (%)Coverage run_16_threads (%)Coverage run_32_threads (%)Coverage run_48_threads (%)Coverage run_64_threads (%)Coverage Excluding Loops run_1_thread (%)Coverage Excluding Loops run_2_threads (%)Coverage Excluding Loops run_4_threads (%)Coverage Excluding Loops run_8_threads (%)Coverage Excluding Loops run_16_threads (%)Coverage Excluding Loops run_32_threads (%)Coverage Excluding Loops run_48_threads (%)Coverage Excluding Loops run_64_threads (%)Max Inclusive Time Over Threads run_1_thread (s)Max Inclusive Time Over Threads run_2_threads (s)Max Inclusive Time Over Threads run_4_threads (s)Max Inclusive Time Over Threads run_8_threads (s)Max Inclusive Time Over Threads run_16_threads (s)Max Inclusive Time Over Threads run_32_threads (s)Max Inclusive Time Over Threads run_48_threads (s)Max Inclusive Time Over Threads run_64_threads (s)Max Exclusive Time Over Threads run_1_thread (s)Max Exclusive Time Over Threads run_2_threads (s)Max Exclusive Time Over Threads run_4_threads (s)Max Exclusive Time Over Threads run_8_threads (s)Max Exclusive Time Over Threads run_16_threads (s)Max Exclusive Time Over Threads run_32_threads (s)Max Exclusive Time Over Threads run_48_threads (s)Max Exclusive Time Over Threads run_64_threads (s)Inclusive Time w.r.t. Wall Time run_1_thread (s)Inclusive Time w.r.t. Wall Time run_2_threads (s)Inclusive Time w.r.t. Wall Time run_4_threads (s)Inclusive Time w.r.t. Wall Time run_8_threads (s)Inclusive Time w.r.t. Wall Time run_16_threads (s)Inclusive Time w.r.t. Wall Time run_32_threads (s)Inclusive Time w.r.t. Wall Time run_48_threads (s)Inclusive Time w.r.t. Wall Time run_64_threads (s)Exclusive Time w.r.t. Wall Time run_1_thread (s)Exclusive Time w.r.t. Wall Time run_2_threads (s)Exclusive Time w.r.t. Wall Time run_4_threads (s)Exclusive Time w.r.t. Wall Time run_8_threads (s)Exclusive Time w.r.t. Wall Time run_16_threads (s)Exclusive Time w.r.t. Wall Time run_32_threads (s)Exclusive Time w.r.t. Wall Time run_48_threads (s)Exclusive Time w.r.t. Wall Time run_64_threads (s)Nb Threads run_1_threadNb Threads run_2_threadsNb Threads run_4_threadsNb Threads run_8_threadsNb Threads run_16_threadsNb Threads run_32_threadsNb Threads run_48_threadsNb Threads run_64_threadsDeviation (coverage) run_1_threadDeviation (coverage) run_2_threadsDeviation (coverage) run_4_threadsDeviation (coverage) run_8_threadsDeviation (coverage) run_16_threadsDeviation (coverage) run_32_threadsDeviation (coverage) run_48_threadsDeviation (coverage) run_64_threadsDeviation (walltime) run_1_threadDeviation (walltime) run_2_threadsDeviation (walltime) run_4_threadsDeviation (walltime) run_8_threadsDeviation (walltime) run_16_threadsDeviation (walltime) run_32_threadsDeviation (walltime) run_48_threadsDeviation (walltime) run_64_threadsCategories run_1_threadCategories run_2_threadsCategories run_4_threadsCategories run_8_threadsCategories run_16_threadsCategories run_32_threadsCategories run_48_threadsCategories run_64_threadsGFLOPS run_1_threadGFLOPS run_2_threadsGFLOPS run_4_threadsGFLOPS run_8_threadsGFLOPS run_16_threadsGFLOPS run_32_threadsGFLOPS run_48_threadsGFLOPS run_64_threadsCompilation Options(run_1_thread) Efficiency(run_1_thread) Potential Speed-Up (%)(run_2_threads) Efficiency(run_2_threads) Potential Speed-Up (%)(run_4_threads) Efficiency(run_4_threads) Potential Speed-Up (%)(run_8_threads) Efficiency(run_8_threads) Potential Speed-Up (%)(run_16_threads) Efficiency(run_16_threads) Potential Speed-Up (%)(run_32_threads) Efficiency(run_32_threads) Potential Speed-Up (%)(run_48_threads) Efficiency(run_48_threads) Potential Speed-Up (%)(run_64_threads) Efficiency(run_64_threads) Potential Speed-Up (%)
k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone .omp_outlined]+kmeans-acfl-O3-funroll-soa95.9094.3091.2685.8676.6663.6054.0947.130.000.000.000.000.000.000.000.00216.34108.0554.2227.1913.646.924.613.480.000.000.000.000.000.000.000.00216.34110.6158.0031.2817.5510.307.516.010.000.000.000.000.000.000.000.001248163248640.003.164.064.644.673.883.172.580.000.180.070.030.020.020.010.01Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.004.148.1015.4628.6551.0587.03119.32149.21Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fopenmp -O3 -funroll-...100.982.080.936.150.8611.640.7717.610.6621.850.621.630.5620.6
Loop 7 - main_soa.cpp:58-69 - kmeans-acfl-O3-funroll-soa+95.9094.3091.2685.8676.6663.6054.0947.130.000.000.000.000.000.000.000.00216.34108.2854.9227.7214.127.164.833.620.000.000.000.000.000.000.000.00216.34110.6158.0031.2817.5510.307.516.010.000.000.000.000.000.000.000.00010000100.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 8 - main_soa.cpp:59-69 - kmeans-acfl-O3-funroll-soa+95.9094.2991.2685.8676.6663.6054.0947.1319.4018.7618.6817.3315.4312.9110.949.57216.34108.2854.9227.7214.127.164.833.6243.7621.7211.565.763.071.521.050.79216.34110.6058.0031.2817.5510.307.516.0143.7622.0111.876.313.532.091.521.221248163248640.000.360.461.161.320.881.020.810.000.340.370.180.120.070.060.044.007.9914.9227.9349.6684.31116.16144.69100.990.110.921.470.872.310.773.480.654.460.64.380.564.2
Loop 9 - main_soa.cpp:62-67 - kmeans-acfl-O3-funroll-soa76.5075.5372.5868.5461.2350.6943.1537.5676.5075.5372.5868.5461.2350.6943.1537.56172.5786.5643.3621.9511.055.633.782.82172.5786.5643.3621.9511.055.633.782.82172.5788.6046.1224.9714.028.215.994.79172.5788.6046.1224.9714.028.215.994.791248163248640.002.803.743.693.593.312.502.130.000.160.310.180.130.070.060.044.188.1315.5928.8351.4087.73120.13150.36100.971.970.944.690.869.330.7714.130.6617.380.617.260.5616.4
k_means(int, point_t&, point_t&, int*, point_t&, int, int)+kmeans-acfl-O3-funroll-soa4.104.043.933.663.262.672.281.980.000.000.000.000.000.000.000.009.259.249.339.249.259.269.269.250.000.000.000.000.000.000.000.009.254.742.501.330.750.430.320.250.000.000.000.000.000.000.000.00111111110.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.000.541.062.003.756.7011.5615.8219.78Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fopenmp -O3 -funroll-...100.980.10.930.290.870.480.770.730.670.890.610.890.570.85
Loop 4 - main_soa.cpp:56-93 - kmeans-acfl-O3-funroll-soa [...]+4.104.043.933.663.262.672.281.980.000.000.000.000.000.000.000.009.259.249.339.249.259.269.269.250.000.000.000.000.000.000.000.009.254.742.501.330.750.430.320.250.000.000.000.000.000.000.000.00000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 6 - main_soa.cpp:81-84 - kmeans-acfl-O3-funroll-soa4.104.043.933.663.262.672.281.984.104.043.933.663.262.672.281.989.259.249.339.249.259.269.269.259.259.249.339.249.259.269.269.259.254.742.501.330.750.430.320.259.254.742.501.330.750.430.320.25111111110.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.541.062.003.756.7011.5615.8219.78100.980.10.930.290.870.480.770.730.670.890.610.890.570.85
Loop 5 - main_soa.cpp:86-93 - kmeans-acfl-O3-funroll-soa0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check()libomp.so0.000.120.330.781.512.713.253.800.000.120.330.781.512.713.253.800.000.270.310.330.330.380.390.400.000.270.310.330.330.380.390.400.000.140.210.290.350.440.450.480.000.140.210.290.350.440.450.480137163147630.000.000.090.100.460.390.630.560.000.000.050.030.080.040.050.04NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.000.001010101010101010
__kmp_yieldlibomp.so0.000.000.010.010.020.050.070.050.000.000.010.010.020.050.070.050.000.000.010.000.010.010.020.020.000.000.010.000.010.010.020.020.000.000.000.000.010.010.010.010.000.000.000.000.010.010.010.010014102431300.000.000.000.000.010.030.050.070.000.000.000.000.000.000.000.00NANAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.000.00
__kmp_fork_calllibomp.so0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000100.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANANANAOMP (%): 100.00NA0.000.000.000.000.000.000.000.00
parse_printf_formatlibc.so.60.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000010000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANAIO (%): 100.00NANANA0.000.000.000.000.000.000.000.00
__sched_yieldlibc.so.60.000.070.190.410.741.331.671.920.000.070.190.410.741.331.671.920.000.170.180.200.190.200.200.200.000.170.180.200.190.200.200.200.000.090.120.150.170.210.230.240.000.090.120.150.170.210.230.240137153148630.000.000.060.120.160.270.400.380.000.000.040.040.030.030.030.03NAPthread (%): 100.00
System (%): 0.00
Pthread (%): 100.00Pthread (%): 100.00
System (%): 0.00
Pthread (%): 100.00
System (%): 0.00
Pthread (%): 100.00
System (%): 0.00
Pthread (%): 99.93
System (%): 0.00
OMP (%): 0.07
Pthread (%): 100.00
System (%): 0.00
0.000.000.000.000.000.000.000.001010101010101010
__kmpc_for_static_init_4libomp.so0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000100.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANANANAOMP (%): 100.00NA0.000.000.000.000.000.000.000.00
unknown_function[vdso]0.000.010.030.080.140.220.300.330.000.000.000.000.000.000.000.000.000.020.020.050.050.050.050.050.000.000.000.000.000.000.000.000.000.010.020.030.030.040.040.040.000.000.000.000.000.000.000.000137153146630.000.000.010.040.060.100.140.150.000.000.000.010.010.010.010.01NAPthread (%): 100.00Pthread (%): 100.00Pthread (%): 100.00
Others (%): 0.00
Pthread (%): 100.00Pthread (%): 100.00
Others (%): 0.00
Pthread (%): 100.00
Others (%): 0.00
Pthread (%): 100.000.000.000.000.000.000.000.000.001010101010101010
kmp_flag_64<false, true>::wait(kmp_info*, int, void*)libomp.so0.001.464.239.1717.6029.3038.2044.630.001.464.239.1717.6029.3038.2044.630.003.333.403.393.463.423.493.450.003.333.403.393.463.423.493.450.001.712.693.344.034.745.305.690.001.712.693.344.034.745.305.690137163148640.000.000.160.324.790.545.755.780.000.000.100.100.830.070.480.42NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.000.001010101010101010
pthread_cond_waitlibc.so.60.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000010.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANANANANAPthread (%): 100.000.000.000.000.000.000.000.000.00
__kmp_now_nseclibomp.so0.000.000.000.000.010.030.030.040.000.000.000.000.010.030.030.040.000.000.000.000.010.010.010.010.000.000.000.000.010.010.010.010.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000241520270.000.000.000.000.020.030.030.040.000.000.000.000.000.000.000.00NANANAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.000.00
std::__use_cache<std::__numpunct_cache<char> >::operator()(std::locale const&) const [clone .isra.0]libstdc++.so.6.0.330.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000010.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANANANANAOthers (%): 100.000.000.000.000.000.000.000.000.00
unknown_functionlibc.so.60.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000010.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANANANANAPthread (%): 100.000.000.000.000.000.000.000.000.00
×