Functions and Loops

Columns: Name, Module, then one value per run for each metric listed under a row, in the order run_1_thread, run_2_threads, run_4_threads, run_8_threads, run_16_threads, run_32_threads, run_48_threads, run_64_threads. Compilation Options is a single value per row. Cells that are empty in the report are omitted.

k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone .omp_outlined] (module: kmeans-acfl-O3-all-soa)
  Coverage (%): 94.30 92.17 88.11 81.04 70.03 55.08 45.39 38.54
  Coverage Excluding Loops (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Inclusive Time Over Threads (s): 152.86 76.57 38.63 19.27 9.69 4.88 3.26 2.45
  Max Exclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Inclusive Time w.r.t. Wall Time (s): 152.86 79.11 42.20 23.12 13.32 7.80 5.71 4.53
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Nb Threads: 1 2 4 8 16 32 48 64
  Deviation (coverage): 0.00 4.31 5.19 5.61 5.24 3.85 2.95 2.33
  Deviation (walltime): 0.00 0.03 0.07 0.02 0.02 0.01 0.01 0.01
  Categories: Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00
  GFLOPS: 5.72 11.06 20.73 37.85 65.67 112.24 153.35 193.19
  Compilation Options: Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fopenmp -O3 -funroll-...
  Efficiency: 1 0.97 0.91 0.83 0.72 0.61 0.56 0.53
  Potential Speed-Up (%): 0 3.11 8.32 14.05 19.81 21.33 20.06 18.21

Loop 7 - main_soa.cpp:58-69 - kmeans-acfl-O3-all-soa
  Coverage (%): 94.30 92.17 88.11 81.04 70.03 55.08 45.39 38.54
  Coverage Excluding Loops (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Inclusive Time Over Threads (s): 152.86 76.83 39.13 19.77 10.08 5.05 3.45 2.66
  Max Exclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Inclusive Time w.r.t. Wall Time (s): 152.86 79.11 42.20 23.12 13.32 7.80 5.71 4.53
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Nb Threads: 0 0 0 0 0 0 0 0
  Deviation (coverage): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Deviation (walltime): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00

Loop 8 - main_soa.cpp:59-69 - kmeans-acfl-O3-all-soa
  Coverage (%): 94.30 92.17 88.11 81.04 70.03 55.08 45.39 38.54
  Coverage Excluding Loops (%): 18.99 17.95 17.43 16.46 13.99 11.12 9.23 7.76
  Max Inclusive Time Over Threads (s): 152.86 76.83 39.13 19.77 10.08 5.05 3.45 2.66
  Max Exclusive Time Over Threads (s): 30.78 15.06 7.83 4.26 2.17 1.07 0.76 0.63
  Inclusive Time w.r.t. Wall Time (s): 152.86 79.11 42.20 23.12 13.32 7.80 5.71 4.53
  Exclusive Time w.r.t. Wall Time (s): 30.78 15.41 8.35 4.70 2.66 1.57 1.16 0.91
  Nb Threads: 1 2 4 8 16 32 48 64
  Deviation (coverage): 0.00 0.59 1.53 0.72 1.50 0.94 0.93 0.83
  Deviation (walltime): 0.00 0.21 0.24 0.17 0.11 0.05 0.05 0.04
  GFLOPS: 6.49 13.04 23.67 43.81 74.71 126.90 173.99 219.69
  Efficiency: 1 1 0.92 0.82 0.72 0.61 0.55 0.53
  Potential Speed-Up (%): 0 0.02 1.37 2.97 3.88 4.33 4.13 3.67

Loop 9 - main_soa.cpp:62-67 - kmeans-acfl-O3-all-soa
  Coverage (%): 75.31 74.22 70.68 64.58 56.04 43.95 36.16 30.78
  Coverage Excluding Loops (%): 75.31 74.22 70.68 64.58 56.04 43.95 36.16 30.78
  Max Inclusive Time Over Threads (s): 122.08 61.77 31.30 15.51 7.91 3.98 2.69 2.04
  Max Exclusive Time Over Threads (s): 122.08 61.77 31.30 15.51 7.91 3.98 2.69 2.04
  Inclusive Time w.r.t. Wall Time (s): 122.08 63.70 33.85 18.42 10.66 6.22 4.55 3.62
  Exclusive Time w.r.t. Wall Time (s): 122.08 63.70 33.85 18.42 10.66 6.22 4.55 3.62
  Nb Threads: 1 2 4 8 16 32 48 64
  Deviation (coverage): 0.00 3.72 3.67 5.00 4.03 3.15 2.37 1.96
  Deviation (walltime): 0.00 0.18 0.25 0.15 0.12 0.05 0.05 0.04
  GFLOPS: 5.53 10.58 20.01 36.34 63.42 108.53 148.08 186.51
  Efficiency: 1 0.96 0.9 0.83 0.72 0.61 0.56 0.53
  Potential Speed-Up (%): 0 3.09 6.95 11.08 15.94 17 15.93 14.55

k_means(int, point_t&, point_t&, int*, point_t&, int, int) (module: kmeans-acfl-O3-all-soa)
  Coverage (%): 5.70 5.57 5.28 4.87 4.22 3.28 2.71 2.30
  Coverage Excluding Loops (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Inclusive Time Over Threads (s): 9.23 9.25 9.25 9.25 9.33 9.26 9.29 9.30
  Max Exclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Inclusive Time w.r.t. Wall Time (s): 9.23 4.78 2.53 1.39 0.80 0.46 0.34 0.27
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Nb Threads: 1 1 1 1 1 1 1 1
  Deviation (coverage): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Deviation (walltime): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Categories: Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00
  GFLOPS: 0.54 1.05 1.98 3.60 6.22 10.78 14.70 18.50
  Compilation Options: Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fopenmp -O3 -funroll-...
  Efficiency: 1 0.97 0.91 0.83 0.72 0.62 0.57 0.53
  Potential Speed-Up (%): 0 0.19 0.46 0.82 1.19 1.24 1.17 1.07

Loop 4 - main_soa.cpp:56-93 - kmeans-acfl-O3-all-soa [...]
  Coverage (%): 5.70 5.57 5.28 4.87 4.22 3.28 2.71 2.30
  Coverage Excluding Loops (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Inclusive Time Over Threads (s): 9.23 9.25 9.25 9.25 9.33 9.26 9.29 9.30
  Max Exclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Inclusive Time w.r.t. Wall Time (s): 9.23 4.78 2.53 1.39 0.80 0.46 0.34 0.27
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Nb Threads: 0 0 0 0 0 0 0 0
  Deviation (coverage): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Deviation (walltime): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00

Loop 6 - main_soa.cpp:81-84 - kmeans-acfl-O3-all-soa
  Coverage (%): 5.70 5.57 5.28 4.87 4.22 3.28 2.71 2.30
  Coverage Excluding Loops (%): 5.70 5.57 5.28 4.87 4.22 3.28 2.71 2.30
  Max Inclusive Time Over Threads (s): 9.23 9.25 9.25 9.25 9.33 9.26 9.29 9.30
  Max Exclusive Time Over Threads (s): 9.23 9.25 9.25 9.25 9.33 9.26 9.29 9.30
  Inclusive Time w.r.t. Wall Time (s): 9.23 4.78 2.53 1.39 0.80 0.46 0.34 0.27
  Exclusive Time w.r.t. Wall Time (s): 9.23 4.78 2.53 1.39 0.80 0.46 0.34 0.27
  Nb Threads: 1 1 1 1 1 1 1 1
  Deviation (coverage): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Deviation (walltime): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  GFLOPS: 0.54 1.05 1.98 3.60 6.22 10.78 14.70 18.50
  Efficiency: 1 0.97 0.91 0.83 0.72 0.62 0.57 0.53
  Potential Speed-Up (%): 0 0.19 0.46 0.82 1.19 1.24 1.17 1.07

Loop 5 - main_soa.cpp:86-93 - kmeans-acfl-O3-all-soa
  Coverage (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Coverage Excluding Loops (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Inclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Exclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Inclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Nb Threads: 0 0 0 0 0 0 0 0
  Deviation (coverage): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Deviation (walltime): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00

__pthread_mutex_lock (module: libc.so.6)
  Coverage (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Coverage Excluding Loops (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Inclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Exclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Inclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Nb Threads: 0 0 0 0 0 0 1 0
  Deviation (coverage): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Deviation (walltime): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Categories: NA | NA | NA | NA | NA | NA | Pthread (%): 100.00 | NA
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00

kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() (module: libomp.so)
  Coverage (%): 0.00 0.14 0.51 1.05 2.04 3.30 3.78 4.26
  Coverage Excluding Loops (%): 0.00 0.14 0.51 1.05 2.04 3.30 3.78 4.26
  Max Inclusive Time Over Threads (s): 0.00 0.23 0.33 0.32 0.51 0.69 0.37 0.35
  Max Exclusive Time Over Threads (s): 0.00 0.23 0.33 0.32 0.51 0.69 0.37 0.35
  Inclusive Time w.r.t. Wall Time (s): 0.00 0.12 0.24 0.30 0.39 0.47 0.48 0.50
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.12 0.24 0.30 0.39 0.47 0.48 0.50
  Nb Threads: 0 1 4 7 15 31 48 64
  Deviation (coverage): 0.00 0.00 0.34 0.13 0.50 0.96 0.80 0.85
  Deviation (walltime): 0.00 0.00 0.14 0.03 0.07 0.08 0.06 0.05
  Categories: NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Efficiency: 1 1 1 1 1 1 1 1
  Potential Speed-Up (%): 0 0 0 0 0 0 0 0

__sched_yield (module: libc.so.6)
  Coverage (%): 0.00 0.09 0.27 0.49 0.99 1.51 1.92 2.21
  Coverage Excluding Loops (%): 0.00 0.09 0.27 0.49 0.99 1.51 1.92 2.21
  Max Inclusive Time Over Threads (s): 0.00 0.15 0.19 0.16 0.17 0.20 0.20 0.21
  Max Exclusive Time Over Threads (s): 0.00 0.15 0.19 0.16 0.17 0.20 0.20 0.21
  Inclusive Time w.r.t. Wall Time (s): 0.00 0.08 0.13 0.14 0.19 0.21 0.24 0.26
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.08 0.13 0.14 0.19 0.21 0.24 0.26
  Nb Threads: 0 1 3 7 15 31 47 63
  Deviation (coverage): 0.00 0.00 0.11 0.09 0.15 0.37 0.34 0.43
  Deviation (walltime): 0.00 0.00 0.05 0.02 0.02 0.03 0.02 0.03
  Categories: NA | Pthread (%): 100.00 | Pthread (%): 100.00, System (%): 0.00 | Pthread (%): 100.00, System (%): 0.00 | Pthread (%): 100.00, System (%): 0.00 | Pthread (%): 100.00, System (%): 0.00 | Pthread (%): 100.00, System (%): 0.00 | Pthread (%): 100.00, System (%): 0.00
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Efficiency: 1 1 1 1 1 1 1 1
  Potential Speed-Up (%): 0 0 0 0 0 0 0 0

unknown_function (module: [vdso])
  Coverage (%): 0.00 0.01 0.04 0.07 0.19 0.28 0.38 0.40
  Coverage Excluding Loops (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Inclusive Time Over Threads (s): 0.00 0.01 0.04 0.03 0.05 0.06 0.06 0.06
  Max Exclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Inclusive Time w.r.t. Wall Time (s): 0.00 0.01 0.02 0.02 0.04 0.04 0.05 0.05
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Nb Threads: 0 1 3 7 15 31 47 62
  Deviation (coverage): 0.00 0.00 0.04 0.03 0.09 0.14 0.14 0.19
  Deviation (walltime): 0.00 0.00 0.02 0.01 0.01 0.01 0.01 0.01
  Categories: NA | Pthread (%): 100.00 | Pthread (%): 100.00, Others (%): 0.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00, Others (%): 0.00 | Pthread (%): 100.00, Others (%): 0.00
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Efficiency: 1 1 1 1 1 1 1 1
  Potential Speed-Up (%): 0 0 0 0 0 0 0 0

kmp_flag_native<unsigned long long, (flag_type)1, true>::done_check() (module: libomp.so)
  Coverage (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Coverage Excluding Loops (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Inclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Exclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Inclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Nb Threads: 0 0 0 0 0 0 0 1
  Deviation (coverage): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Deviation (walltime): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Categories: NA | NA | NA | NA | NA | NA | NA | OMP (%): 100.00
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00

__memchr (module: libastring.so)
  Coverage (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Coverage Excluding Loops (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Inclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Exclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Inclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Nb Threads: 0 0 0 0 0 0 1 0
  Deviation (coverage): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Deviation (walltime): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Categories: NA | NA | NA | NA | NA | NA | String (%): 100.00 | NA
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00

__aarch64_ldset8_acq_rel (module: libomp.so)
  Coverage (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Coverage Excluding Loops (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Inclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Exclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Inclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Nb Threads: 0 0 0 0 0 0 1 0
  Deviation (coverage): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Deviation (walltime): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Categories: NA | NA | NA | NA | NA | NA | OMP (%): 100.00 | NA
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00

@plt_start@ (module: libomp.so)
  Coverage (%): 0.00 0.01 0.01 0.03 0.07 0.12 0.13 0.18
  Coverage Excluding Loops (%): 0.00 0.01 0.01 0.03 0.07 0.12 0.13 0.18
  Max Inclusive Time Over Threads (s): 0.00 0.01 0.01 0.01 0.02 0.03 0.03 0.04
  Max Exclusive Time Over Threads (s): 0.00 0.01 0.01 0.01 0.02 0.03 0.03 0.04
  Inclusive Time w.r.t. Wall Time (s): 0.00 0.01 0.01 0.01 0.01 0.02 0.02 0.02
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.01 0.01 0.01 0.01 0.02 0.02 0.02
  Nb Threads: 0 1 2 6 14 27 38 56
  Deviation (coverage): 0.00 0.00 0.01 0.02 0.05 0.08 0.10 0.11
  Deviation (walltime): 0.00 0.00 0.00 0.00 0.01 0.01 0.01 0.01
  Categories: NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Efficiency: 1 1 1 1 1 1 1 1
  Potential Speed-Up (%): 0 0 0 0 0 0 0 0

void __kmp_suspend_64<false, true>(int, kmp_flag_64<false, true>*) (module: libomp.so)
  Coverage (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Coverage Excluding Loops (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Inclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.01
  Max Exclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.01
  Inclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Nb Threads: 0 0 0 0 0 0 0 1
  Deviation (coverage): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Deviation (walltime): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Categories: NA | NA | NA | NA | NA | NA | NA | OMP (%): 100.00
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00

__kmp_join_barrier(int) (module: libomp.so)
  Coverage (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Coverage Excluding Loops (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Inclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Exclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Inclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Nb Threads: 0 0 0 0 0 0 1 1
  Deviation (coverage): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Deviation (walltime): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Categories: NA | NA | NA | NA | NA | NA | OMP (%): 100.00 | OMP (%): 100.00
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00

kmp_flag_64<false, true>::wait(kmp_info*, int, void*) (module: libomp.so)
  Coverage (%): 0.00 2.01 5.77 12.41 22.39 36.36 45.58 52.01
  Coverage Excluding Loops (%): 0.00 2.01 5.77 12.41 22.39 36.36 45.58 52.01
  Max Inclusive Time Over Threads (s): 0.00 3.33 3.41 3.48 3.50 3.48 3.49 3.58
  Max Exclusive Time Over Threads (s): 0.00 3.33 3.41 3.48 3.50 3.48 3.49 3.58
  Inclusive Time w.r.t. Wall Time (s): 0.00 1.72 2.76 3.54 4.26 5.15 5.73 6.11
  Exclusive Time w.r.t. Wall Time (s): 0.00 1.72 2.76 3.54 4.26 5.15 5.73 6.11
  Nb Threads: 0 1 4 8 16 32 48 64
  Deviation (coverage): 0.00 0.00 3.91 5.17 6.15 6.83 6.87 6.75
  Deviation (walltime): 0.00 0.00 1.66 1.19 0.83 0.59 0.49 0.42
  Categories: NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Efficiency: 1 1 1 1 1 1 1 1
  Potential Speed-Up (%): 0 0 0 0 0 0 0 0

__kmp_invoke_task_func (module: libomp.so)
  Coverage (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Coverage Excluding Loops (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Inclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Exclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Inclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Nb Threads: 0 0 0 0 0 0 1 0
  Deviation (coverage): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Deviation (walltime): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Categories: NA | NA | NA | NA | NA | NA | OMP (%): 100.00 | NA
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00

__kmp_now_nsec (module: libomp.so)
  Coverage (%): 0.00 0.00 0.01 0.01 0.03 0.02 0.03 0.03
  Coverage Excluding Loops (%): 0.00 0.00 0.01 0.01 0.03 0.02 0.03 0.03
  Max Inclusive Time Over Threads (s): 0.00 0.00 0.01 0.01 0.01 0.01 0.01 0.03
  Max Exclusive Time Over Threads (s): 0.00 0.00 0.01 0.01 0.01 0.01 0.01 0.03
  Inclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.01 0.00 0.00 0.00
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.01 0.00 0.00 0.00
  Nb Threads: 0 0 2 2 8 11 15 18
  Deviation (coverage): 0.00 0.00 0.01 0.02 0.02 0.04 0.04 0.08
  Deviation (walltime): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.01
  Categories: NA | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00

unknown_function (module: libc.so.6)
  Coverage (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Coverage Excluding Loops (%): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Inclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Max Exclusive Time Over Threads (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Inclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Exclusive Time w.r.t. Wall Time (s): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Nb Threads: 0 0 0 0 0 0 1 0
  Deviation (coverage): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Deviation (walltime): 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
  Categories: NA | NA | NA | NA | NA | NA | Pthread (%): 100.00 | NA
  GFLOPS: 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
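
The report does not say how the Efficiency column is derived. As a rough cross-check, the sketch below applies the textbook strong-scaling formula T(1) / (n * T(n)) to the Inclusive Time w.r.t. Wall Time values of the k_means [clone .omp_outlined] row. The function name `parallel_efficiency` and the choice of this formula are assumptions made for illustration, not the tool's documented method; only the thread counts and timings are copied from the table.

```python
# Thread counts of the eight runs in the report.
THREADS = [1, 2, 4, 8, 16, 32, 48, 64]

# "Inclusive Time w.r.t. Wall Time (s)" for the
# k_means [clone .omp_outlined] row, one value per run.
INCLUSIVE_WALL_S = [152.86, 79.11, 42.20, 23.12, 13.32, 7.80, 5.71, 4.53]


def parallel_efficiency(t_ref, t_n, n_threads):
    """Textbook strong-scaling efficiency: T(1) / (n * T(n)).

    Hypothetical helper: the report's own formula is not stated.
    """
    return t_ref / (n_threads * t_n)


# Use the single-thread run as the reference time.
efficiencies = [
    round(parallel_efficiency(INCLUSIVE_WALL_S[0], t, n), 2)
    for n, t in zip(THREADS, INCLUSIVE_WALL_S)
]

print(efficiencies)
# prints [1.0, 0.97, 0.91, 0.83, 0.72, 0.61, 0.56, 0.53]
```

Under this assumption the computed values agree, to two decimals, with the Efficiency entries reported for that row. For the loop rows the reported values appear to track the Exclusive Time w.r.t. Wall Time column instead, so the input column should be chosen per row.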