options

Functions and Loops

Columns Filter

Coverage run_1_thread (%) Coverage run_2_threads (%) Coverage run_4_threads (%) Coverage run_8_threads (%) Coverage run_16_threads (%) Coverage run_32_threads (%) Coverage run_48_threads (%) Coverage run_64_threads (%) Coverage Excluding Loops run_1_thread (%) Coverage Excluding Loops run_2_threads (%) Coverage Excluding Loops run_4_threads (%) Coverage Excluding Loops run_8_threads (%) Coverage Excluding Loops run_16_threads (%) Coverage Excluding Loops run_32_threads (%) Coverage Excluding Loops run_48_threads (%) Coverage Excluding Loops run_64_threads (%) Max Inclusive Time Over Threads run_1_thread (s) Max Inclusive Time Over Threads run_2_threads (s) Max Inclusive Time Over Threads run_4_threads (s) Max Inclusive Time Over Threads run_8_threads (s) Max Inclusive Time Over Threads run_16_threads (s) Max Inclusive Time Over Threads run_32_threads (s) Max Inclusive Time Over Threads run_48_threads (s) Max Inclusive Time Over Threads run_64_threads (s) Max Exclusive Time Over Threads run_1_thread (s) Max Exclusive Time Over Threads run_2_threads (s) Max Exclusive Time Over Threads run_4_threads (s) Max Exclusive Time Over Threads run_8_threads (s) Max Exclusive Time Over Threads run_16_threads (s) Max Exclusive Time Over Threads run_32_threads (s) Max Exclusive Time Over Threads run_48_threads (s) Max Exclusive Time Over Threads run_64_threads (s) Inclusive Time w.r.t. Wall Time run_1_thread (s) Inclusive Time w.r.t. Wall Time run_2_threads (s) Inclusive Time w.r.t. Wall Time run_4_threads (s) Inclusive Time w.r.t. Wall Time run_8_threads (s) Inclusive Time w.r.t. Wall Time run_16_threads (s) Inclusive Time w.r.t. Wall Time run_32_threads (s) Inclusive Time w.r.t. Wall Time run_48_threads (s) Inclusive Time w.r.t. Wall Time run_64_threads (s) Exclusive Time w.r.t. Wall Time run_1_thread (s) Exclusive Time w.r.t. Wall Time run_2_threads (s) Exclusive Time w.r.t. Wall Time run_4_threads (s) Exclusive Time w.r.t. Wall Time run_8_threads (s) Exclusive Time w.r.t. Wall Time run_16_threads (s) Exclusive Time w.r.t. Wall Time run_32_threads (s) Exclusive Time w.r.t. Wall Time run_48_threads (s) Exclusive Time w.r.t. Wall Time run_64_threads (s) Nb Threads run_1_thread Nb Threads run_2_threads Nb Threads run_4_threads Nb Threads run_8_threads Nb Threads run_16_threads Nb Threads run_32_threads Nb Threads run_48_threads Nb Threads run_64_threads Deviation (coverage) run_1_thread Deviation (coverage) run_2_threads Deviation (coverage) run_4_threads Deviation (coverage) run_8_threads Deviation (coverage) run_16_threads Deviation (coverage) run_32_threads Deviation (coverage) run_48_threads Deviation (coverage) run_64_threads Deviation (walltime) run_1_thread Deviation (walltime) run_2_threads Deviation (walltime) run_4_threads Deviation (walltime) run_8_threads Deviation (walltime) run_16_threads Deviation (walltime) run_32_threads Deviation (walltime) run_48_threads Deviation (walltime) run_64_threads Categories run_1_thread Categories run_2_threads Categories run_4_threads Categories run_8_threads Categories run_16_threads Categories run_32_threads Categories run_48_threads Categories run_64_threads GFLOPS run_1_thread GFLOPS run_2_threads GFLOPS run_4_threads GFLOPS run_8_threads GFLOPS run_16_threads GFLOPS run_32_threads GFLOPS run_48_threads GFLOPS run_64_threads Compilation Options (run_1_thread) Efficiency (run_1_thread) Potential Speed-Up (%) (run_2_threads) Efficiency (run_2_threads) Potential Speed-Up (%) (run_4_threads) Efficiency (run_4_threads) Potential Speed-Up (%) (run_8_threads) Efficiency (run_8_threads) Potential Speed-Up (%) (run_16_threads) Efficiency (run_16_threads) Potential Speed-Up (%) (run_32_threads) Efficiency (run_32_threads) Potential Speed-Up (%) (run_48_threads) Efficiency (run_48_threads) Potential Speed-Up (%) (run_64_threads) Efficiency (run_64_threads) Potential Speed-Up (%)
NameModuleCoverage run_1_thread (%)Coverage run_2_threads (%)Coverage run_4_threads (%)Coverage run_8_threads (%)Coverage run_16_threads (%)Coverage run_32_threads (%)Coverage run_48_threads (%)Coverage run_64_threads (%)Coverage Excluding Loops run_1_thread (%)Coverage Excluding Loops run_2_threads (%)Coverage Excluding Loops run_4_threads (%)Coverage Excluding Loops run_8_threads (%)Coverage Excluding Loops run_16_threads (%)Coverage Excluding Loops run_32_threads (%)Coverage Excluding Loops run_48_threads (%)Coverage Excluding Loops run_64_threads (%)Max Inclusive Time Over Threads run_1_thread (s)Max Inclusive Time Over Threads run_2_threads (s)Max Inclusive Time Over Threads run_4_threads (s)Max Inclusive Time Over Threads run_8_threads (s)Max Inclusive Time Over Threads run_16_threads (s)Max Inclusive Time Over Threads run_32_threads (s)Max Inclusive Time Over Threads run_48_threads (s)Max Inclusive Time Over Threads run_64_threads (s)Max Exclusive Time Over Threads run_1_thread (s)Max Exclusive Time Over Threads run_2_threads (s)Max Exclusive Time Over Threads run_4_threads (s)Max Exclusive Time Over Threads run_8_threads (s)Max Exclusive Time Over Threads run_16_threads (s)Max Exclusive Time Over Threads run_32_threads (s)Max Exclusive Time Over Threads run_48_threads (s)Max Exclusive Time Over Threads run_64_threads (s)Inclusive Time w.r.t. Wall Time run_1_thread (s)Inclusive Time w.r.t. Wall Time run_2_threads (s)Inclusive Time w.r.t. Wall Time run_4_threads (s)Inclusive Time w.r.t. Wall Time run_8_threads (s)Inclusive Time w.r.t. Wall Time run_16_threads (s)Inclusive Time w.r.t. Wall Time run_32_threads (s)Inclusive Time w.r.t. Wall Time run_48_threads (s)Inclusive Time w.r.t. Wall Time run_64_threads (s)Exclusive Time w.r.t. Wall Time run_1_thread (s)Exclusive Time w.r.t. Wall Time run_2_threads (s)Exclusive Time w.r.t. Wall Time run_4_threads (s)Exclusive Time w.r.t. Wall Time run_8_threads (s)Exclusive Time w.r.t. Wall Time run_16_threads (s)Exclusive Time w.r.t. Wall Time run_32_threads (s)Exclusive Time w.r.t. Wall Time run_48_threads (s)Exclusive Time w.r.t. Wall Time run_64_threads (s)Nb Threads run_1_threadNb Threads run_2_threadsNb Threads run_4_threadsNb Threads run_8_threadsNb Threads run_16_threadsNb Threads run_32_threadsNb Threads run_48_threadsNb Threads run_64_threadsDeviation (coverage) run_1_threadDeviation (coverage) run_2_threadsDeviation (coverage) run_4_threadsDeviation (coverage) run_8_threadsDeviation (coverage) run_16_threadsDeviation (coverage) run_32_threadsDeviation (coverage) run_48_threadsDeviation (coverage) run_64_threadsDeviation (walltime) run_1_threadDeviation (walltime) run_2_threadsDeviation (walltime) run_4_threadsDeviation (walltime) run_8_threadsDeviation (walltime) run_16_threadsDeviation (walltime) run_32_threadsDeviation (walltime) run_48_threadsDeviation (walltime) run_64_threadsCategories run_1_threadCategories run_2_threadsCategories run_4_threadsCategories run_8_threadsCategories run_16_threadsCategories run_32_threadsCategories run_48_threadsCategories run_64_threadsGFLOPS run_1_threadGFLOPS run_2_threadsGFLOPS run_4_threadsGFLOPS run_8_threadsGFLOPS run_16_threadsGFLOPS run_32_threadsGFLOPS run_48_threadsGFLOPS run_64_threadsCompilation Options(run_1_thread) Efficiency(run_1_thread) Potential Speed-Up (%)(run_2_threads) Efficiency(run_2_threads) Potential Speed-Up (%)(run_4_threads) Efficiency(run_4_threads) Potential Speed-Up (%)(run_8_threads) Efficiency(run_8_threads) Potential Speed-Up (%)(run_16_threads) Efficiency(run_16_threads) Potential Speed-Up (%)(run_32_threads) Efficiency(run_32_threads) Potential Speed-Up (%)(run_48_threads) Efficiency(run_48_threads) Potential Speed-Up (%)(run_64_threads) Efficiency(run_64_threads) Potential Speed-Up (%)
k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone .omp_outlined]+kmeans-acfl-Ofast-soa94.2992.1688.1080.9670.0355.1345.5638.540.000.000.000.000.000.000.000.00152.9476.6038.3819.279.694.893.262.460.000.000.000.000.000.000.000.00152.9479.1141.9723.1013.277.805.724.540.000.000.000.000.000.000.000.001248163248640.004.275.195.575.183.862.972.320.000.060.040.030.010.010.010.01Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.005.7211.0620.8537.8865.94112.17153.07192.75Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fopenmp -Ofast -greco...100.973.070.917.850.8313.950.7219.590.6121.360.5620.170.5318.25
Loop 7 - main_soa.cpp:58-69 - kmeans-acfl-Ofast-soa+94.2992.1688.1080.9670.0355.1345.5638.540.000.000.000.000.000.000.000.00152.9476.6038.6819.819.945.123.502.620.000.000.000.000.000.000.000.00152.9479.1141.9723.1013.277.805.724.540.000.000.000.000.000.000.000.00000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 8 - main_soa.cpp:59-69 - kmeans-acfl-Ofast-soa+94.2992.1688.1080.9670.0355.1345.5638.5418.8619.1717.4616.1513.9011.009.207.68152.9476.6038.6819.819.945.123.502.6230.5915.957.824.142.021.130.780.59152.9479.1141.9723.1013.277.805.724.5430.5916.468.324.612.631.561.150.901248163248640.000.851.051.361.000.980.990.700.000.040.180.170.060.060.050.046.5612.0224.0443.7075.87128.73175.61221.09100.931.350.921.40.832.750.733.810.614.250.554.120.533.62
Loop 9 - main_soa.cpp:62-67 - kmeans-acfl-Ofast-soa75.4372.9970.6564.8056.1344.1336.3630.8775.4372.9970.6564.8056.1344.1336.3630.87122.3460.6430.8615.677.923.992.722.03122.3460.6430.8615.677.923.992.722.03122.3462.6533.6618.4910.646.244.563.63122.3462.6533.6618.4910.646.244.563.631248163248640.003.424.244.484.313.222.422.070.000.020.180.170.070.060.050.045.5110.8120.0636.4363.48108.04147.37185.70100.981.720.916.440.8311.20.7215.780.6117.110.5616.050.5314.63
k_means(int, point_t&, point_t&, int*, point_t&, int, int)+kmeans-acfl-Ofast-soa5.705.565.324.874.193.282.712.300.000.000.000.000.000.000.000.009.259.239.259.259.269.279.269.290.000.000.000.000.000.000.000.009.254.772.531.390.790.460.340.270.000.000.000.000.000.000.000.00111111110.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.000.541.051.983.606.3010.7614.7318.47Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fopenmp -Ofast -greco...100.970.170.910.460.830.810.731.140.621.240.571.170.531.07
Loop 4 - main_soa.cpp:56-93 - kmeans-acfl-Ofast-soa [...]+5.705.565.324.874.193.282.712.300.000.000.000.000.000.000.000.009.259.239.259.259.269.279.269.290.000.000.000.000.000.000.000.009.254.772.531.390.790.460.340.270.000.000.000.000.000.000.000.00000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 6 - main_soa.cpp:81-84 - kmeans-acfl-Ofast-soa5.705.565.324.874.193.282.712.305.705.565.324.874.193.282.712.309.259.239.259.259.269.279.269.299.259.239.259.259.269.279.269.299.254.772.531.390.790.460.340.279.254.772.531.390.790.460.340.27111111110.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.541.051.983.606.3010.7614.7318.47100.970.170.910.460.830.810.731.140.621.240.571.170.531.07
Loop 5 - main_soa.cpp:86-93 - kmeans-acfl-Ofast-soa0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check()libomp.so0.000.150.491.022.023.163.844.480.000.150.491.022.023.163.844.480.000.250.320.330.500.390.380.380.000.250.320.330.500.390.380.380.000.130.240.290.380.450.480.530.000.130.240.290.380.450.480.530137153147630.000.000.080.170.470.470.580.700.000.000.040.040.060.040.040.04NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.000.001010101010101010
__sched_yieldlibc.so.60.000.080.240.520.961.592.062.280.000.080.240.520.961.592.062.280.000.140.150.200.190.180.230.220.000.140.150.200.190.180.230.220.000.070.110.150.180.230.260.270.000.070.110.150.180.230.260.270137153147630.000.000.020.170.220.270.420.480.000.000.010.040.030.020.030.03NAPthread (%): 100.00
System (%): 0.00
Pthread (%): 100.00
System (%): 0.00
Pthread (%): 100.00
System (%): 0.00
Pthread (%): 100.00
System (%): 0.00
Pthread (%): 100.00
System (%): 0.00
Pthread (%): 100.00
System (%): 0.00
Pthread (%): 100.00
System (%): 0.00
0.000.000.000.000.000.000.000.001010101010101010
pthread_createlibc.so.60.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000010.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANANANANAOMP (%): 100.000.000.000.000.000.000.000.000.00
unknown_function[vdso]0.000.010.040.120.190.220.370.370.000.000.000.000.000.000.000.000.000.020.040.060.060.040.060.050.000.000.000.000.000.000.000.000.000.010.020.030.040.030.050.040.000.000.000.000.000.000.000.000127153147630.000.000.030.070.100.090.150.170.000.000.010.010.010.010.010.01NAPthread (%): 100.00Pthread (%): 100.00Pthread (%): 100.00Pthread (%): 100.00
Others (%): 0.00
Pthread (%): 100.00
Others (%): 0.00
Pthread (%): 100.00
Others (%): 0.00
Pthread (%): 100.000.000.000.000.000.000.000.000.001010101010101010
kmp_flag_native<unsigned long long, (flag_type)1, true>::done_check()libomp.so0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000010.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANANANANAOMP (%): 100.000.000.000.000.000.000.000.000.00
__kmp_join_barrier(int)libomp.so0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000001000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANANAOMP (%): 100.00NANA0.000.000.000.000.000.000.000.00
kmp_flag_64<false, true>::wait(kmp_info*, int, void*)libomp.so0.002.025.7712.4322.4636.4145.2151.740.002.025.7712.4322.4636.4145.2151.740.003.363.413.503.493.503.513.500.003.363.413.503.493.503.513.500.001.742.753.554.265.155.676.090.001.742.753.554.265.155.676.090138153148640.000.000.145.170.610.786.806.710.000.000.061.190.090.080.480.42NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.000.001010101010101010
pthread_cond_waitlibc.so.60.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000010.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANANANANAPthread (%): 100.000.000.000.000.000.000.000.000.00
@plt_start@libomp.so0.000.010.010.050.100.120.140.160.000.010.010.050.100.120.140.160.000.010.020.030.030.030.030.020.000.010.020.030.030.030.030.020.000.010.010.010.020.020.020.020.000.010.010.010.020.020.020.020127152741530.000.000.030.040.060.070.090.100.000.000.010.010.010.010.010.01NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.000.001010101010101010
__kmp_now_nseclibomp.so0.000.000.000.020.020.030.030.050.000.000.000.020.020.030.030.050.000.000.000.010.000.010.010.010.000.000.000.010.000.010.010.010.000.000.000.010.000.000.000.010.000.000.000.010.000.000.000.01001571219300.000.000.000.010.000.040.030.040.000.000.000.000.000.000.000.00NANAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.000.00
__xpg_basenamelibc.so.60.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00001000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANASystem (%): 100.00NANANANANA0.000.000.000.000.000.000.000.00
unknown_functionlibc.so.60.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000010010.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANAOMP (%): 100.00NANAPthread (%): 100.000.000.000.000.000.000.000.000.00
×