Functions and Loops

Each metric lists one value per run, in this order:
run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_48_threads | run_64_threads | run_80_threads | run_96_threads

Name: k_means(int, point_t*, point_t*, int*, point_t*, int, int)
Module: kmeans-acfl-Ofast
Coverage (%): 100.00 | 96.53 | 55.78 | 23.96 | 12.04 | 5.72 | 3.71 | 2.68 | 2.13 | 1.82
Coverage Excluding Loops (%): 0.00 in every run
Max Inclusive Time Over Threads (s): 7.64 | 7.64 | 7.65 | 7.64 | 7.65 | 7.67 | 7.60 | 7.45 | 7.53 | 7.30
Max Exclusive Time Over Threads (s): 0.00 in every run
Inclusive Time w.r.t. Wall Time (s): 7.64 | 7.38 | 4.28 | 1.85 | 0.93 | 0.44 | 0.28 | 0.21 | 0.16 | 0.15
Exclusive Time w.r.t. Wall Time (s): 0.00 in every run
Nb Threads: 1 in every run
Deviation (coverage): 0.00 in every run
Deviation (walltime): 0.00 in every run
Categories: Exe (%): 100.00 in every run
GFLOPS: 0.65 | 0.68 | 1.16 | 2.69 | 5.29 | 11.15 | 17.43 | 23.57 | 30.41 | 31.39
Compilation Options: Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fopenmp -Ofast -g...
Efficiency: 1 | 0.52 | 0.45 | 0.52 | 0.51 | 0.54 | 0.56 | 0.58 | 0.59 | 0.53
Potential Speed-Up (%): 0 | 46.56 | 30.89 | 11.58 | 5.88 | 2.65 | 1.63 | 1.13 | 0.88 | 0.85

Name: Loop 4 - main.cpp:21-101 - kmeans-acfl-Ofast [...]
Coverage (%): 100.00 | 96.53 | 55.78 | 23.96 | 12.04 | 5.72 | 3.71 | 2.68 | 2.13 | 1.82
Coverage Excluding Loops (%): 0.00 in every run
Max Inclusive Time Over Threads (s): 7.64 | 7.64 | 7.65 | 7.64 | 7.65 | 7.67 | 7.60 | 7.45 | 7.53 | 7.30
Max Exclusive Time Over Threads (s): 0.00 in every run
Inclusive Time w.r.t. Wall Time (s): 7.64 | 7.38 | 4.28 | 1.85 | 0.93 | 0.44 | 0.28 | 0.21 | 0.16 | 0.15
Exclusive Time w.r.t. Wall Time (s): 0.00 in every run
Nb Threads: 0 in every run
Deviation (coverage): 0.00 in every run
Deviation (walltime): 0.00 in every run
Categories: -
GFLOPS: 0.00 in every run
Efficiency: -
Potential Speed-Up (%): -

Name: Loop 6 - main.cpp:94-101 - kmeans-acfl-Ofast
Coverage (%), Coverage Excluding Loops (%), Max Inclusive Time Over Threads (s), Max Exclusive Time Over Threads (s), Inclusive Time w.r.t. Wall Time (s), Exclusive Time w.r.t. Wall Time (s): 0.00 in every run
Nb Threads: 0 in every run
Deviation (coverage), Deviation (walltime): 0.00 in every run
Categories: -
GFLOPS: 0.00 in every run
Efficiency: -
Potential Speed-Up (%): -

Name: Loop 5 - main.cpp:94-100 - kmeans-acfl-Ofast
Coverage (%), Coverage Excluding Loops (%), Max Inclusive Time Over Threads (s), Max Exclusive Time Over Threads (s), Inclusive Time w.r.t. Wall Time (s), Exclusive Time w.r.t. Wall Time (s): 0.00 in every run
Nb Threads: 0 in every run
Deviation (coverage), Deviation (walltime): 0.00 in every run
Categories: -
GFLOPS: 0.00 in every run
Efficiency: -
Potential Speed-Up (%): -

Name: Loop 7 - main.cpp:89-92 - kmeans-acfl-Ofast
Coverage (%): 100.00 | 96.53 | 55.78 | 23.96 | 12.04 | 5.72 | 3.71 | 2.68 | 2.13 | 1.82
Coverage Excluding Loops (%): 100.00 | 96.53 | 55.78 | 23.96 | 12.04 | 5.72 | 3.71 | 2.68 | 2.13 | 1.82
Max Inclusive Time Over Threads (s): 7.64 | 7.64 | 7.65 | 7.64 | 7.65 | 7.67 | 7.60 | 7.45 | 7.53 | 7.30
Max Exclusive Time Over Threads (s): 7.64 | 7.64 | 7.65 | 7.64 | 7.65 | 7.67 | 7.60 | 7.45 | 7.53 | 7.30
Inclusive Time w.r.t. Wall Time (s): 7.64 | 7.38 | 4.28 | 1.85 | 0.93 | 0.44 | 0.28 | 0.21 | 0.16 | 0.15
Exclusive Time w.r.t. Wall Time (s): 7.64 | 7.38 | 4.28 | 1.85 | 0.93 | 0.44 | 0.28 | 0.21 | 0.16 | 0.15
Nb Threads: 1 in every run
Deviation (coverage): 0.00 in every run
Deviation (walltime): 0.00 in every run
Categories: -
GFLOPS: 0.65 | 0.68 | 1.16 | 2.69 | 5.29 | 11.15 | 17.43 | 23.57 | 30.41 | 31.39
Efficiency: 1 | 0.52 | 0.45 | 0.52 | 0.51 | 0.54 | 0.56 | 0.58 | 0.59 | 0.53
Potential Speed-Up (%): 0 | 46.56 | 30.89 | 11.58 | 5.88 | 2.65 | 1.63 | 1.13 | 0.88 | 0.85

Name: __kmp_now_nsec
Module: libomp.so
Coverage (%): 0.00 | 0.00 | 0.00 | 0.09 | 0.10 | 0.09 | 0.09 | 0.08 | 0.06 | 0.08
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.00 | 0.09 | 0.10 | 0.09 | 0.09 | 0.08 | 0.06 | 0.08
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01
Nb Threads: 0 | 0 | 0 | 5 | 10 | 14 | 25 | 34 | 31 | 49
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.04 | 0.06 | 0.11 | 0.07 | 0.07 | 0.05 | 0.07
Deviation (walltime): 0.00 in every run
Categories: NA | NA | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
GFLOPS: 0.00 in every run
Efficiency: -
Potential Speed-Up (%): -

Name: __GI___pthread_cond_wait
Module: libc.so.6
Coverage (%), Coverage Excluding Loops (%), Max Inclusive Time Over Threads (s), Max Exclusive Time Over Threads (s), Inclusive Time w.r.t. Wall Time (s), Exclusive Time w.r.t. Wall Time (s): 0.00 in every run
Nb Threads: 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1
Deviation (coverage), Deviation (walltime): 0.00 in every run
Categories: NA | NA | NA | NA | NA | NA | NA | NA | NA | OMP (%): 100.00
GFLOPS: 0.00 in every run
Efficiency: -
Potential Speed-Up (%): -

Name: __kmp_fork_call
Module: libomp.so
Coverage (%): 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s), Max Exclusive Time Over Threads (s), Inclusive Time w.r.t. Wall Time (s), Exclusive Time w.r.t. Wall Time (s): 0.00 in every run
Nb Threads: 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0
Deviation (coverage), Deviation (walltime): 0.00 in every run
Categories: NA | NA | OMP (%): 100.00 | NA | NA | NA | NA | NA | NA | NA
GFLOPS: 0.00 in every run
Efficiency: -
Potential Speed-Up (%): -

Name: std::ostream& std::ostream::_M_insert<long>(long)
Module: libstdc++.so.6.0.33
Coverage (%): 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s), Max Exclusive Time Over Threads (s), Inclusive Time w.r.t. Wall Time (s), Exclusive Time w.r.t. Wall Time (s): 0.00 in every run
Nb Threads: 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0
Deviation (coverage), Deviation (walltime): 0.00 in every run
Categories: NA | NA | Others (%): 100.00 | NA | NA | NA | NA | NA | NA | NA
GFLOPS: 0.00 in every run
Efficiency: -
Potential Speed-Up (%): -

Name: unknown_function
Module: [vdso]
Coverage (%): 0.00 | 0.13 | 0.58 | 0.86 | 0.64 | 0.68 | 0.68 | 0.64 | 0.63 | 0.62
Coverage Excluding Loops (%): 0.00 in every run
Max Inclusive Time Over Threads (s): 0.00 | 0.01 | 0.03 | 0.07 | 0.04 | 0.05 | 0.05 | 0.06 | 0.06 | 0.05
Max Exclusive Time Over Threads (s): 0.00 in every run
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.01 | 0.04 | 0.07 | 0.05 | 0.05 | 0.05 | 0.05 | 0.05 | 0.05
Exclusive Time w.r.t. Wall Time (s): 0.00 in every run
Nb Threads: 0 | 1 | 3 | 7 | 15 | 31 | 47 | 63 | 79 | 95
Deviation (coverage): 0.00 | 0.00 | 0.40 | 0.44 | 0.23 | 0.27 | 0.25 | 0.26 | 0.26 | 0.29
Deviation (walltime): 0.00 | 0.00 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01
Categories: NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00, Others (%): 0.00 | OMP (%): 100.00 | OMP (%): 100.00, Others (%): 0.00 | OMP (%): 100.00, Others (%): 0.00 | OMP (%): 100.00, Others (%): 0.00 | OMP (%): 100.00, Others (%): 0.00 | OMP (%): 100.00, Others (%): 0.00
GFLOPS: 0.00 in every run
Efficiency: 1 in every run
Potential Speed-Up (%): 0 in every run

Name: __kmp_yield
Module: libomp.so
Coverage (%): 0.00 | 0.00 | 0.07 | 0.28 | 0.13 | 0.16 | 0.11 | 0.16 | 0.12 | 0.13
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.07 | 0.28 | 0.13 | 0.16 | 0.11 | 0.16 | 0.12 | 0.13
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01
Nb Threads: 0 | 0 | 2 | 6 | 10 | 24 | 25 | 46 | 51 | 60
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.16 | 0.14 | 0.11 | 0.12 | 0.12 | 0.10 | 0.12
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00
Categories: NA | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
GFLOPS: 0.00 in every run
Efficiency: 1 in every run
Potential Speed-Up (%): 0 in every run

Name: __kmp_initialize_info(kmp_info*, kmp_team*, int, int)
Module: libomp.so
Coverage (%), Coverage Excluding Loops (%), Max Inclusive Time Over Threads (s), Max Exclusive Time Over Threads (s), Inclusive Time w.r.t. Wall Time (s), Exclusive Time w.r.t. Wall Time (s): 0.00 in every run
Nb Threads: 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0
Deviation (coverage), Deviation (walltime): 0.00 in every run
Categories: NA | NA | NA | NA | NA | NA | OMP (%): 100.00 | NA | NA | NA
GFLOPS: 0.00 in every run
Efficiency: -
Potential Speed-Up (%): -

Name: __vsnprintf_internal
Module: libc.so.6
Coverage (%), Coverage Excluding Loops (%), Max Inclusive Time Over Threads (s), Max Exclusive Time Over Threads (s), Inclusive Time w.r.t. Wall Time (s), Exclusive Time w.r.t. Wall Time (s): 0.00 in every run
Nb Threads: 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0
Deviation (coverage), Deviation (walltime): 0.00 in every run
Categories: NA | NA | NA | NA | NA | NA | NA | String (%): 100.00 | NA | NA
GFLOPS: 0.00 in every run
Efficiency: -
Potential Speed-Up (%): -

Name: @plt_start@
Module: libomp.so
Coverage (%): 0.00 | 0.00 | 0.15 | 0.20 | 0.20 | 0.27 | 0.21 | 0.22 | 0.21 | 0.19
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.15 | 0.20 | 0.20 | 0.27 | 0.21 | 0.22 | 0.21 | 0.19
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.01 | 0.02 | 0.03 | 0.02 | 0.02 | 0.03 | 0.03 | 0.02
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.01 | 0.02 | 0.03 | 0.02 | 0.02 | 0.03 | 0.03 | 0.02
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02
Nb Threads: 0 | 0 | 2 | 6 | 13 | 28 | 39 | 51 | 67 | 75
Deviation (coverage): 0.00 | 0.00 | 0.28 | 0.19 | 0.17 | 0.14 | 0.15 | 0.14 | 0.14 | 0.14
Deviation (walltime): 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01
Categories: NA | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
GFLOPS: 0.00 in every run
Efficiency: 1 in every run
Potential Speed-Up (%): 0 in every run

Name: __GI___pthread_mutex_lock
Module: libc.so.6
Coverage (%), Coverage Excluding Loops (%), Max Inclusive Time Over Threads (s), Max Exclusive Time Over Threads (s), Inclusive Time w.r.t. Wall Time (s), Exclusive Time w.r.t. Wall Time (s): 0.00 in every run
Nb Threads: 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0
Deviation (coverage), Deviation (walltime): 0.00 in every run
Categories: NA | NA | NA | NA | NA | NA | NA | OMP (%): 100.00 | NA | NA
GFLOPS: 0.00 in every run
Efficiency: -
Potential Speed-Up (%): -

Name: kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check()
Module: libomp.so
Coverage (%): 0.00 | 0.06 | 1.49 | 2.41 | 3.17 | 3.63 | 3.95 | 4.27 | 4.11 | 4.05
Coverage Excluding Loops (%): 0.00 | 0.06 | 1.49 | 2.41 | 3.17 | 3.63 | 3.95 | 4.27 | 4.11 | 4.05
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.09 | 0.19 | 0.19 | 0.21 | 0.23 | 0.28 | 0.25 | 0.28
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.09 | 0.19 | 0.19 | 0.21 | 0.23 | 0.28 | 0.25 | 0.28
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.11 | 0.19 | 0.25 | 0.28 | 0.30 | 0.33 | 0.31 | 0.33
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.11 | 0.19 | 0.25 | 0.28 | 0.30 | 0.33 | 0.31 | 0.33
Nb Threads: 0 | 1 | 3 | 7 | 15 | 31 | 47 | 63 | 79 | 95
Deviation (coverage): 0.00 | 0.00 | 1.12 | 0.95 | 0.85 | 0.74 | 0.70 | 0.76 | 0.66 | 0.89
Deviation (walltime): 0.00 | 0.00 | 0.04 | 0.04 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03
Categories: NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
GFLOPS: 0.00 in every run
Efficiency: 1 in every run
Potential Speed-Up (%): 0 in every run

Name: __sched_yield
Module: libc.so.6
Coverage (%): 0.00 | 0.13 | 1.49 | 2.38 | 2.55 | 2.54 | 2.50 | 2.31 | 2.29 | 2.15
Coverage Excluding Loops (%): 0.00 | 0.13 | 1.49 | 2.38 | 2.55 | 2.54 | 2.50 | 2.31 | 2.29 | 2.15
Max Inclusive Time Over Threads (s): 0.00 | 0.01 | 0.10 | 0.14 | 0.14 | 0.19 | 0.15 | 0.14 | 0.16 | 0.13
Max Exclusive Time Over Threads (s): 0.00 | 0.01 | 0.10 | 0.14 | 0.14 | 0.19 | 0.15 | 0.14 | 0.16 | 0.13
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.01 | 0.11 | 0.18 | 0.20 | 0.20 | 0.19 | 0.18 | 0.17 | 0.18
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.01 | 0.11 | 0.18 | 0.20 | 0.20 | 0.19 | 0.18 | 0.17 | 0.18
Nb Threads: 0 | 1 | 3 | 7 | 15 | 31 | 47 | 63 | 79 | 95
Deviation (coverage): 0.00 | 0.00 | 0.88 | 0.47 | 0.41 | 0.55 | 0.49 | 0.50 | 0.54 | 0.56
Deviation (walltime): 0.00 | 0.00 | 0.04 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02
Categories: NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00, System (%): 0.00 | OMP (%): 100.00 | OMP (%): 100.00, System (%): 0.00 | OMP (%): 100.00, System (%): 0.00 | OMP (%): 100.00, System (%): 0.00 | OMP (%): 100.00, System (%): 0.00 | OMP (%): 100.00, System (%): 0.00
GFLOPS: 0.00 in every run
Efficiency: 1 in every run
Potential Speed-Up (%): 0 in every run

Name: void __kmp_suspend_64<false, true>(int, kmp_flag_64<false, true>*)
Module: libomp.so
Coverage (%), Coverage Excluding Loops (%), Max Inclusive Time Over Threads (s), Max Exclusive Time Over Threads (s), Inclusive Time w.r.t. Wall Time (s), Exclusive Time w.r.t. Wall Time (s): 0.00 in every run
Nb Threads: 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0
Deviation (coverage), Deviation (walltime): 0.00 in every run
Categories: NA | NA | NA | NA | NA | NA | NA | NA | OMP (%): 100.00 | NA
GFLOPS: 0.00 in every run
Efficiency: -
Potential Speed-Up (%): -

Name: kmp_flag_64<false, true>::wait(kmp_info*, int, void*)
Module: libomp.so
Coverage (%): 0.00 | 2.97 | 39.85 | 68.82 | 79.30 | 83.98 | 84.99 | 84.70 | 83.44 | 77.65
Coverage Excluding Loops (%): 0.00 | 2.97 | 39.85 | 68.82 | 79.30 | 83.98 | 84.99 | 84.70 | 83.44 | 77.65
Max Inclusive Time Over Threads (s): 0.00 | 0.23 | 2.25 | 3.68 | 3.83 | 3.96 | 3.92 | 3.99 | 4.02 | 3.58
Max Exclusive Time Over Threads (s): 0.00 | 0.23 | 2.25 | 3.68 | 3.83 | 3.96 | 3.92 | 3.99 | 4.02 | 3.58
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.23 | 3.06 | 5.31 | 6.16 | 6.52 | 6.52 | 6.52 | 6.38 | 6.41
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.23 | 3.06 | 5.31 | 6.16 | 6.52 | 6.52 | 6.52 | 6.38 | 6.41
Nb Threads: 0 | 1 | 3 | 7 | 15 | 31 | 47 | 63 | 79 | 95
Deviation (coverage): 0.00 | 0.00 | 1.61 | 1.30 | 1.42 | 1.79 | 1.95 | 2.61 | 3.20 | 7.08
Deviation (walltime): 0.00 | 0.00 | 0.69 | 0.37 | 0.34 | 0.16 | 0.08 | 0.09 | 0.12 | 0.13
Categories: NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
GFLOPS: 0.00 in every run
Efficiency: 1 in every run
Potential Speed-Up (%): 0 in every run

Name: __condvar_dec_grefs
Module: libc.so.6
Coverage (%), Coverage Excluding Loops (%), Max Inclusive Time Over Threads (s), Max Exclusive Time Over Threads (s), Inclusive Time w.r.t. Wall Time (s), Exclusive Time w.r.t. Wall Time (s): 0.00 in every run
Nb Threads: 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1
Deviation (coverage), Deviation (walltime): 0.00 in every run
Categories: NA | NA | NA | NA | NA | NA | NA | NA | NA | OMP (%): 100.00
GFLOPS: 0.00 in every run
Efficiency: -
Potential Speed-Up (%): -