options

Functions and Loops

Columns Filter

Coverage run_1_thread (%) Coverage run_2_threads (%) Coverage run_4_threads (%) Coverage run_8_threads (%) Coverage run_16_threads (%) Coverage run_26_threads (%) Coverage Excluding Loops run_1_thread (%) Coverage Excluding Loops run_2_threads (%) Coverage Excluding Loops run_4_threads (%) Coverage Excluding Loops run_8_threads (%) Coverage Excluding Loops run_16_threads (%) Coverage Excluding Loops run_26_threads (%) Max Inclusive Time Over Threads run_1_thread (s) Max Inclusive Time Over Threads run_2_threads (s) Max Inclusive Time Over Threads run_4_threads (s) Max Inclusive Time Over Threads run_8_threads (s) Max Inclusive Time Over Threads run_16_threads (s) Max Inclusive Time Over Threads run_26_threads (s) Max Exclusive Time Over Threads run_1_thread (s) Max Exclusive Time Over Threads run_2_threads (s) Max Exclusive Time Over Threads run_4_threads (s) Max Exclusive Time Over Threads run_8_threads (s) Max Exclusive Time Over Threads run_16_threads (s) Max Exclusive Time Over Threads run_26_threads (s) Inclusive Time w.r.t. Wall Time run_1_thread (s) Inclusive Time w.r.t. Wall Time run_2_threads (s) Inclusive Time w.r.t. Wall Time run_4_threads (s) Inclusive Time w.r.t. Wall Time run_8_threads (s) Inclusive Time w.r.t. Wall Time run_16_threads (s) Inclusive Time w.r.t. Wall Time run_26_threads (s) Exclusive Time w.r.t. Wall Time run_1_thread (s) Exclusive Time w.r.t. Wall Time run_2_threads (s) Exclusive Time w.r.t. Wall Time run_4_threads (s) Exclusive Time w.r.t. Wall Time run_8_threads (s) Exclusive Time w.r.t. Wall Time run_16_threads (s) Exclusive Time w.r.t. Wall Time run_26_threads (s) Nb Threads run_1_thread Nb Threads run_2_threads Nb Threads run_4_threads Nb Threads run_8_threads Nb Threads run_16_threads Nb Threads run_26_threads Deviation (coverage) run_1_thread Deviation (coverage) run_2_threads Deviation (coverage) run_4_threads Deviation (coverage) run_8_threads Deviation (coverage) run_16_threads Deviation (coverage) run_26_threads Deviation (walltime) run_1_thread Deviation (walltime) run_2_threads Deviation (walltime) run_4_threads Deviation (walltime) run_8_threads Deviation (walltime) run_16_threads Deviation (walltime) run_26_threads Categories run_1_thread Categories run_2_threads Categories run_4_threads Categories run_8_threads Categories run_16_threads Categories run_26_threads GFLOPS run_1_thread GFLOPS run_2_threads GFLOPS run_4_threads GFLOPS run_8_threads GFLOPS run_16_threads GFLOPS run_26_threads Compilation Options (run_1_thread) Efficiency (run_1_thread) Potential Speed-Up (%) (run_2_threads) Efficiency (run_2_threads) Potential Speed-Up (%) (run_4_threads) Efficiency (run_4_threads) Potential Speed-Up (%) (run_8_threads) Efficiency (run_8_threads) Potential Speed-Up (%) (run_16_threads) Efficiency (run_16_threads) Potential Speed-Up (%) (run_26_threads) Efficiency (run_26_threads) Potential Speed-Up (%)
NameModuleCoverage run_1_thread (%)Coverage run_2_threads (%)Coverage run_4_threads (%)Coverage run_8_threads (%)Coverage run_16_threads (%)Coverage run_26_threads (%)Coverage Excluding Loops run_1_thread (%)Coverage Excluding Loops run_2_threads (%)Coverage Excluding Loops run_4_threads (%)Coverage Excluding Loops run_8_threads (%)Coverage Excluding Loops run_16_threads (%)Coverage Excluding Loops run_26_threads (%)Max Inclusive Time Over Threads run_1_thread (s)Max Inclusive Time Over Threads run_2_threads (s)Max Inclusive Time Over Threads run_4_threads (s)Max Inclusive Time Over Threads run_8_threads (s)Max Inclusive Time Over Threads run_16_threads (s)Max Inclusive Time Over Threads run_26_threads (s)Max Exclusive Time Over Threads run_1_thread (s)Max Exclusive Time Over Threads run_2_threads (s)Max Exclusive Time Over Threads run_4_threads (s)Max Exclusive Time Over Threads run_8_threads (s)Max Exclusive Time Over Threads run_16_threads (s)Max Exclusive Time Over Threads run_26_threads (s)Inclusive Time w.r.t. Wall Time run_1_thread (s)Inclusive Time w.r.t. Wall Time run_2_threads (s)Inclusive Time w.r.t. Wall Time run_4_threads (s)Inclusive Time w.r.t. Wall Time run_8_threads (s)Inclusive Time w.r.t. Wall Time run_16_threads (s)Inclusive Time w.r.t. Wall Time run_26_threads (s)Exclusive Time w.r.t. Wall Time run_1_thread (s)Exclusive Time w.r.t. Wall Time run_2_threads (s)Exclusive Time w.r.t. Wall Time run_4_threads (s)Exclusive Time w.r.t. Wall Time run_8_threads (s)Exclusive Time w.r.t. Wall Time run_16_threads (s)Exclusive Time w.r.t. Wall Time run_26_threads (s)Nb Threads run_1_threadNb Threads run_2_threadsNb Threads run_4_threadsNb Threads run_8_threadsNb Threads run_16_threadsNb Threads run_26_threadsDeviation (coverage) run_1_threadDeviation (coverage) run_2_threadsDeviation (coverage) run_4_threadsDeviation (coverage) run_8_threadsDeviation (coverage) run_16_threadsDeviation (coverage) run_26_threadsDeviation (walltime) run_1_threadDeviation (walltime) run_2_threadsDeviation (walltime) run_4_threadsDeviation (walltime) run_8_threadsDeviation (walltime) run_16_threadsDeviation (walltime) run_26_threadsCategories run_1_threadCategories run_2_threadsCategories run_4_threadsCategories run_8_threadsCategories run_16_threadsCategories run_26_threadsGFLOPS run_1_threadGFLOPS run_2_threadsGFLOPS run_4_threadsGFLOPS run_8_threadsGFLOPS run_16_threadsGFLOPS run_26_threadsCompilation Options(run_1_thread) Efficiency(run_1_thread) Potential Speed-Up (%)(run_2_threads) Efficiency(run_2_threads) Potential Speed-Up (%)(run_4_threads) Efficiency(run_4_threads) Potential Speed-Up (%)(run_8_threads) Efficiency(run_8_threads) Potential Speed-Up (%)(run_16_threads) Efficiency(run_16_threads) Potential Speed-Up (%)(run_26_threads) Efficiency(run_26_threads) Potential Speed-Up (%)
k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone ._omp_fn.0]+kmeans-gcc-O3-funroll-soa95.8295.2595.1694.3792.8591.000.000.000.000.000.000.00248.73114.8162.1831.1115.559.580.000.000.000.000.000.00248.73119.6569.4639.5724.4818.550.000.000.000.000.000.00124816260.005.736.938.499.399.330.000.020.020.010.010.01Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.003.026.2710.8018.9530.6340.42GNU C++14 15.1.1 20250425 -march=skylake-avx512 -mmmx -mpopcnt -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mavx -mavx2 -mno-sse4a -mno-fma4 -mno-xop -mfma -mavx512f -mbmi -mbmi2 -maes -mpclmul -mavx512vl -mavx512bw -mavx512dq -mavx512cd -mno-avx512vbmi ...101.0400.99.980.7920.220.6433.890.5244.07
Loop 3 - main_soa.cpp:58-69 - kmeans-gcc-O3-funroll-soa+95.8295.2595.1694.3792.8591.005.044.695.024.904.834.82248.73115.0062.2931.3015.699.7613.085.763.341.750.870.60248.73119.6569.4639.5724.4818.5513.085.893.662.061.270.98124816260.000.410.410.520.520.590.000.160.050.070.040.052.544.448.8215.6425.4532.43101.1100.890.540.810.641.730.512.35
Loop 2 - main_soa.cpp:62-66 - kmeans-gcc-O3-funroll-soa90.7890.5690.1489.4788.0286.1890.7890.5690.1489.4788.0286.18235.65109.2458.9429.5514.819.16235.65109.2458.9429.5514.819.16235.65113.7665.8037.5123.2117.57235.65113.7665.8037.5123.2117.57124816260.005.316.528.028.928.940.000.140.060.080.040.053.046.3610.9119.1330.9240.87101.0400.99.430.7919.220.6332.160.5241.71
Loop 4 - main_soa.cpp:62-69 - kmeans-gcc-O3-funroll-soa [...]0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
k_means(int, point_t&, point_t&, int*, point_t&, int, int)+kmeans-gcc-O3-funroll-soa4.174.494.144.104.043.950.000.000.000.000.000.0010.8210.8210.8110.8210.8110.810.000.000.000.000.000.0010.825.643.021.721.060.810.000.000.000.000.000.001111110.000.000.000.000.000.000.000.000.000.000.000.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.000.460.891.662.914.706.21GNU C++14 15.1.1 20250425 -march=skylake-avx512 -mmmx -mpopcnt -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mavx -mavx2 -mno-sse4a -mno-fma4 -mno-xop -mfma -mavx512f -mbmi -mbmi2 -maes -mpclmul -mavx512vl -mavx512bw -mavx512dq -mavx512cd -mno-avx512vbmi ...100.960.180.90.430.790.880.641.470.521.91
Loop 6 - main_soa.cpp:56-95 - kmeans-gcc-O3-funroll-soa [...]+4.174.494.144.104.043.950.000.000.000.000.000.0010.8210.8210.8110.8210.8110.810.000.000.000.000.000.0010.825.643.021.721.060.810.000.000.000.000.000.000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 7 - main_soa.cpp:86-93 - kmeans-gcc-O3-funroll-soa0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 8 - main_soa.cpp:81-84 - kmeans-gcc-O3-funroll-soa4.174.494.144.104.043.954.174.494.144.104.043.9510.8210.8210.8110.8210.8110.8110.8210.8210.8110.8210.8110.8110.825.643.021.721.060.8110.825.643.021.721.060.811111110.000.000.000.000.000.000.000.000.000.000.000.000.460.891.662.914.706.21100.960.180.90.430.790.880.641.470.521.91
unknown_kernel_regionkernel0.010.010.010.010.010.000.000.000.000.000.000.000.030.010.020.010.010.000.000.000.000.000.000.000.030.010.010.000.000.000.000.000.000.000.000.001135220.000.000.010.010.020.030.000.000.010.000.000.00System (%): 100.00System (%): 100.00System (%): 100.00System (%): 100.00System (%): 100.00System (%): 100.000.330.000.000.000.0026.82
gomp_team_barrier_wait_endlibgomp.so.1.0.00.000.040.070.100.120.120.000.040.070.100.120.120.000.080.070.040.030.030.000.080.070.040.030.030.000.040.050.040.030.030.000.040.050.040.030.03013815260.000.000.010.040.030.060.000.000.010.010.010.01NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.00101010101010
gomp_barrier_wait_endlibgomp.so.1.0.00.000.220.611.412.994.930.000.220.611.412.994.930.000.520.540.550.540.560.000.520.540.550.540.560.000.270.450.590.791.000.000.270.450.590.791.00013715250.000.000.020.040.040.080.000.000.010.010.010.01NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.00101010101010
×