Functions and Loops

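Rows (the metric tables that follow use the same row order, with loop names shortened to their ID and source range)

| Name | Module |
|---|---|
| k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone ._omp_fn.0] | kmeans-gcc-O3-funroll-soa |
| Loop 3 - main_soa.cpp:58-69 - kmeans-gcc-O3-funroll-soa | |
| Loop 2 - main_soa.cpp:62-66 - kmeans-gcc-O3-funroll-soa | |
| Loop 4 - main_soa.cpp:62-69 - kmeans-gcc-O3-funroll-soa [...] | |
| k_means(int, point_t&, point_t&, int*, point_t&, int, int) | kmeans-gcc-O3-funroll-soa |
| Loop 6 - main_soa.cpp:56-95 - kmeans-gcc-O3-funroll-soa [...] | |
| Loop 7 - main_soa.cpp:86-93 - kmeans-gcc-O3-funroll-soa | |
| Loop 8 - main_soa.cpp:81-84 - kmeans-gcc-O3-funroll-soa | |
| unknown_kernel_region | kernel |
| gomp_barrier_wait_end | libgomp.so.1.0.0 |
| gomp_team_barrier_wait_end | libgomp.so.1.0.0 |

Coverage (%)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
|---|---|---|---|---|---|
| k_means [clone ._omp_fn.0] | 95.46 | 95.71 | 95.41 | 95.45 | 95.28 |
| Loop 3 - main_soa.cpp:58-69 | 95.46 | 95.71 | 95.41 | 95.45 | 95.28 |
| Loop 2 - main_soa.cpp:62-66 | 90.46 | 90.88 | 90.40 | 90.30 | 90.17 |
| Loop 4 - main_soa.cpp:62-69 [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| k_means | 4.52 | 4.14 | 4.20 | 3.77 | 3.76 |
| Loop 6 - main_soa.cpp:56-95 [...] | 4.52 | 4.14 | 4.20 | 3.77 | 3.76 |
| Loop 7 - main_soa.cpp:86-93 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 8 - main_soa.cpp:81-84 | 4.52 | 4.14 | 4.20 | 3.77 | 3.76 |
| unknown_kernel_region | 0.02 | 0.02 | 0.00 | 0.02 | 0.01 |
| gomp_barrier_wait_end | 0.00 | 0.07 | 0.21 | 0.44 | 0.57 |
| gomp_team_barrier_wait_end | 0.00 | 0.06 | 0.17 | 0.31 | 0.37 |

Coverage Excluding Loops (%)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
|---|---|---|---|---|---|
| k_means [clone ._omp_fn.0] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 3 - main_soa.cpp:58-69 | 5.00 | 4.83 | 5.01 | 5.15 | 5.12 |
| Loop 2 - main_soa.cpp:62-66 | 90.46 | 90.88 | 90.40 | 90.30 | 90.17 |
| Loop 4 - main_soa.cpp:62-69 [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| k_means | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 6 - main_soa.cpp:56-95 [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 7 - main_soa.cpp:86-93 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 8 - main_soa.cpp:81-84 | 4.52 | 4.14 | 4.20 | 3.77 | 3.76 |
| unknown_kernel_region | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_barrier_wait_end | 0.00 | 0.07 | 0.21 | 0.44 | 0.57 |
| gomp_team_barrier_wait_end | 0.00 | 0.06 | 0.17 | 0.31 | 0.37 |

Max Inclusive Time Over Threads (s)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
|---|---|---|---|---|---|
| k_means [clone ._omp_fn.0] | 151.06 | 82.59 | 40.58 | 22.70 | 18.19 |
| Loop 3 - main_soa.cpp:58-69 | 151.06 | 82.63 | 40.78 | 22.89 | 18.45 |
| Loop 2 - main_soa.cpp:62-66 | 143.15 | 78.45 | 38.52 | 21.49 | 17.28 |
| Loop 4 - main_soa.cpp:62-69 [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| k_means | 7.15 | 7.13 | 7.14 | 7.15 | 7.16 |
| Loop 6 - main_soa.cpp:56-95 [...] | 7.15 | 7.13 | 7.14 | 7.15 | 7.16 |
| Loop 7 - main_soa.cpp:86-93 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 8 - main_soa.cpp:81-84 | 7.15 | 7.13 | 7.14 | 7.15 | 7.16 |
| unknown_kernel_region | 0.03 | 0.02 | 0.00 | 0.01 | 0.01 |
| gomp_barrier_wait_end | 0.00 | 0.12 | 0.13 | 0.13 | 0.13 |
| gomp_team_barrier_wait_end | 0.00 | 0.11 | 0.12 | 0.11 | 0.12 |

Max Exclusive Time Over Threads (s)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
|---|---|---|---|---|---|
| k_means [clone ._omp_fn.0] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 3 - main_soa.cpp:58-69 | 7.90 | 4.18 | 2.27 | 1.40 | 1.17 |
| Loop 2 - main_soa.cpp:62-66 | 143.15 | 78.45 | 38.52 | 21.49 | 17.28 |
| Loop 4 - main_soa.cpp:62-69 [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| k_means | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 6 - main_soa.cpp:56-95 [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 7 - main_soa.cpp:86-93 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 8 - main_soa.cpp:81-84 | 7.15 | 7.13 | 7.14 | 7.15 | 7.16 |
| unknown_kernel_region | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_barrier_wait_end | 0.00 | 0.12 | 0.13 | 0.13 | 0.13 |
| gomp_team_barrier_wait_end | 0.00 | 0.11 | 0.12 | 0.11 | 0.12 |

Inclusive Time w.r.t. Wall Time (s)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
|---|---|---|---|---|---|
| k_means [clone ._omp_fn.0] | 151.06 | 85.84 | 45.49 | 28.45 | 24.12 |
| Loop 3 - main_soa.cpp:58-69 | 151.06 | 85.84 | 45.49 | 28.45 | 24.12 |
| Loop 2 - main_soa.cpp:62-66 | 143.15 | 81.51 | 43.10 | 26.91 | 22.82 |
| Loop 4 - main_soa.cpp:62-69 [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| k_means | 7.15 | 3.71 | 2.00 | 1.12 | 0.95 |
| Loop 6 - main_soa.cpp:56-95 [...] | 7.15 | 3.71 | 2.00 | 1.12 | 0.95 |
| Loop 7 - main_soa.cpp:86-93 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 8 - main_soa.cpp:81-84 | 7.15 | 3.71 | 2.00 | 1.12 | 0.95 |
| unknown_kernel_region | 0.03 | 0.02 | 0.00 | 0.01 | 0.00 |
| gomp_barrier_wait_end | 0.00 | 0.06 | 0.10 | 0.13 | 0.14 |
| gomp_team_barrier_wait_end | 0.00 | 0.06 | 0.08 | 0.09 | 0.09 |

Exclusive Time w.r.t. Wall Time (s)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
|---|---|---|---|---|---|
| k_means [clone ._omp_fn.0] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 3 - main_soa.cpp:58-69 | 7.90 | 4.33 | 2.39 | 1.54 | 1.29 |
| Loop 2 - main_soa.cpp:62-66 | 143.15 | 81.51 | 43.10 | 26.91 | 22.82 |
| Loop 4 - main_soa.cpp:62-69 [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| k_means | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 6 - main_soa.cpp:56-95 [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 7 - main_soa.cpp:86-93 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 8 - main_soa.cpp:81-84 | 7.15 | 3.71 | 2.00 | 1.12 | 0.95 |
| unknown_kernel_region | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_barrier_wait_end | 0.00 | 0.06 | 0.10 | 0.13 | 0.14 |
| gomp_team_barrier_wait_end | 0.00 | 0.06 | 0.08 | 0.09 | 0.09 |

Nb Threads

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
|---|---|---|---|---|---|
| k_means [clone ._omp_fn.0] | 1 | 2 | 4 | 8 | 10 |
| Loop 3 - main_soa.cpp:58-69 | 1 | 2 | 4 | 8 | 10 |
| Loop 2 - main_soa.cpp:62-66 | 1 | 2 | 4 | 8 | 10 |
| Loop 4 - main_soa.cpp:62-69 [...] | 0 | 0 | 0 | 0 | 0 |
| k_means | 1 | 1 | 1 | 1 | 1 |
| Loop 6 - main_soa.cpp:56-95 [...] | 0 | 0 | 0 | 0 | 0 |
| Loop 7 - main_soa.cpp:86-93 | 0 | 0 | 0 | 0 | 0 |
| Loop 8 - main_soa.cpp:81-84 | 1 | 1 | 1 | 1 | 1 |
| unknown_kernel_region | 1 | 2 | 1 | 4 | 4 |
| gomp_barrier_wait_end | 0 | 1 | 3 | 7 | 9 |
| gomp_team_barrier_wait_end | 0 | 1 | 4 | 7 | 10 |

Deviation (coverage)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
|---|---|---|---|---|---|
| k_means [clone ._omp_fn.0] | 0.00 | 5.60 | 7.40 | 8.33 | 8.77 |
| Loop 3 - main_soa.cpp:58-69 | 0.00 | 0.24 | 0.24 | 0.32 | 0.66 |
| Loop 2 - main_soa.cpp:62-66 | 0.00 | 5.36 | 7.22 | 8.13 | 8.34 |
| Loop 4 - main_soa.cpp:62-69 [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| k_means | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 6 - main_soa.cpp:56-95 [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 7 - main_soa.cpp:86-93 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 8 - main_soa.cpp:81-84 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| unknown_kernel_region | 0.00 | 0.01 | 0.00 | 0.02 | 0.02 |
| gomp_barrier_wait_end | 0.00 | 0.00 | 0.01 | 0.03 | 0.02 |
| gomp_team_barrier_wait_end | 0.00 | 0.00 | 0.11 | 0.06 | 0.14 |

Deviation (walltime)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
|---|---|---|---|---|---|
| k_means [clone ._omp_fn.0] | 0.00 | 0.11 | 0.07 | 0.05 | 0.05 |
| Loop 3 - main_soa.cpp:58-69 | 0.00 | 0.03 | 0.12 | 0.09 | 0.09 |
| Loop 2 - main_soa.cpp:62-66 | 0.00 | 0.14 | 0.17 | 0.12 | 0.11 |
| Loop 4 - main_soa.cpp:62-69 [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| k_means | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 6 - main_soa.cpp:56-95 [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 7 - main_soa.cpp:86-93 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 8 - main_soa.cpp:81-84 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| unknown_kernel_region | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 |
| gomp_barrier_wait_end | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 |
| gomp_team_barrier_wait_end | 0.00 | 0.00 | 0.05 | 0.02 | 0.03 |

Categories (reported for functions only)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
|---|---|---|---|---|---|
| k_means [clone ._omp_fn.0] | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 |
| k_means | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 |
| unknown_kernel_region | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 |
| gomp_barrier_wait_end | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 |
| gomp_team_barrier_wait_end | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 |

GFLOPS

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
|---|---|---|---|---|---|
| k_means [clone ._omp_fn.0] | 4.96 | 8.74 | 16.49 | 26.36 | 31.10 |
| Loop 3 - main_soa.cpp:58-69 | 3.32 | 7.57 | 11.14 | 20.80 | 25.14 |
| Loop 2 - main_soa.cpp:62-66 | 5.06 | 8.80 | 16.78 | 26.68 | 31.43 |
| Loop 4 - main_soa.cpp:62-69 [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| k_means | 0.70 | 1.35 | 2.50 | 4.45 | 5.25 |
| Loop 6 - main_soa.cpp:56-95 [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 7 - main_soa.cpp:86-93 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 8 - main_soa.cpp:81-84 | 0.70 | 1.35 | 2.50 | 4.45 | 5.25 |
| unknown_kernel_region | 0.00 | 0.64 | 0.00 | 1.82 | 0.00 |
| gomp_barrier_wait_end | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_team_barrier_wait_end | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |

Compilation Options (reported for both k_means entries; truncated in the source)

GNU C++14 15.1.1 20250425 -march=cascadelake -mmmx -mpopcnt -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mavx -mavx2 -mno-sse4a -mno-fma4 -mno-xop -mfma -mavx512f -mbmi -mbmi2 -maes -mpclmul -mavx512vl -mavx512bw -mavx512dq -mavx512cd -mno-avx512vbmi -mn...

Efficiency (rows with no reported value are omitted)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
|---|---|---|---|---|---|
| k_means [clone ._omp_fn.0] | 1 | 0.88 | 0.83 | 0.66 | 0.63 |
| Loop 3 - main_soa.cpp:58-69 | 1 | 0.91 | 0.83 | 0.64 | 0.61 |
| Loop 2 - main_soa.cpp:62-66 | 1 | 0.88 | 0.83 | 0.66 | 0.63 |
| k_means | 1 | 0.96 | 0.89 | 0.8 | 0.75 |
| Loop 8 - main_soa.cpp:81-84 | 1 | 0.96 | 0.89 | 0.8 | 0.75 |
| gomp_barrier_wait_end | 1 | 1 | 1 | 1 | 1 |
| gomp_team_barrier_wait_end | 1 | 1 | 1 | 1 | 1 |

Potential Speed-Up (%) (rows with no reported value are omitted)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
|---|---|---|---|---|---|
| k_means [clone ._omp_fn.0] | 0 | 11.5 | 16.2 | 32.1 | 35.6 |
| Loop 3 - main_soa.cpp:58-69 | 0 | 0.42 | 0.86 | 1.84 | 1.99 |
| Loop 2 - main_soa.cpp:62-66 | 0 | 11.07 | 15.34 | 30.26 | 33.61 |
| k_means | 0 | 0.15 | 0.45 | 0.77 | 0.93 |
| Loop 8 - main_soa.cpp:81-84 | 0 | 0.15 | 0.45 | 0.77 | 0.93 |
| gomp_barrier_wait_end | 0 | 0 | 0 | 0 | 0 |
| gomp_team_barrier_wait_end | 0 | 0 | 0 | 0 | 0 |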
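
The Efficiency values reported for k_means [clone ._omp_fn.0] appear consistent with the usual strong-scaling definition: reference time divided by (thread count times run time), applied to the Inclusive Time w.r.t. Wall Time figures. This is an inference from the numbers in the report, not a statement of how the profiler computes the column. A minimal sketch, where `parallel_efficiency` is a hypothetical helper name:

```python
# Inclusive Time w.r.t. Wall Time (s) for k_means [clone ._omp_fn.0],
# copied from the report, one value per run.
threads = [1, 2, 4, 8, 10]
wall_time = [151.06, 85.84, 45.49, 28.45, 24.12]


def parallel_efficiency(t_ref, t_run, n_threads):
    """Strong-scaling efficiency: speed-up over the reference run divided by thread count.

    Assumption: the 1-thread run is the reference. The report does not state this.
    """
    return t_ref / (n_threads * t_run)


efficiencies = [
    round(parallel_efficiency(wall_time[0], t, n), 2)
    for n, t in zip(threads, wall_time)
]
print(efficiencies)  # [1.0, 0.88, 0.83, 0.66, 0.63]
```

The rounded results match the Efficiency entries for that function (1, 0.88, 0.83, 0.66, 0.63), which supports the assumed definition for this row.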