options

Functions and Loops

Columns Filter

Coverage run_1_thread (%) Coverage run_2_threads (%) Coverage run_4_threads (%) Coverage run_8_threads (%) Coverage run_16_threads (%) Coverage run_32_threads (%) Coverage run_48_threads (%) Coverage run_64_threads (%) Coverage Excluding Loops run_1_thread (%) Coverage Excluding Loops run_2_threads (%) Coverage Excluding Loops run_4_threads (%) Coverage Excluding Loops run_8_threads (%) Coverage Excluding Loops run_16_threads (%) Coverage Excluding Loops run_32_threads (%) Coverage Excluding Loops run_48_threads (%) Coverage Excluding Loops run_64_threads (%) Max Inclusive Time Over Threads run_1_thread (s) Max Inclusive Time Over Threads run_2_threads (s) Max Inclusive Time Over Threads run_4_threads (s) Max Inclusive Time Over Threads run_8_threads (s) Max Inclusive Time Over Threads run_16_threads (s) Max Inclusive Time Over Threads run_32_threads (s) Max Inclusive Time Over Threads run_48_threads (s) Max Inclusive Time Over Threads run_64_threads (s) Max Exclusive Time Over Threads run_1_thread (s) Max Exclusive Time Over Threads run_2_threads (s) Max Exclusive Time Over Threads run_4_threads (s) Max Exclusive Time Over Threads run_8_threads (s) Max Exclusive Time Over Threads run_16_threads (s) Max Exclusive Time Over Threads run_32_threads (s) Max Exclusive Time Over Threads run_48_threads (s) Max Exclusive Time Over Threads run_64_threads (s) Inclusive Time w.r.t. Wall Time run_1_thread (s) Inclusive Time w.r.t. Wall Time run_2_threads (s) Inclusive Time w.r.t. Wall Time run_4_threads (s) Inclusive Time w.r.t. Wall Time run_8_threads (s) Inclusive Time w.r.t. Wall Time run_16_threads (s) Inclusive Time w.r.t. Wall Time run_32_threads (s) Inclusive Time w.r.t. Wall Time run_48_threads (s) Inclusive Time w.r.t. Wall Time run_64_threads (s) Exclusive Time w.r.t. Wall Time run_1_thread (s) Exclusive Time w.r.t. Wall Time run_2_threads (s) Exclusive Time w.r.t. Wall Time run_4_threads (s) Exclusive Time w.r.t. Wall Time run_8_threads (s) Exclusive Time w.r.t. Wall Time run_16_threads (s) Exclusive Time w.r.t. Wall Time run_32_threads (s) Exclusive Time w.r.t. Wall Time run_48_threads (s) Exclusive Time w.r.t. Wall Time run_64_threads (s) Nb Threads run_1_thread Nb Threads run_2_threads Nb Threads run_4_threads Nb Threads run_8_threads Nb Threads run_16_threads Nb Threads run_32_threads Nb Threads run_48_threads Nb Threads run_64_threads Deviation (coverage) run_1_thread Deviation (coverage) run_2_threads Deviation (coverage) run_4_threads Deviation (coverage) run_8_threads Deviation (coverage) run_16_threads Deviation (coverage) run_32_threads Deviation (coverage) run_48_threads Deviation (coverage) run_64_threads Deviation (walltime) run_1_thread Deviation (walltime) run_2_threads Deviation (walltime) run_4_threads Deviation (walltime) run_8_threads Deviation (walltime) run_16_threads Deviation (walltime) run_32_threads Deviation (walltime) run_48_threads Deviation (walltime) run_64_threads Categories run_1_thread Categories run_2_threads Categories run_4_threads Categories run_8_threads Categories run_16_threads Categories run_32_threads Categories run_48_threads Categories run_64_threads GFLOPS run_1_thread GFLOPS run_2_threads GFLOPS run_4_threads GFLOPS run_8_threads GFLOPS run_16_threads GFLOPS run_32_threads GFLOPS run_48_threads GFLOPS run_64_threads Compilation Options (run_1_thread) Efficiency (run_1_thread) Potential Speed-Up (%) (run_2_threads) Efficiency (run_2_threads) Potential Speed-Up (%) (run_4_threads) Efficiency (run_4_threads) Potential Speed-Up (%) (run_8_threads) Efficiency (run_8_threads) Potential Speed-Up (%) (run_16_threads) Efficiency (run_16_threads) Potential Speed-Up (%) (run_32_threads) Efficiency (run_32_threads) Potential Speed-Up (%) (run_48_threads) Efficiency (run_48_threads) Potential Speed-Up (%) (run_64_threads) Efficiency (run_64_threads) Potential Speed-Up (%)
NameModuleCoverage run_1_thread (%)Coverage run_2_threads (%)Coverage run_4_threads (%)Coverage run_8_threads (%)Coverage run_16_threads (%)Coverage run_32_threads (%)Coverage run_48_threads (%)Coverage run_64_threads (%)Coverage Excluding Loops run_1_thread (%)Coverage Excluding Loops run_2_threads (%)Coverage Excluding Loops run_4_threads (%)Coverage Excluding Loops run_8_threads (%)Coverage Excluding Loops run_16_threads (%)Coverage Excluding Loops run_32_threads (%)Coverage Excluding Loops run_48_threads (%)Coverage Excluding Loops run_64_threads (%)Max Inclusive Time Over Threads run_1_thread (s)Max Inclusive Time Over Threads run_2_threads (s)Max Inclusive Time Over Threads run_4_threads (s)Max Inclusive Time Over Threads run_8_threads (s)Max Inclusive Time Over Threads run_16_threads (s)Max Inclusive Time Over Threads run_32_threads (s)Max Inclusive Time Over Threads run_48_threads (s)Max Inclusive Time Over Threads run_64_threads (s)Max Exclusive Time Over Threads run_1_thread (s)Max Exclusive Time Over Threads run_2_threads (s)Max Exclusive Time Over Threads run_4_threads (s)Max Exclusive Time Over Threads run_8_threads (s)Max Exclusive Time Over Threads run_16_threads (s)Max Exclusive Time Over Threads run_32_threads (s)Max Exclusive Time Over Threads run_48_threads (s)Max Exclusive Time Over Threads run_64_threads (s)Inclusive Time w.r.t. Wall Time run_1_thread (s)Inclusive Time w.r.t. Wall Time run_2_threads (s)Inclusive Time w.r.t. Wall Time run_4_threads (s)Inclusive Time w.r.t. Wall Time run_8_threads (s)Inclusive Time w.r.t. Wall Time run_16_threads (s)Inclusive Time w.r.t. Wall Time run_32_threads (s)Inclusive Time w.r.t. Wall Time run_48_threads (s)Inclusive Time w.r.t. Wall Time run_64_threads (s)Exclusive Time w.r.t. Wall Time run_1_thread (s)Exclusive Time w.r.t. Wall Time run_2_threads (s)Exclusive Time w.r.t. Wall Time run_4_threads (s)Exclusive Time w.r.t. Wall Time run_8_threads (s)Exclusive Time w.r.t. Wall Time run_16_threads (s)Exclusive Time w.r.t. Wall Time run_32_threads (s)Exclusive Time w.r.t. Wall Time run_48_threads (s)Exclusive Time w.r.t. Wall Time run_64_threads (s)Nb Threads run_1_threadNb Threads run_2_threadsNb Threads run_4_threadsNb Threads run_8_threadsNb Threads run_16_threadsNb Threads run_32_threadsNb Threads run_48_threadsNb Threads run_64_threadsDeviation (coverage) run_1_threadDeviation (coverage) run_2_threadsDeviation (coverage) run_4_threadsDeviation (coverage) run_8_threadsDeviation (coverage) run_16_threadsDeviation (coverage) run_32_threadsDeviation (coverage) run_48_threadsDeviation (coverage) run_64_threadsDeviation (walltime) run_1_threadDeviation (walltime) run_2_threadsDeviation (walltime) run_4_threadsDeviation (walltime) run_8_threadsDeviation (walltime) run_16_threadsDeviation (walltime) run_32_threadsDeviation (walltime) run_48_threadsDeviation (walltime) run_64_threadsCategories run_1_threadCategories run_2_threadsCategories run_4_threadsCategories run_8_threadsCategories run_16_threadsCategories run_32_threadsCategories run_48_threadsCategories run_64_threadsGFLOPS run_1_threadGFLOPS run_2_threadsGFLOPS run_4_threadsGFLOPS run_8_threadsGFLOPS run_16_threadsGFLOPS run_32_threadsGFLOPS run_48_threadsGFLOPS run_64_threadsCompilation Options(run_1_thread) Efficiency(run_1_thread) Potential Speed-Up (%)(run_2_threads) Efficiency(run_2_threads) Potential Speed-Up (%)(run_4_threads) Efficiency(run_4_threads) Potential Speed-Up (%)(run_8_threads) Efficiency(run_8_threads) Potential Speed-Up (%)(run_16_threads) Efficiency(run_16_threads) Potential Speed-Up (%)(run_32_threads) Efficiency(run_32_threads) Potential Speed-Up (%)(run_48_threads) Efficiency(run_48_threads) Potential Speed-Up (%)(run_64_threads) Efficiency(run_64_threads) Potential Speed-Up (%)
k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone ._omp_fn.0]+kmeans-gcc-O3-vectorize-soa95.7495.7495.7495.7395.7395.6995.6395.570.000.000.000.000.000.000.000.00202.51101.3050.7125.4112.846.424.293.230.000.000.000.000.000.000.000.00202.51105.6057.1732.9520.9214.7712.7311.730.000.000.000.000.000.000.000.001248163248640.005.777.539.2510.2910.319.779.210.000.210.050.030.020.020.010.01Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.004.328.1615.3126.4841.7558.5068.1774.49GNU C++14 14.2.0 -mlittle-endian -mabi=lp64 -mcpu=neoverse-v1+sm4+crc+aes+sha3 -g -O3 -std=c++14 -fno-omit-frame-pointer -fopenmp -ftree-vectorize 100.963.940.8910.960.7722.190.637.820.4354.690.3363.940.2769.78
Loop 6 - main_soa.cpp:58-69 - kmeans-gcc-O3-vectorize-soa+95.7495.7495.7495.7395.7395.6995.6395.570.000.000.000.000.000.000.000.00202.51101.3050.9825.7213.106.604.483.390.000.000.000.000.000.000.000.00202.51105.6057.1732.9520.9214.7712.7311.730.000.000.000.000.000.000.000.00000000100.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 7 - main_soa.cpp:62-69 - kmeans-gcc-O3-vectorize-soa [...]0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 5 - main_soa.cpp:60-69 - kmeans-gcc-O3-vectorize-soa+95.7495.7495.7495.7395.7395.6995.6395.578.809.078.688.849.148.688.818.81202.51101.3050.9825.7213.106.604.483.3918.629.634.682.531.330.680.490.40202.51105.6057.1732.9520.9214.7712.7311.7318.6210.005.193.042.001.341.171.081248163248640.000.490.900.991.171.281.361.450.000.070.120.100.080.050.040.044.317.6114.4024.7138.1955.9165.6870.27100.930.630.90.890.772.070.583.810.434.910.335.90.276.44
Loop 4 - main_soa.cpp:62-66 - kmeans-gcc-O3-vectorize-soa86.9486.6787.0686.9086.5987.0286.8286.7686.9486.6787.0686.9086.5987.0286.8286.76183.8991.6746.2923.1911.775.923.992.98183.8991.6746.2923.1911.775.923.992.98183.8995.6051.9829.9118.9313.4311.5610.65183.8995.6051.9829.9118.9313.4311.5610.651248163248640.005.276.638.359.339.348.948.440.000.130.160.120.090.060.050.044.328.2115.4026.6642.1258.7668.4374.92100.963.310.8810.070.7720.120.6134.010.4349.780.3358.040.2763.34
k_means(int, point_t&, point_t&, int*, point_t&, int, int)+kmeans-gcc-O3-vectorize-soa4.264.264.254.254.214.224.224.210.000.000.000.000.000.000.000.009.028.999.009.009.009.019.029.050.000.000.000.000.000.000.000.009.024.702.541.460.920.650.560.520.000.000.000.000.000.000.000.00111111110.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.000.551.061.973.425.447.688.919.68GNU C++14 14.2.0 -mlittle-endian -mabi=lp64 -mcpu=neoverse-v1+sm4+crc+aes+sha3 -g -O3 -std=c++14 -fno-omit-frame-pointer -fopenmp -ftree-vectorize100.960.170.890.480.770.970.611.630.432.390.332.80.273.06
Loop 16 - main_soa.cpp:56-96 - kmeans-gcc-O3-vectorize-soa [...]+4.264.264.254.254.214.224.224.210.000.000.000.000.000.000.000.009.028.999.009.009.009.019.029.050.000.000.000.000.000.000.000.009.024.702.541.460.920.650.560.520.000.000.000.000.000.000.000.00000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 12 - main_soa.cpp:56-95 - kmeans-gcc-O3-vectorize-soa [...]+4.264.264.254.254.214.224.224.210.000.000.000.000.000.000.000.009.028.999.009.009.009.019.029.050.000.000.000.000.000.000.000.009.024.702.541.460.920.650.560.520.000.000.000.000.000.000.000.00000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 15 - main_soa.cpp:84-84 - kmeans-gcc-O3-vectorize-soa0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 14 - main_soa.cpp:56-95 - kmeans-gcc-O3-vectorize-soa [...]+0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 13 - main_soa.cpp:86-93 - kmeans-gcc-O3-vectorize-soa0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 11 - main_soa.cpp:81-84 - kmeans-gcc-O3-vectorize-soa4.264.264.254.254.214.224.224.214.264.264.254.254.214.224.224.219.028.999.009.009.009.019.029.059.028.999.009.009.009.019.029.059.024.702.541.460.920.650.560.529.024.702.541.460.920.650.560.52111111110.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.551.061.973.425.447.688.919.68100.960.170.890.480.770.970.611.630.432.390.332.80.273.06
gomp_team_barrier_wait_endlibgomp.so.1.0.00.000.000.000.010.040.040.080.100.000.000.000.010.040.040.080.100.000.000.000.010.010.010.010.010.000.000.000.010.010.010.010.010.000.000.000.000.010.010.010.010.000.000.000.000.010.010.010.010003101324360.000.000.000.010.030.040.070.080.000.000.000.000.000.000.000.00NANANAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.000.00
_dl_fatal_printfld-linux-aarch64.so.10.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000100.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANANANASystem (%): 100.00NA0.000.000.000.000.000.000.000.00
×