options

Loops Index

Columns Filter

Level Exclusive Coverage run_1_thread (%) Exclusive Coverage run_2_threads (%) Exclusive Coverage run_4_threads (%) Exclusive Coverage run_8_threads (%) Exclusive Coverage run_10_threads (%) Inclusive Coverage run_1_thread (%) Inclusive Coverage run_2_threads (%) Inclusive Coverage run_4_threads (%) Inclusive Coverage run_8_threads (%) Inclusive Coverage run_10_threads (%) Max Exclusive Time Over Threads run_1_thread (s) Max Exclusive Time Over Threads run_2_threads (s) Max Exclusive Time Over Threads run_4_threads (s) Max Exclusive Time Over Threads run_8_threads (s) Max Exclusive Time Over Threads run_10_threads (s) Max Inclusive Time Over Threads run_1_thread (s) Max Inclusive Time Over Threads run_2_threads (s) Max Inclusive Time Over Threads run_4_threads (s) Max Inclusive Time Over Threads run_8_threads (s) Max Inclusive Time Over Threads run_10_threads (s) Exclusive Time w.r.t. Wall Time run_1_thread (s) Exclusive Time w.r.t. Wall Time run_2_threads (s) Exclusive Time w.r.t. Wall Time run_4_threads (s) Exclusive Time w.r.t. Wall Time run_8_threads (s) Exclusive Time w.r.t. Wall Time run_10_threads (s) Inclusive Time w.r.t. Wall Time run_1_thread (s) Inclusive Time w.r.t. Wall Time run_2_threads (s) Inclusive Time w.r.t. Wall Time run_4_threads (s) Inclusive Time w.r.t. Wall Time run_8_threads (s) Inclusive Time w.r.t. Wall Time run_10_threads (s) Nb Threads run_1_thread Nb Threads run_2_threads Nb Threads run_4_threads Nb Threads run_8_threads Nb Threads run_10_threads GFLOPS run_1_thread GFLOPS run_2_threads GFLOPS run_4_threads GFLOPS run_8_threads GFLOPS run_10_threads Vectorization Ratio (%) Vector Length Use (%) Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing run_1_thread Speedup If Perfect Load Balancing run_2_threads Speedup If Perfect Load Balancing run_4_threads Speedup If Perfect Load Balancing run_8_threads Speedup If Perfect Load Balancing run_10_threads Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect Array Access Efficiency (run_1_thread) Efficiency (run_1_thread) Potential Speed-Up (%) (run_2_threads) Efficiency (run_2_threads) Potential Speed-Up (%) (run_4_threads) Efficiency (run_4_threads) Potential Speed-Up (%) (run_8_threads) Efficiency (run_8_threads) Potential Speed-Up (%) (run_10_threads) Efficiency (run_10_threads) Potential Speed-Up (%)
Loop idSource LocationSource FunctionLevelExclusive Coverage run_1_thread (%)Exclusive Coverage run_2_threads (%)Exclusive Coverage run_4_threads (%)Exclusive Coverage run_8_threads (%)Exclusive Coverage run_10_threads (%)Inclusive Coverage run_1_thread (%)Inclusive Coverage run_2_threads (%)Inclusive Coverage run_4_threads (%)Inclusive Coverage run_8_threads (%)Inclusive Coverage run_10_threads (%)Max Exclusive Time Over Threads run_1_thread (s)Max Exclusive Time Over Threads run_2_threads (s)Max Exclusive Time Over Threads run_4_threads (s)Max Exclusive Time Over Threads run_8_threads (s)Max Exclusive Time Over Threads run_10_threads (s)Max Inclusive Time Over Threads run_1_thread (s)Max Inclusive Time Over Threads run_2_threads (s)Max Inclusive Time Over Threads run_4_threads (s)Max Inclusive Time Over Threads run_8_threads (s)Max Inclusive Time Over Threads run_10_threads (s)Exclusive Time w.r.t. Wall Time run_1_thread (s)Exclusive Time w.r.t. Wall Time run_2_threads (s)Exclusive Time w.r.t. Wall Time run_4_threads (s)Exclusive Time w.r.t. Wall Time run_8_threads (s)Exclusive Time w.r.t. Wall Time run_10_threads (s)Inclusive Time w.r.t. Wall Time run_1_thread (s)Inclusive Time w.r.t. Wall Time run_2_threads (s)Inclusive Time w.r.t. Wall Time run_4_threads (s)Inclusive Time w.r.t. Wall Time run_8_threads (s)Inclusive Time w.r.t. Wall Time run_10_threads (s)Nb Threads run_1_threadNb Threads run_2_threadsNb Threads run_4_threadsNb Threads run_8_threadsNb Threads run_10_threadsGFLOPS run_1_threadGFLOPS run_2_threadsGFLOPS run_4_threadsGFLOPS run_8_threadsGFLOPS run_10_threadsVectorization Ratio (%)Vector Length Use (%)Speedup If No Scalar IntegerSpeedup If FP VectorizedSpeedup If Fully VectorizedSpeedup If Perfect Load Balancing run_1_threadSpeedup If Perfect Load Balancing run_2_threadsSpeedup If Perfect Load Balancing run_4_threadsSpeedup If Perfect Load Balancing run_8_threadsSpeedup If Perfect Load Balancing run_10_threadsStride 0Stride 1Stride nStride UnknownStride IndirectArray Access Efficiency(run_1_thread) Efficiency(run_1_thread) Potential Speed-Up (%)(run_2_threads) Efficiency(run_2_threads) Potential Speed-Up (%)(run_4_threads) Efficiency(run_4_threads) Potential Speed-Up (%)(run_8_threads) Efficiency(run_8_threads) Potential Speed-Up (%)(run_10_threads) Efficiency(run_10_threads) Potential Speed-Up (%)
9kmeans-clang-O3-all-soa - main_soa.cpp:62-67k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone .omp_outlined]Innermost88.4485.6581.2173.2970.0988.4485.6581.2173.2970.09133.2467.2335.8118.5714.84133.2467.2335.8118.5714.84133.2468.2637.5220.4016.73133.2468.2637.5220.4016.731248105.2610.2618.6634.3241.86011.6112.319.381111.011.0102000100.00100.982.050.899.110.8213.450.814.25
7kmeans-clang-O3-all-soa - main_soa.cpp:81-84k_means(int, point_t&, point_t&, int*, point_t&, int, int)Innermost4.904.764.413.943.744.904.764.413.943.747.387.457.757.917.897.387.457.757.917.897.383.792.041.100.897.383.792.041.100.89111110.681.322.464.565.60011.611.31.089.81111110300350.00100.970.130.910.420.840.630.830.65
10kmeans-clang-O3-all-soa - main_soa.cpp:62-67k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone .omp_outlined]Innermost4.404.294.083.873.594.404.294.083.873.596.633.371.851.070.816.633.371.851.070.816.633.421.891.080.866.633.421.891.080.861248104.017.9914.3026.0131.24011.6112.5810111.031.111.0802000100.00100.970.130.880.50.770.890.770.81
8kmeans-clang-O3-all-soa - main_soa.cpp:58-69k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone .omp_outlined]Outermost2.242.162.011.931.8295.0892.0987.3079.0975.493.381.720.960.540.43143.2572.2538.4719.9215.953.381.720.930.540.43143.2573.3940.3422.0118.021248106.8213.0824.4840.7653.0813.3310.422.5511411.021.081.111.13NANANANANA0.00100.980.040.910.180.790.410.780.4
×