options

Loops Index

Columns Filter

Level Exclusive Coverage run_1_thread (%) Exclusive Coverage run_2_threads (%) Exclusive Coverage run_4_threads (%) Exclusive Coverage run_8_threads (%) Exclusive Coverage run_10_threads (%) Inclusive Coverage run_1_thread (%) Inclusive Coverage run_2_threads (%) Inclusive Coverage run_4_threads (%) Inclusive Coverage run_8_threads (%) Inclusive Coverage run_10_threads (%) Max Exclusive Time Over Threads run_1_thread (s) Max Exclusive Time Over Threads run_2_threads (s) Max Exclusive Time Over Threads run_4_threads (s) Max Exclusive Time Over Threads run_8_threads (s) Max Exclusive Time Over Threads run_10_threads (s) Max Inclusive Time Over Threads run_1_thread (s) Max Inclusive Time Over Threads run_2_threads (s) Max Inclusive Time Over Threads run_4_threads (s) Max Inclusive Time Over Threads run_8_threads (s) Max Inclusive Time Over Threads run_10_threads (s) Exclusive Time w.r.t. Wall Time run_1_thread (s) Exclusive Time w.r.t. Wall Time run_2_threads (s) Exclusive Time w.r.t. Wall Time run_4_threads (s) Exclusive Time w.r.t. Wall Time run_8_threads (s) Exclusive Time w.r.t. Wall Time run_10_threads (s) Inclusive Time w.r.t. Wall Time run_1_thread (s) Inclusive Time w.r.t. Wall Time run_2_threads (s) Inclusive Time w.r.t. Wall Time run_4_threads (s) Inclusive Time w.r.t. Wall Time run_8_threads (s) Inclusive Time w.r.t. Wall Time run_10_threads (s) Nb Threads run_1_thread Nb Threads run_2_threads Nb Threads run_4_threads Nb Threads run_8_threads Nb Threads run_10_threads GFLOPS run_1_thread GFLOPS run_2_threads GFLOPS run_4_threads GFLOPS run_8_threads GFLOPS run_10_threads Vectorization Ratio (%) Vector Length Use (%) Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing run_1_thread Speedup If Perfect Load Balancing run_2_threads Speedup If Perfect Load Balancing run_4_threads Speedup If Perfect Load Balancing run_8_threads Speedup If Perfect Load Balancing run_10_threads Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect Array Access Efficiency (run_1_thread) Efficiency (run_1_thread) Potential Speed-Up (%) (run_2_threads) Efficiency (run_2_threads) Potential Speed-Up (%) (run_4_threads) Efficiency (run_4_threads) Potential Speed-Up (%) (run_8_threads) Efficiency (run_8_threads) Potential Speed-Up (%) (run_10_threads) Efficiency (run_10_threads) Potential Speed-Up (%)
Loop idSource LocationSource FunctionLevelExclusive Coverage run_1_thread (%)Exclusive Coverage run_2_threads (%)Exclusive Coverage run_4_threads (%)Exclusive Coverage run_8_threads (%)Exclusive Coverage run_10_threads (%)Inclusive Coverage run_1_thread (%)Inclusive Coverage run_2_threads (%)Inclusive Coverage run_4_threads (%)Inclusive Coverage run_8_threads (%)Inclusive Coverage run_10_threads (%)Max Exclusive Time Over Threads run_1_thread (s)Max Exclusive Time Over Threads run_2_threads (s)Max Exclusive Time Over Threads run_4_threads (s)Max Exclusive Time Over Threads run_8_threads (s)Max Exclusive Time Over Threads run_10_threads (s)Max Inclusive Time Over Threads run_1_thread (s)Max Inclusive Time Over Threads run_2_threads (s)Max Inclusive Time Over Threads run_4_threads (s)Max Inclusive Time Over Threads run_8_threads (s)Max Inclusive Time Over Threads run_10_threads (s)Exclusive Time w.r.t. Wall Time run_1_thread (s)Exclusive Time w.r.t. Wall Time run_2_threads (s)Exclusive Time w.r.t. Wall Time run_4_threads (s)Exclusive Time w.r.t. Wall Time run_8_threads (s)Exclusive Time w.r.t. Wall Time run_10_threads (s)Inclusive Time w.r.t. Wall Time run_1_thread (s)Inclusive Time w.r.t. Wall Time run_2_threads (s)Inclusive Time w.r.t. Wall Time run_4_threads (s)Inclusive Time w.r.t. Wall Time run_8_threads (s)Inclusive Time w.r.t. Wall Time run_10_threads (s)Nb Threads run_1_threadNb Threads run_2_threadsNb Threads run_4_threadsNb Threads run_8_threadsNb Threads run_10_threadsGFLOPS run_1_threadGFLOPS run_2_threadsGFLOPS run_4_threadsGFLOPS run_8_threadsGFLOPS run_10_threadsVectorization Ratio (%)Vector Length Use (%)Speedup If No Scalar IntegerSpeedup If FP VectorizedSpeedup If Fully VectorizedSpeedup If Perfect Load Balancing run_1_threadSpeedup If Perfect Load Balancing run_2_threadsSpeedup If Perfect Load Balancing run_4_threadsSpeedup If Perfect Load Balancing run_8_threadsSpeedup If Perfect Load Balancing run_10_threadsStride 0Stride 1Stride nStride UnknownStride IndirectArray Access Efficiency(run_1_thread) Efficiency(run_1_thread) Potential Speed-Up (%)(run_2_threads) Efficiency(run_2_threads) Potential Speed-Up (%)(run_4_threads) Efficiency(run_4_threads) Potential Speed-Up (%)(run_8_threads) Efficiency(run_8_threads) Potential Speed-Up (%)(run_10_threads) Efficiency(run_10_threads) Potential Speed-Up (%)
9kmeans-clang-O3-soa - main_soa.cpp:62-67k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone .omp_outlined]Innermost88.1685.7281.2274.1870.9888.1685.7281.2274.1870.98141.1471.2937.8919.7115.74141.1471.2937.8919.7115.74141.1472.3039.4721.4817.53141.1472.3039.4721.4817.531248104.939.6217.6332.3939.68011.6112.319.381111.011.0102000100.00100.982.050.898.620.8213.240.813.85
7kmeans-clang-O3-soa - main_soa.cpp:81-84k_means(int, point_t&, point_t&, int*, point_t&, int, int)Innermost4.544.384.073.663.504.544.384.073.663.507.277.287.587.737.717.277.287.587.737.717.273.701.981.060.867.273.701.981.060.86111110.691.352.524.725.79011.611.31.089.81111110300350.00100.980.070.920.330.860.520.840.55
10kmeans-clang-O3-soa - main_soa.cpp:62-67k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone .omp_outlined]Innermost4.334.034.113.603.444.334.034.113.603.446.933.381.971.030.856.933.381.971.030.856.933.402.001.040.856.933.402.001.040.851248104.188.3814.2127.4933.95011.6112.581011.011.031.091.1202000100.00101.0200.870.550.830.610.820.63
8kmeans-clang-O3-soa - main_soa.cpp:58-69k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone .omp_outlined]Outermost2.952.852.722.442.2995.4492.6188.0680.2376.704.732.381.360.760.57152.8077.0641.0221.2316.994.732.411.320.710.57152.8078.1042.8023.2318.951248105.2410.7319.3236.1244.8513.3310.422.5511411.011.071.181.14NANANANANA0.00100.980.050.890.290.840.40.840.38
×