options

Loops Index

Columns Filter

Level Exclusive Coverage run_1_thread (%) Exclusive Coverage run_2_threads (%) Exclusive Coverage run_4_threads (%) Exclusive Coverage run_8_threads (%) Exclusive Coverage run_16_threads (%) Exclusive Coverage run_32_threads (%) Exclusive Coverage run_48_threads (%) Exclusive Coverage run_64_threads (%) Exclusive Coverage run_80_threads (%) Exclusive Coverage run_96_threads (%) Inclusive Coverage run_1_thread (%) Inclusive Coverage run_2_threads (%) Inclusive Coverage run_4_threads (%) Inclusive Coverage run_8_threads (%) Inclusive Coverage run_16_threads (%) Inclusive Coverage run_32_threads (%) Inclusive Coverage run_48_threads (%) Inclusive Coverage run_64_threads (%) Inclusive Coverage run_80_threads (%) Inclusive Coverage run_96_threads (%) Max Exclusive Time Over Threads run_1_thread (s) Max Exclusive Time Over Threads run_2_threads (s) Max Exclusive Time Over Threads run_4_threads (s) Max Exclusive Time Over Threads run_8_threads (s) Max Exclusive Time Over Threads run_16_threads (s) Max Exclusive Time Over Threads run_32_threads (s) Max Exclusive Time Over Threads run_48_threads (s) Max Exclusive Time Over Threads run_64_threads (s) Max Exclusive Time Over Threads run_80_threads (s) Max Exclusive Time Over Threads run_96_threads (s) Max Inclusive Time Over Threads run_1_thread (s) Max Inclusive Time Over Threads run_2_threads (s) Max Inclusive Time Over Threads run_4_threads (s) Max Inclusive Time Over Threads run_8_threads (s) Max Inclusive Time Over Threads run_16_threads (s) Max Inclusive Time Over Threads run_32_threads (s) Max Inclusive Time Over Threads run_48_threads (s) Max Inclusive Time Over Threads run_64_threads (s) Max Inclusive Time Over Threads run_80_threads (s) Max Inclusive Time Over Threads run_96_threads (s) Exclusive Time w.r.t. Wall Time run_1_thread (s) Exclusive Time w.r.t. Wall Time run_2_threads (s) Exclusive Time w.r.t. Wall Time run_4_threads (s) Exclusive Time w.r.t. Wall Time run_8_threads (s) Exclusive Time w.r.t. Wall Time run_16_threads (s) Exclusive Time w.r.t. Wall Time run_32_threads (s) Exclusive Time w.r.t. Wall Time run_48_threads (s) Exclusive Time w.r.t. Wall Time run_64_threads (s) Exclusive Time w.r.t. Wall Time run_80_threads (s) Exclusive Time w.r.t. Wall Time run_96_threads (s) Inclusive Time w.r.t. Wall Time run_1_thread (s) Inclusive Time w.r.t. Wall Time run_2_threads (s) Inclusive Time w.r.t. Wall Time run_4_threads (s) Inclusive Time w.r.t. Wall Time run_8_threads (s) Inclusive Time w.r.t. Wall Time run_16_threads (s) Inclusive Time w.r.t. Wall Time run_32_threads (s) Inclusive Time w.r.t. Wall Time run_48_threads (s) Inclusive Time w.r.t. Wall Time run_64_threads (s) Inclusive Time w.r.t. Wall Time run_80_threads (s) Inclusive Time w.r.t. Wall Time run_96_threads (s) Nb Threads run_1_thread Nb Threads run_2_threads Nb Threads run_4_threads Nb Threads run_8_threads Nb Threads run_16_threads Nb Threads run_32_threads Nb Threads run_48_threads Nb Threads run_64_threads Nb Threads run_80_threads Nb Threads run_96_threads GFLOPS run_1_thread GFLOPS run_2_threads GFLOPS run_4_threads GFLOPS run_8_threads GFLOPS run_16_threads GFLOPS run_32_threads GFLOPS run_48_threads GFLOPS run_64_threads GFLOPS run_80_threads GFLOPS run_96_threads Vectorization Ratio (%) Vector Length Use (%) Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing run_1_thread Speedup If Perfect Load Balancing run_2_threads Speedup If Perfect Load Balancing run_4_threads Speedup If Perfect Load Balancing run_8_threads Speedup If Perfect Load Balancing run_16_threads Speedup If Perfect Load Balancing run_32_threads Speedup If Perfect Load Balancing run_48_threads Speedup If Perfect Load Balancing run_64_threads Speedup If Perfect Load Balancing run_80_threads Speedup If Perfect Load Balancing run_96_threads Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect Array Access Efficiency (run_1_thread) Efficiency (run_1_thread) Potential Speed-Up (%) (run_2_threads) Efficiency (run_2_threads) Potential Speed-Up (%) (run_4_threads) Efficiency (run_4_threads) Potential Speed-Up (%) (run_8_threads) Efficiency (run_8_threads) Potential Speed-Up (%) (run_16_threads) Efficiency (run_16_threads) Potential Speed-Up (%) (run_32_threads) Efficiency (run_32_threads) Potential Speed-Up (%) (run_48_threads) Efficiency (run_48_threads) Potential Speed-Up (%) (run_64_threads) Efficiency (run_64_threads) Potential Speed-Up (%) (run_80_threads) Efficiency (run_80_threads) Potential Speed-Up (%) (run_96_threads) Efficiency (run_96_threads) Potential Speed-Up (%)
Loop idSource LocationSource FunctionLevelExclusive Coverage run_1_thread (%)Exclusive Coverage run_2_threads (%)Exclusive Coverage run_4_threads (%)Exclusive Coverage run_8_threads (%)Exclusive Coverage run_16_threads (%)Exclusive Coverage run_32_threads (%)Exclusive Coverage run_48_threads (%)Exclusive Coverage run_64_threads (%)Exclusive Coverage run_80_threads (%)Exclusive Coverage run_96_threads (%)Inclusive Coverage run_1_thread (%)Inclusive Coverage run_2_threads (%)Inclusive Coverage run_4_threads (%)Inclusive Coverage run_8_threads (%)Inclusive Coverage run_16_threads (%)Inclusive Coverage run_32_threads (%)Inclusive Coverage run_48_threads (%)Inclusive Coverage run_64_threads (%)Inclusive Coverage run_80_threads (%)Inclusive Coverage run_96_threads (%)Max Exclusive Time Over Threads run_1_thread (s)Max Exclusive Time Over Threads run_2_threads (s)Max Exclusive Time Over Threads run_4_threads (s)Max Exclusive Time Over Threads run_8_threads (s)Max Exclusive Time Over Threads run_16_threads (s)Max Exclusive Time Over Threads run_32_threads (s)Max Exclusive Time Over Threads run_48_threads (s)Max Exclusive Time Over Threads run_64_threads (s)Max Exclusive Time Over Threads run_80_threads (s)Max Exclusive Time Over Threads run_96_threads (s)Max Inclusive Time Over Threads run_1_thread (s)Max Inclusive Time Over Threads run_2_threads (s)Max Inclusive Time Over Threads run_4_threads (s)Max Inclusive Time Over Threads run_8_threads (s)Max Inclusive Time Over Threads run_16_threads (s)Max Inclusive Time Over Threads run_32_threads (s)Max Inclusive Time Over Threads run_48_threads (s)Max Inclusive Time Over Threads run_64_threads (s)Max Inclusive Time Over Threads run_80_threads (s)Max Inclusive Time Over Threads run_96_threads (s)Exclusive Time w.r.t. Wall Time run_1_thread (s)Exclusive Time w.r.t. Wall Time run_2_threads (s)Exclusive Time w.r.t. Wall Time run_4_threads (s)Exclusive Time w.r.t. Wall Time run_8_threads (s)Exclusive Time w.r.t. Wall Time run_16_threads (s)Exclusive Time w.r.t. Wall Time run_32_threads (s)Exclusive Time w.r.t. Wall Time run_48_threads (s)Exclusive Time w.r.t. Wall Time run_64_threads (s)Exclusive Time w.r.t. Wall Time run_80_threads (s)Exclusive Time w.r.t. Wall Time run_96_threads (s)Inclusive Time w.r.t. Wall Time run_1_thread (s)Inclusive Time w.r.t. Wall Time run_2_threads (s)Inclusive Time w.r.t. Wall Time run_4_threads (s)Inclusive Time w.r.t. Wall Time run_8_threads (s)Inclusive Time w.r.t. Wall Time run_16_threads (s)Inclusive Time w.r.t. Wall Time run_32_threads (s)Inclusive Time w.r.t. Wall Time run_48_threads (s)Inclusive Time w.r.t. Wall Time run_64_threads (s)Inclusive Time w.r.t. Wall Time run_80_threads (s)Inclusive Time w.r.t. Wall Time run_96_threads (s)Nb Threads run_1_threadNb Threads run_2_threadsNb Threads run_4_threadsNb Threads run_8_threadsNb Threads run_16_threadsNb Threads run_32_threadsNb Threads run_48_threadsNb Threads run_64_threadsNb Threads run_80_threadsNb Threads run_96_threadsGFLOPS run_1_threadGFLOPS run_2_threadsGFLOPS run_4_threadsGFLOPS run_8_threadsGFLOPS run_16_threadsGFLOPS run_32_threadsGFLOPS run_48_threadsGFLOPS run_64_threadsGFLOPS run_80_threadsGFLOPS run_96_threadsVectorization Ratio (%)Vector Length Use (%)Speedup If No Scalar IntegerSpeedup If FP VectorizedSpeedup If Fully VectorizedSpeedup If Perfect Load Balancing run_1_threadSpeedup If Perfect Load Balancing run_2_threadsSpeedup If Perfect Load Balancing run_4_threadsSpeedup If Perfect Load Balancing run_8_threadsSpeedup If Perfect Load Balancing run_16_threadsSpeedup If Perfect Load Balancing run_32_threadsSpeedup If Perfect Load Balancing run_48_threadsSpeedup If Perfect Load Balancing run_64_threadsSpeedup If Perfect Load Balancing run_80_threadsSpeedup If Perfect Load Balancing run_96_threadsStride 0Stride 1Stride nStride UnknownStride IndirectArray Access Efficiency(run_1_thread) Efficiency(run_1_thread) Potential Speed-Up (%)(run_2_threads) Efficiency(run_2_threads) Potential Speed-Up (%)(run_4_threads) Efficiency(run_4_threads) Potential Speed-Up (%)(run_8_threads) Efficiency(run_8_threads) Potential Speed-Up (%)(run_16_threads) Efficiency(run_16_threads) Potential Speed-Up (%)(run_32_threads) Efficiency(run_32_threads) Potential Speed-Up (%)(run_48_threads) Efficiency(run_48_threads) Potential Speed-Up (%)(run_64_threads) Efficiency(run_64_threads) Potential Speed-Up (%)(run_80_threads) Efficiency(run_80_threads) Potential Speed-Up (%)(run_96_threads) Efficiency(run_96_threads) Potential Speed-Up (%)
4kmeans-gcc-O3-funroll-soa - main_soa.cpp:62-67k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone ._omp_fn.0]Innermost73.0973.2373.1873.4973.5573.2973.1272.7873.1273.5073.0973.2373.1873.4973.5573.2973.1272.7873.1273.50122.0661.2931.0115.597.863.982.712.021.681.52122.0661.2931.0115.597.863.982.712.021.681.52122.0663.9134.9120.2112.909.177.947.306.956.82122.0663.9134.9120.2112.909.177.947.306.956.8212481632486480965.3910.3518.8932.6351.0871.8283.0190.2694.6292.82047.2211.352.27111.011.011.021.021.051.041.081.09000000.00100.953.30.879.220.7518.020.5930.040.4242.810.3249.690.2653.750.2257.080.1959.81
5kmeans-gcc-O3-funroll-soa - main_soa.cpp:58-69k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone ._omp_fn.0]InBetween22.4622.3322.3822.0521.9922.2222.3722.6322.2722.0495.5595.5695.5695.5495.5395.5195.4995.4095.3995.5437.5118.759.594.802.441.280.940.700.600.54159.5879.8540.2620.1010.105.093.402.572.051.8437.5119.4810.686.073.862.782.432.272.122.05159.5883.3945.5926.2816.7511.9510.379.569.078.8712481632486480965.7810.9720.1635.4355.8477.3788.7695.26101.37100.08047.222.251.392.4611.011.021.041.061.091.181.161.261.31NANANANANA0.00100.960.830.882.730.7750.618.620.4212.860.3215.170.2616.780.2217.340.1917.83
16kmeans-gcc-O3-funroll-soa - main_soa.cpp:81-84k_means(int, point_t&, point_t&, int*, point_t&, int, int)Innermost4.454.444.424.424.414.384.374.354.394.114.454.444.424.424.414.384.374.354.394.117.437.417.457.407.437.437.447.447.497.477.437.417.457.407.437.437.447.447.497.477.433.872.111.210.770.550.470.440.420.387.433.872.111.210.770.550.470.440.420.3811111111110.671.292.374.126.469.1310.5411.4611.9813.09044.511.51.072.411111111110100325.00100.960.180.880.530.761.040.61.770.422.520.332.950.273.190.223.410.23.28
×