Loops Index

Loop id: 4
Source Location: kmeans-gcc-O3-soa - main_soa.cpp:62-66
Source Function: k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone ._omp_fn.0]
Level: Innermost
Vectorization Ratio (%): 0
Vector Length Use (%): 47.22
Speedup If No Scalar Integer: 1
Speedup If FP Vectorized: 1.41
Speedup If Fully Vectorized: 2.4
Stride 0: 0
Stride 1: 0
Stride n: 0
Stride Unknown: 0
Stride Indirect: 0
Array Access Efficiency: 0.00

Metric / Run (threads)                      1       2       4       8      16      32      48      64      80      96
Exclusive Coverage (%)                  85.07   85.01   85.02   85.30   85.08   85.15   85.09   85.21   85.14   84.95
Inclusive Coverage (%)                  85.07   85.01   85.02   85.30   85.08   85.15   85.09   85.21   85.14   84.95
Max Exclusive Time Over Threads (s)    157.79   79.14   39.71   19.98   10.04    5.12    3.45    2.63    2.10    1.76
Max Inclusive Time Over Threads (s)    157.79   79.14   39.71   19.98   10.04    5.12    3.45    2.63    2.10    1.76
Exclusive Time w.r.t. Wall Time (s)    157.79   82.26   44.25   25.43   15.92   11.18    9.57    8.83    8.37    8.03
Inclusive Time w.r.t. Wall Time (s)    157.79   82.26   44.25   25.43   15.92   11.18    9.57    8.83    8.37    8.03
Nb Threads                                  1       2       4       8      16      32      48      64      80      96
GFLOPS                                   4.86    9.32   17.32   30.14   48.13   68.42   79.97   86.84   91.65   95.08
Speedup If Perfect Load Balancing           1       1    1.01    1.01    1.01    1.02    1.03    1.04    1.03    1.05
Efficiency                                  1    0.96    0.89    0.78    0.62    0.44    0.34    0.28    0.24     0.2
Potential Speed-Up (%)                      0    3.48    9.22   19.13   32.37   47.58   55.87   61.43   65.07   67.56

Loop id: 5
Source Location: kmeans-gcc-O3-soa - main_soa.cpp:60-69
Source Function: k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone ._omp_fn.0]
Level: InBetween
Vectorization Ratio (%): 0
Vector Length Use (%): 42.86
Speedup If No Scalar Integer: 1.38
Speedup If FP Vectorized: 1
Speedup If Fully Vectorized: 3
Stride 0: 0
Stride 1: 0
Stride n: 0
Stride Unknown: 2
Stride Indirect: 0
Array Access Efficiency: 50.00

Metric / Run (threads)                      1       2       4       8      16      32      48      64      80      96
Exclusive Coverage (%)                  10.92   10.99   10.97   10.69   10.86   10.79   10.81   10.69   10.67   10.82
Inclusive Coverage (%)                  95.99   96.00   95.99   95.99   95.94   95.94   95.90   95.90   95.82   95.78
Max Exclusive Time Over Threads (s)     20.25   10.25    5.32    2.68    1.41    0.73    0.52    0.44    0.33    0.29
Max Inclusive Time Over Threads (s)    178.05   89.33   44.62   22.38   11.23    5.66    3.79    2.86    2.29    1.92
Exclusive Time w.r.t. Wall Time (s)     20.25   10.64    5.71    3.19    2.03    1.42    1.22    1.11    1.05    1.02
Inclusive Time w.r.t. Wall Time (s)    178.05   92.90   49.96   28.62   17.95   12.59   10.79    9.94    9.41    9.05
Nb Threads                                  1       2       4       8      16      32      48      64      80      96
GFLOPS                                   5.32   10.17   19.02   34.05   53.58   77.63   89.23   96.40  102.53  106.20
Speedup If Perfect Load Balancing           1       1    1.04    1.08    1.11    1.15    1.22    1.39    1.32    1.35
Efficiency                                  1    0.95    0.89    0.79    0.62    0.45    0.35    0.29    0.24    0.21
Potential Speed-Up (%)                      0    0.53    1.24     2.2    4.09    5.97    7.06    7.64     8.1    8.59

Loop id: 11
Source Location: kmeans-gcc-O3-soa - main_soa.cpp:81-84
Source Function: k_means(int, point_t&, point_t&, int*, point_t&, int, int)
Level: Innermost
Vectorization Ratio (%): 0
Vector Length Use (%): 43.18
Speedup If No Scalar Integer: 1.58
Speedup If FP Vectorized: 1.12
Speedup If Fully Vectorized: 2.45
Stride 0: 0
Stride 1: 0
Stride n: 0
Stride Unknown: 0
Stride Indirect: 3
Array Access Efficiency: 0.00

Metric / Run (threads)                      1       2       4       8      16      32      48      64      80      96
Exclusive Coverage (%)                   4.01    4.00    4.00    3.98    4.00    3.98    3.95    3.95    3.96    3.95
Inclusive Coverage (%)                   4.01    4.00    4.00    3.98    4.00    3.98    3.95    3.95    3.96    3.95
Max Exclusive Time Over Threads (s)      7.45    7.43    7.44    7.42    7.48    7.49    7.46    7.51    7.54    7.53
Max Inclusive Time Over Threads (s)      7.45    7.43    7.44    7.42    7.48    7.49    7.46    7.51    7.54    7.53
Exclusive Time w.r.t. Wall Time (s)      7.45    3.87    2.08    1.19    0.75    0.52    0.44    0.41    0.39    0.37
Inclusive Time w.r.t. Wall Time (s)      7.45    3.87    2.08    1.19    0.75    0.52    0.44    0.41    0.39    0.37
Nb Threads                                  1       1       1       1       1       1       1       1       1       1
GFLOPS                                   0.67    1.29    2.40    4.21    6.68    9.58   11.26   12.22   12.85   13.38
Speedup If Perfect Load Balancing           1       1       1       1       1       1       1       1       1       1
Efficiency                                  1    0.96    0.89    0.78    0.62    0.45    0.35    0.28    0.24    0.21
Potential Speed-Up (%)                      0    0.15    0.43    0.86    1.51    2.21    2.57    2.83    3.01    3.13