Functions and Loops

NameModuleCoverage run_1_thread (%)Coverage run_2_threads (%)Coverage run_4_threads (%)Coverage run_8_threads (%)Coverage run_10_threads (%)Coverage Excluding Loops run_1_thread (%)Coverage Excluding Loops run_2_threads (%)Coverage Excluding Loops run_4_threads (%)Coverage Excluding Loops run_8_threads (%)Coverage Excluding Loops run_10_threads (%)Max Inclusive Time Over Threads run_1_thread (s)Max Inclusive Time Over Threads run_2_threads (s)Max Inclusive Time Over Threads run_4_threads (s)Max Inclusive Time Over Threads run_8_threads (s)Max Inclusive Time Over Threads run_10_threads (s)Max Exclusive Time Over Threads run_1_thread (s)Max Exclusive Time Over Threads run_2_threads (s)Max Exclusive Time Over Threads run_4_threads (s)Max Exclusive Time Over Threads run_8_threads (s)Max Exclusive Time Over Threads run_10_threads (s)Inclusive Time w.r.t. Wall Time run_1_thread (s)Inclusive Time w.r.t. Wall Time run_2_threads (s)Inclusive Time w.r.t. Wall Time run_4_threads (s)Inclusive Time w.r.t. Wall Time run_8_threads (s)Inclusive Time w.r.t. Wall Time run_10_threads (s)Exclusive Time w.r.t. Wall Time run_1_thread (s)Exclusive Time w.r.t. Wall Time run_2_threads (s)Exclusive Time w.r.t. Wall Time run_4_threads (s)Exclusive Time w.r.t. Wall Time run_8_threads (s)Exclusive Time w.r.t. 
Wall Time run_10_threads (s)Nb Threads run_1_threadNb Threads run_2_threadsNb Threads run_4_threadsNb Threads run_8_threadsNb Threads run_10_threadsDeviation (coverage) run_1_threadDeviation (coverage) run_2_threadsDeviation (coverage) run_4_threadsDeviation (coverage) run_8_threadsDeviation (coverage) run_10_threadsDeviation (walltime) run_1_threadDeviation (walltime) run_2_threadsDeviation (walltime) run_4_threadsDeviation (walltime) run_8_threadsDeviation (walltime) run_10_threadsCategories run_1_threadCategories run_2_threadsCategories run_4_threadsCategories run_8_threadsCategories run_10_threadsGFLOPS run_1_threadGFLOPS run_2_threadsGFLOPS run_4_threadsGFLOPS run_8_threadsGFLOPS run_10_threadsCompilation Options(run_1_thread) Efficiency(run_1_thread) Potential Speed-Up (%)(run_2_threads) Efficiency(run_2_threads) Potential Speed-Up (%)(run_4_threads) Efficiency(run_4_threads) Potential Speed-Up (%)(run_8_threads) Efficiency(run_8_threads) Potential Speed-Up (%)(run_10_threads) Efficiency(run_10_threads) Potential Speed-Up (%)
k_means(int, point_t&, point_t&, int*, point_t&, int, int) [clone .omp_outlined]+kmeans-clang-O3-ffast-math-soa95.0392.1187.3079.1275.490.000.000.000.000.00143.2672.3038.4719.9115.950.000.000.000.000.00143.2673.3940.3422.0218.020.000.000.000.000.001248100.002.272.973.363.350.000.110.070.050.04Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.005.2310.2218.5934.0641.62clang version 20.1.6 100.982.210.899.80.8114.770.815.47
Loop 8 - main_soa.cpp:58-69 - kmeans-clang-O3-ffast-math-soa+95.0392.1187.3079.1275.492.132.202.021.891.86143.2672.3738.7920.1116.083.211.731.000.520.45143.2673.3940.3422.0218.023.211.760.930.520.441248100.000.050.240.130.190.000.000.100.030.037.2013.1324.5244.3951.58100.910.190.860.280.760.440.720.52
Loop 9 - main_soa.cpp:62-67 - kmeans-clang-O3-ffast-math-soa88.3785.5281.1473.6369.9788.3785.5281.1473.6369.97133.2267.1635.9118.5614.81133.2267.1635.9118.5614.81133.2268.1337.5020.4916.70133.2268.1337.5020.4916.701248100.002.182.793.223.160.000.170.170.090.075.2610.2818.6734.1741.91100.981.910.899.070.8113.790.814.16
Loop 10 - main_soa.cpp:62-67 - kmeans-clang-O3-ffast-math-soa4.534.394.143.613.664.534.394.143.613.666.833.481.881.030.826.833.481.881.030.826.833.501.911.000.876.833.501.911.000.871248100.000.040.160.280.260.000.050.060.070.053.917.6314.0526.3330.90100.980.110.890.440.850.540.780.8
k_means(int, point_t&, point_t&, int*, point_t&, int, int)+kmeans-clang-O3-ffast-math-soa4.954.734.413.933.750.000.000.000.000.007.477.417.767.887.890.000.000.000.000.007.473.772.041.090.900.000.000.000.000.00111110.000.000.000.000.000.000.000.000.000.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.000.671.332.454.575.58clang version 20.1.6100.990.040.920.370.850.570.830.62
Loop 4 - main_soa.cpp:56-93 - kmeans-clang-O3-ffast-math-soa [...]+4.954.734.413.933.750.000.000.000.000.007.477.417.767.887.890.000.000.000.000.007.473.772.041.090.900.000.000.000.000.00000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 6 - main_soa.cpp:86-93 - kmeans-clang-O3-ffast-math-soa0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 5 - main_soa.cpp:86-93 - kmeans-clang-O3-ffast-math-soa0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 7 - main_soa.cpp:81-84 - kmeans-clang-O3-ffast-math-soa4.954.734.413.933.754.954.734.413.933.757.477.417.767.887.897.477.417.767.887.897.473.772.041.090.907.473.772.041.090.90111110.000.000.000.000.000.000.000.000.000.000.671.332.454.575.58100.990.040.920.370.850.570.830.62
unknown_kernel_regionkernel0.020.010.020.030.030.000.000.000.000.000.020.010.030.030.010.000.000.000.000.000.030.010.010.010.010.000.000.000.000.00114580.000.000.030.050.010.000.000.010.010.00System (%): 100.00System (%): 100.00Pthread (%): 85.71
System (%): 14.29
System (%): 58.33
Pthread (%): 41.67
Pthread (%): 66.67
System (%): 33.33
0.000.001.091.200.00
__kmpc_threadprivate_register_veclibomp.so0.002.486.4613.3916.230.002.486.4613.3916.230.003.803.833.903.850.003.803.833.903.850.001.982.983.733.870.001.982.983.733.870248100.003.404.265.315.600.002.621.841.311.16NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.001010101010
__kmp_invoke_microtasklibomp.so0.000.661.793.494.460.000.661.793.494.460.001.011.071.091.080.001.011.071.091.080.000.530.830.971.060.000.530.830.971.060248100.000.901.171.391.520.000.690.500.340.31NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.001010101010
__sched_yieldlibc.so.60.000.000.010.020.010.000.000.010.020.010.000.000.010.010.010.000.000.010.010.010.000.000.010.010.000.000.000.010.010.00003440.000.000.010.020.010.000.000.000.000.00NANAPthread (%): 100.00Pthread (%): 100.00Pthread (%): 100.000.000.000.000.000.00
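
The report does not say how the Efficiency column is derived. As a rough cross-check, the sketch below assumes the conventional definition of parallel efficiency, (single-thread time / n-thread time) / n, applied to the "Inclusive Time w.r.t. Wall Time" values of the `k_means [clone .omp_outlined]` row. The function names `speedup` and `efficiency` are illustrative, not part of the profiler. Under that assumption the results match the Efficiency values listed for this row to two decimals.

```python
# Inclusive time w.r.t. wall time (s) for k_means [clone .omp_outlined],
# keyed by thread count, copied from the table.
wall_time = {1: 143.26, 2: 73.39, 4: 40.34, 8: 22.02, 10: 18.02}


def speedup(times, n):
    """Speed-up of the n-thread run relative to the 1-thread run."""
    return times[1] / times[n]


def efficiency(times, n):
    """Parallel efficiency: speed-up divided by the thread count (assumed definition)."""
    return speedup(times, n) / n


for n in sorted(wall_time):
    print(f"{n:>2} threads: speed-up {speedup(wall_time, n):.2f}, "
          f"efficiency {efficiency(wall_time, n):.2f}")
```

The same two functions can be pointed at any other row by swapping in its wall-time values; rows whose single-thread time is 0.00 (the `libomp.so` entries) would need a guard against division by zero.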