ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
○Loop 1 | kmeans-gcc-O3-funroll | main.cpp:61-67 | k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone ._omp_fn.0] | Innermost | 142.26 | 142.26 | 87.41 | 1.00 | 2.50 | 8.00 | 1.25 | 16 | 14.29 | 13.39 | 4.12 | 10.00 | 10.00 | 4.00 | 1.25 | 8.00 |
○Loop 6 | kmeans-gcc-O3-funroll | main.cpp:81-84 | k_means(int, point_t*, point_t*, int*, point_t*, int, int) | Innermost | 10.89 | 10.89 | 6.69 | 1.20 | 1.00 | 9.60 | 3.00 | 1 | 0.00 | 11.61 | 0.46 | 12.00 | 10.00 | 12.00 | 1.25 | 4.00 |
○Loop 2 | kmeans-gcc-O3-funroll | main.cpp:60-67 | k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone ._omp_fn.0] | Outermost | 9.59 | 9.59 | 5.89 | 1.47 | 2.30 | 11.82 | 1.68 | 254 | 12.12 | 11.74 | 4.08 | 11.75 | 8.00 | 5.11 | 0.99 | 7.00 |