| ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Loop 16 | exec | cg.cpp:86-90 | cg_calc_w(int, int, int, double*, double const*, double*, double const*, double const*) [clone .omp_outlined] | Innermost | 1439.65 | 1439.65 | 55.35 | 1.00 | 1.00 | 3.67 | 1.33 | 1 | 4.35 | 26.09 | 7.05 | 3.67 | 3.67 | 3.67 | 1.00 | 2.75 |
| Loop 19 | exec | cg.cpp:108-113 | cg_calc_ur(int, int, int, double, double*, double*, double const*, double*, double const*) [clone .omp_outlined] | Innermost | 843.51 | 843.52 | 32.43 | 1.00 | 1.00 | 4.00 | 1.58 | 1 | 0.00 | 25.00 | 4.81 | 3.17 | 3.17 | 3.17 | 0.79 | 2.00 |
| Loop 23 | exec | cg.cpp:128-131 | cg_calc_p(int, int, int, double, double*, double const*) [clone .omp_outlined] | Innermost | 303.33 | 303.33 | 11.66 | 1.00 | 1.87 | 1.00 | 1.17 | 1 | 83.33 | 70.83 | 4.45 | 1.17 | 1.17 | 0.63 | 1.17 | 1.00 |
| Loop 70 | exec | local_halos.cpp:13-15 | update_left(int, int, int, int, double*, bool) [clone .omp_outlined] | Innermost | 5.68 | 5.68 | 0.22 | 1.00 | 1.00 | 4.00 | 1.00 | 1 | 0.00 | 25.00 | 0.00 | 1.00 | 1.00 | 1.00 | 0.25 | 1.00 |
| Loop 17 | exec | cg.cpp:85-85 | cg_calc_w(int, int, int, double*, double const*, double*, double const*, double const*) [clone .omp_outlined] | Outermost | 1.79 | 1.79 | 0.07 | 1.00 | 1.00 | 2.81 | 2.38 | 1 | 0.00 | 31.94 | 5.34 | 4.75 | 4.75 | 4.75 | 1.69 | 2.00 |
| Loop 72 | exec | local_halos.cpp:28-30 | update_right(int, int, int, int, double*, bool) [clone .omp_outlined] | Innermost | 1.52 | 1.52 | 0.06 | 1.00 | 1.00 | 4.00 | 1.00 | 1 | 0.00 | 25.00 | 0.00 | 1.00 | 1.00 | 1.00 | 0.25 | 1.00 |
| Loop 22 | exec | cg.cpp:127-128,cg.cpp:131-131 | cg_calc_p(int, int, int, double, double*, double const*) [clone .omp_outlined] | Outermost | 1.13 | 1.13 | 0.04 | 5.29 | 1.00 | 2.45 | 5.29 | 5 | 0.00 | 27.02 | 3.94 | 9.25 | 1.75 | 9.25 | 3.78 | 1.75 |
| Loop 13 | exec | cg.cpp:62-68 | cg_init(int, int, int, int, double, double, double*, double const*, double const*, double*, double*, double*, double*, double*, double*) [clone .omp_outlined.6] | Innermost | 0.47 | 0.48 | 0.02 | 1.00 | 1.00 | 3.73 | 1.56 | 1 | 3.70 | 25.93 | 5.38 | 4.67 | 4.67 | 4.67 | 1.25 | 3.00 |
| Loop 20 | exec | cg.cpp:107-107 | cg_calc_ur(int, int, int, double, double*, double*, double const*, double*, double const*) [clone .omp_outlined] | Outermost | 0.43 | 0.43 | 0.02 | 1.00 | 1.00 | 4.71 | 1.25 | 1 | 0.00 | 22.22 | 5.27 | 2.50 | 2.50 | 2.50 | 0.53 | 2.00 |
| Loop 11 | exec | cg.cpp:47-50 | cg_init(int, int, int, int, double, double, double*, double const*, double const*, double*, double*, double*, double*, double*, double*) [clone .omp_outlined.4] | Innermost | 0.32 | 0.32 | 0.01 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 93.75 | 48.44 | 5.00 | 6.99 - 13.99 | 6.99 - 13.99 | 6.99 - 13.99 | 6.99 - 13.99 | 6.99 - 13.99 |
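
In the rows above, each "Speedup if ..." value appears to equal the "CQA cycles" value divided by the matching "CQA cycles if ..." value, up to rounding of the two-decimal figures. The sketch below checks that reading for the "fully vectorized" column on a few rows copied from the table. The helper names `speedup` and `check` are hypothetical, and the relation is inferred from the numbers shown here, not taken from the tool's documentation. Rows reporting a cycle range (such as Loop 11) are not handled.

```python
def speedup(cqa_cycles: float, cqa_cycles_if: float) -> float:
    """Ratio of current CQA cycles to the cycles estimated for a hypothetical scenario."""
    return cqa_cycles / cqa_cycles_if


# (loop, CQA cycles, CQA cycles if fully vectorized, reported speedup if fully vectorized)
# Values are copied from the table; the tuple layout is an assumption of this sketch.
rows = [
    ("Loop 16", 3.67, 1.00, 3.67),
    ("Loop 19", 3.17, 0.79, 4.00),
    ("Loop 70", 1.00, 0.25, 4.00),
    ("Loop 17", 4.75, 1.69, 2.81),
]


def check(table_rows, tol=0.02):
    """Return (loop, recomputed speedup, matches reported value within tol) per row."""
    results = []
    for name, cycles, cycles_if, reported in table_rows:
        value = speedup(cycles, cycles_if)
        results.append((name, round(value, 2), abs(value - reported) <= tol))
    return results


for name, value, ok in check(rows):
    print(f"{name}: recomputed speedup {value:.2f}, consistent with table: {ok}")
```

The tolerance of 0.02 allows for the table's inputs already being rounded to two decimals. The same division reproduces the other speedup columns, for example 9.25 / 1.75 for the "no scalar integer" speedup of Loop 22.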