| ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| ○Loop 19 | exec | cg.cpp:108-113 | cg_calc_ur(int, int, int, double, double*, double*, double const*, double*, double const*) [clone .omp_outlined] | Innermost | 116.31 | 114.92 | 43.27 | 1.00 | 1.00 | 4.00 | 1.58 | 1 | 0.00 | 25.00 | 35.42 | 3.17 | 3.17 | 3.17 | 0.79 | 2.00 |
| ○Loop 16 | exec | cg.cpp:86-90 | cg_calc_w(int, int, int, double*, double const*, double*, double const*, double const*) [clone .omp_outlined] | Innermost | 89.56 | 87.14 | 32.81 | 1.00 | 1.00 | 3.67 | 1.33 | 1 | 4.35 | 26.09 | 116.43 | 3.67 | 3.67 | 3.67 | 1.00 | 2.75 |
| ○Loop 23 | exec | cg.cpp:128-131 | cg_calc_p(int, int, int, double, double*, double const*) [clone .omp_outlined] | Innermost | 55.12 | 54.41 | 20.49 | 1.00 | 1.87 | 1.00 | 1.17 | 1 | 83.33 | 70.83 | 24.85 | 1.17 | 1.17 | 0.63 | 1.17 | 1.00 |
| ○Loop 17 | exec | cg.cpp:85-85 | cg_calc_w(int, int, int, double*, double const*, double*, double const*, double const*) [clone .omp_outlined] | Outermost | 0.19 | 0.13 | 0.05 | 1.00 | 1.00 | 2.81 | 2.38 | 1 | 0.00 | 31.94 | 46.36 | 4.75 | 4.75 | 4.75 | 1.69 | 2.00 |
| ○Loop 70 | exec | local_halos.cpp:13-15 | update_left(int, int, int, int, double*, bool) [clone .omp_outlined] | Innermost | 0.14 | 0.09 | 0.03 | 1.00 | 1.00 | 4.00 | 1.00 | 1 | 0.00 | 25.00 | 0.00 | 1.00 | 1.00 | 1.00 | 0.25 | 1.00 |
| ○Loop 22 | exec | cg.cpp:127-128,cg.cpp:131-131 | cg_calc_p(int, int, int, double, double*, double const*) [clone .omp_outlined] | Outermost | 0.10 | 0.06 | 0.02 | 5.29 | 1.00 | 2.45 | 5.29 | 5 | 0.00 | 27.02 | 55.36 | 9.25 | 1.75 | 9.25 | 3.78 | 1.75 |
| ○Loop 13 | exec | cg.cpp:62-68 | cg_init(int, int, int, int, double, double, double*, double const*, double const*, double*, double*, double*, double*, double*, double*) [clone .omp_outlined.6] | Innermost | 0.05 | 0.05 | 0.02 | 1.00 | 1.00 | 3.73 | 1.56 | 1 | 3.70 | 25.93 | 57.96 | 4.67 | 4.67 | 4.67 | 1.25 | 3.00 |
| ○Loop 72 | exec | local_halos.cpp:28-30 | update_right(int, int, int, int, double*, bool) [clone .omp_outlined] | Innermost | 0.07 | 0.04 | 0.01 | 1.00 | 1.00 | 4.00 | 1.00 | 1 | 0.00 | 25.00 | 0.00 | 1.00 | 1.00 | 1.00 | 0.25 | 1.00 |
| ○Loop 0 | exec | cg.cpp:21-25 | cg_init(int, int, int, int, double, double, double*, double const*, double const*, double*, double*, double*, double*, double*, double*) [clone .omp_outlined] | Innermost | 0.09 | 0.02 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 4.01 | NA | NA | NA | NA | NA |
| ○Loop 125 | exec | solver_methods.cpp:69-72 | calculate_residual(int, int, int, double const*, double const*, double*, double const*, double const*) [clone .omp_outlined] | Innermost | 0.04 | 0.01 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 69.67 | NA | NA | NA | NA | NA |
| ○Loop 79 | exec | local_halos.cpp:60-62 | update_bottom(int, int, int, int, double*, bool) [clone .omp_outlined] | Single | 0.05 | 0.01 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 76 | exec | local_halos.cpp:44-46 | update_top(int, int, int, int, double*, bool) [clone .omp_outlined] | Single | 0.04 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 11 | exec | cg.cpp:47-50 | cg_init(int, int, int, int, double, double, double*, double const*, double const*, double*, double*, double*, double*, double*, double*) [clone .omp_outlined.4] | Innermost | 0.04 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 50.10 | NA | NA | NA | NA | NA |
| ○Loop 63 | exec | kernel_initialise.cpp:51-56,kernel_initialise.cpp:67-70 | run_set_chunk_state(Chunk*, Settings&, State*) | Innermost | 0.08 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 122.66 | NA | NA | NA | NA | NA |
| ○Loop 132 | exec | solver_methods.cpp:28-35 | run_field_summary(Chunk*, Settings&, double*, double*, double*, double*) | Innermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 411.41 | NA | NA | NA | NA | NA |
| ○Loop 131 | exec | solver_methods.cpp:103-105 | finalise(int, int, int, double*, double const*, double*) [clone .omp_outlined] | Innermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
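
The speedup columns above appear to be the ratio of the baseline "CQA cycles" value to the matching "CQA cycles if ..." value. This relationship is inferred from the numbers in the table, not taken from the tool's documentation, so the following is a minimal sketch under that assumption. The helper name `what_if_speedup` is hypothetical, and the sample rows are copied from the table.

```python
def what_if_speedup(cqa_cycles: float, what_if_cycles: float) -> float:
    """Estimate a what-if speedup as baseline cycles / scenario cycles.

    Assumption: the report derives each "Speedup if ..." column this way.
    """
    if what_if_cycles <= 0:
        raise ValueError("what_if_cycles must be positive")
    return cqa_cycles / what_if_cycles


# (loop id, CQA cycles, CQA cycles if fully vectorized, reported speedup)
rows = [
    ("Loop 19", 3.17, 0.79, 4.00),
    ("Loop 16", 3.67, 1.00, 3.67),
    ("Loop 23", 1.17, 1.17, 1.00),
    ("Loop 17", 4.75, 1.69, 2.81),
]

for loop_id, cycles, cycles_vectorized, reported in rows:
    estimate = what_if_speedup(cycles, cycles_vectorized)
    # Small differences are expected because the table values are rounded.
    print(f"{loop_id}: estimated {estimate:.2f}, reported {reported:.2f}")
```

Because the table rounds to two decimals, an estimate recomputed this way can differ from the reported figure in the last digit (for example, Loop 23's "FP arith vectorized" entry).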