ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Loop 99 | exec | ljForce.c:191-191,ljForce.c:197-216 | .omp_outlined..5#0x408b40 | Innermost | 41.91 | 41.91 | 95.08 | 1.28 | 1.28 | 3.57 | 1.34 | 2 | 30.30 | 32.58 | 4.69 | 3.67 | 3.67 | 1.31 | 3.50 |
Loop 98 | exec | ljForce.c:187-187 | .omp_outlined..5#0x408b40 | Outermost | 0.48 | 0.48 | 1.1 | 1.00 | 1.00 | 4.00 | 1.50 | 2 | 0.00 | 25.00 | 1.50 | 1.50 | 1.50 | 0.38 | 1.00 |
Loop 33 | exec | timestep.c:74-76 | .omp_outlined.#0x40b270 | Single | 0.21 | 0.21 | 0.48 | 1.00 | 1.00 | 1.23 | 2.67 | 1 | 27.27 | 65.91 | 4.00 | 4.00 | 4.00 | 3.25 | 1.50 |
Loop 81 | exec | haloExchange.c:621-630 | sortAtomsInCell | Single | 0.16 | 0.16 | 0.37 | 1.33 | 1.00 | 4.00 | 5.33 | 1 | 0.00 | 25.00 | 5.33 | 4.00 | 5.33 | 1.33 | 1.00 |
Loop 139 | exec | timestep.c:88-94 | .omp_outlined..2 | Innermost | 0.13 | 0.13 | 0.31 | 1.08 | 1.00 | 4.00 | 2.57 | 1 | 7.69 | 25.96 | 4.50 | 4.17 | 4.50 | 1.13 | 1.75 |
Loop 36 | exec | ljForce.c:158-161,mytype.h:23-23 | .omp_outlined.#0x408a30 | Single | 0.13 | 0.13 | 0.29 | 1.00 | 1.00 | 8.00 | 2.50 | 1 | 0.00 | 18.75 | 2.50 | 2.50 | 2.50 | 0.31 | 1.00 |
Loop 67 | exec | haloExchange.c:380-389 | loadAtomsBuffer | Innermost | 0.08 | 0.08 | 0.19 | 1.42 | 1.00 | 4.00 | 5.67 | 1 | 0.00 | 25.00 | 5.67 | 4.00 | 5.67 | 1.42 | 1.00 |
Loop 82 | exec | haloExchange.c:633-642 | sortAtomsInCell | Single | 0.08 | 0.08 | 0.18 | 1.33 | 1.00 | 4.00 | 5.33 | 1 | 0.00 | 25.00 | 5.33 | 4.00 | 5.33 | 1.33 | 1.00 |
Loop 35 | exec | timestep.c:74-78 | .omp_outlined.#0x40b270 | Innermost | 0.02 | 0.02 | 0.06 | 1.00 | 1.00 | 3.68 | 3.83 | 1 | 8.33 | 26.04 | 3.83 | 3.83 | 3.83 | 1.04 | 1.00 |
Loop 135 | exec | random.c:26-29,random.c:45-48 | gasdev | Single | 0.01 | 0.01 | 0.02 | 1.00 | 1.00 | 4.00 | 1.00 | 1 | 0.00 | 25.00 | 24.00 | 24.00 | 24.00 | 6.00 | 24.00 |
Loop 116 | exec | parallel.c:107-107 | sendReceiveParallel | Single | 0.01 | 0.01 | 0.02 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 100.00 | 5.00 | 5.00 | 5.00 | 5.00 | 5.00 |
Loop 41 | exec | timestep.c:153-154 | .omp_outlined..6#0x40b860 | Single | 0.01 | 0.01 | 0.02 | 1.00 | 1.00 | 8.00 | 1.00 | 1 | 0.00 | 12.50 | 1.00 | 1.00 | 1.00 | 0.13 | 1.00 |
Loop 34 | exec | timestep.c:71-74 | .omp_outlined.#0x40b270 | Outermost | 0.01 | 0.01 | 0.02 | 13.50 | 1.00 | 1.93 | 20.25 | 1 | 0.00 | 22.92 | 13.50 | 1.00 | 13.50 | 7.00 | 0.67 |
Loop 142 | exec | timestep.c:110-116 | .omp_outlined..4 | Innermost | 0 | 0 | 0.01 | 1.00 | 1.00 | 4.00 - 4.36 | 1.00 | 1 | 20.00 | 27.50 | 2.00 | 2.00 | 2.00 | 0.50 - 0.46 | 2.00 |
Loop 105 | exec | initAtoms.c:177-179 | .omp_outlined..7 | Innermost | 0 | 0 | 0.01 | 1.00 | 1.00 | 1.23 | 1.33 | 1 | 42.86 | 57.14 | 2.00 | 2.00 | 2.00 | 1.63 | 1.50 |
Loop 85 | exec | initAtoms.c:46-46 | initAtoms | Single | 0 | 0 | 0.01 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 100.00 | 5.00 | 5.00 | 5.00 | 5.00 | 5.00 |
Loop 110 | exec | linkCells.c:291-295 | updateLinkCells | Single | 0 | 0 | 0.01 | 1.00 | 1.00 | 4.00 | 3.00 | 1 | 0.00 | 25.00 | 1.00 | 1.00 | 1.00 | 0.25 | 0.33 |
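
Judging from the figures in the table, each "Speedup if ..." column appears to be the baseline "CQA cycles" value divided by the matching "CQA cycles if ..." value, up to rounding of the displayed cycle counts. The sketch below checks that reading against the Loop 99 row. The function name `projected_speedup` and the dictionary layout are illustrative assumptions, not part of the profiler's output or API.

```python
# Values copied from the Loop 99 row of the table above.
BASE_CYCLES = 4.69  # "CQA cycles"

# "CQA cycles if ..." columns for the same row.
SCENARIO_CYCLES = {
    "no scalar integer": 3.67,
    "FP arith vectorized": 3.67,
    "fully vectorized": 1.31,
    "FP only": 3.50,
}

# "Speedup if ..." columns as reported in the table.
REPORTED_SPEEDUP = {
    "no scalar integer": 1.28,
    "FP arith vectorized": 1.28,
    "fully vectorized": 3.57,
    "FP only": 1.34,
}


def projected_speedup(base_cycles: float, scenario_cycles: float) -> float:
    """Assumed relation: speedup = baseline cycles / scenario cycles."""
    return base_cycles / scenario_cycles


if __name__ == "__main__":
    for name, cycles in SCENARIO_CYCLES.items():
        computed = projected_speedup(BASE_CYCLES, cycles)
        reported = REPORTED_SPEEDUP[name]
        # The table shows cycles rounded to two decimals, so allow a small gap.
        print(f"{name:>20}: computed {computed:.2f}, reported {reported:.2f}")
```

Since the loop's coverage is 95.08% of application time, these per-loop ratios are upper bounds on what the whole application could gain; the remaining loops each account for about 1% or less.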