| ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ○Loop 106 | exec | ljForce.c:191-191,ljForce.c:197-216 | ljForce.omp_outlined.2 | Innermost | 13.33 | 11.10 | 55.26 | 1.35 | 1.33 | 3.48 | 1.35 | 1 | 15.00 | 28.75 | 377.81 | 6.75 | 5.00 | 5.08 | 1.94 | 5.00 |
| ○Loop 99 | exec | mytype.h:23-23,ljForce.c:158-161 | ljForce.omp_outlined | Single | 0.28 | 0.25 | 1.24 | 1.00 | 1.00 | 5.00 | 2.50 | 1 | 0.00 | 27.50 | 0.00 | 2.50 | 2.50 | 2.50 | 0.50 | 1.00 |
| ○Loop 113 | exec | timestep.c:74-78 | advanceVelocity.omp_outlined | Innermost | 0.28 | 0.20 | 1.00 | 1.00 | 1.57 | 1.38 | 2.75 | 1 | 73.33 | 53.33 | 22.51 | 2.75 | 2.75 | 1.75 | 2.00 | 1.00 |
| ○Loop 115 | exec | timestep.c:88-94 | advancePosition.omp_outlined | Innermost | 0.21 | 0.18 | 0.88 | 1.00 | 1.08 | 4.00 - 2.57 | 2.57 - 1.29 | 1 | 5.00 | 25.63 | 22.71 | 4.50 | 4.50 | 4.17 | 1.13 - 1.75 | 1.75 - 3.50 |
| ○Loop 69 | exec | haloExchange.c:621-630 | sortAtomsInCell | Single | 0.22 | 0.17 | 0.87 | 1.33 | 1.00 | 4.41 | 5.33 | 1 | 0.00 | 22.06 | 0.00 | 5.33 | 4.00 | 5.33 | 1.21 | 1.00 |
| ○Loop 105 | exec | ljForce.c:187-187 | ljForce.omp_outlined.2 | InBetween | 0.19 | 0.11 | 0.56 | 1.00 | 1.00 | 5.00 | 2.50 | 1 | 0.00 | 20.83 | 390.66 | 1.25 | 1.25 | 1.25 | 0.25 | 0.50 |
| ○Loop 94 | exec | linkCells.c:209-212,linkCells.c:215-218,linkCells.c:221-224,linkCells.c:227-229,linkCells.c:232-234,linkCells.c:237-239,linkCells.c:244-247,linkCells.c:258-269,linkCells.c:295-298,linkCells.c:301-301,linkCells.c:327-334,linkCells.c:352-354,linkCells.c:359-361,linkCells.c:364-365,linkCells.c:371-371 | updateLinkCells | InBetween | 4.20 | 0.07 | 0.34 | 3.20 | 1.57 | 4.44 | 13.07 | 1 | 15.84 | 22.28 | 81.66 | 24.50 | 7.67 | 15.63 | 5.52 | 1.88 |
| ○Loop 104 | exec | ljForce.c:178-184,ljForce.c:187-187 | ljForce.omp_outlined.2 | InBetween | 0.08 | 0.03 | 0.17 | 1.00 | 1.00 | 6.40 | 2.00 | 1 | 0.00 | 18.75 | 308.73 | 2.00 | 2.00 | 2.00 | 0.31 | 1.00 |
| ○Loop 49 | exec | haloExchange.c:380-389 | loadAtomsBuffer | Innermost | 0.55 | 0.01 | 0.04 | 1.42 | 1.00 | 4.69 | 5.67 | 1 | 0.00 | 22.28 | 13.70 | 5.67 | 4.00 | 5.67 | 1.21 | 1.00 |
| ○Loop 70 | exec | haloExchange.c:633-642 | sortAtomsInCell | Single | 0.03 | 0.01 | 0.04 | 1.33 | 1.00 | 4.41 | 5.33 | 1 | 0.00 | 22.06 | 0.00 | 5.33 | 4.00 | 5.33 | 1.21 | 1.00 |
| ○Loop 112 | exec | timestep.c:74-78 | advanceVelocity.omp_outlined | Innermost | 0.03 | 0.01 | 0.04 | 1.00 | 1.00 | 3.68 | 3.83 | 1 | 7.14 | 26.79 | 17.13 | 3.83 | 3.83 | 3.83 | 1.04 | 1.00 |
| ○Loop 74 | exec | initAtoms.c:126-133 | setVcm.omp_outlined | Innermost | 0.01 | 0.01 | 0.03 | 1.00 | 1.11 | 4.00 | 3.50 | 1 | 7.69 | 25.96 | 2.96 | 3.50 | 3.50 | 3.17 | 0.88 | 1.00 |
| ○Loop 118 | exec | timestep.c:110-116 | kineticEnergy.omp_outlined | Innermost | 0.02 | 0.00 | 0.02 | 1.00 - 1.75 | 2.00 | 2.28 - 2.00 | 1.00 | 1 | 10.00 | 26.25 | 54.96 | 2.00 - 3.50 | 2.00 | 1.50 - 1.75 | 0.88 - 1.75 | 2.00 - 3.50 |
| ○Loop 71 | exec | haloExchange.c:633-642 | sortAtomsInCell | Single | 0.02 | 0.00 | 0.02 | 1.00 | 1.00 | 2.14 | 10.17 | 1 | 72.97 | 47.30 | 0.00 | 10.17 | 10.17 | 10.17 | 4.75 | 1.00 |
| ○Loop 87 | exec | initAtoms.c:197-202 | randomDisplacements.omp_outlined | Innermost | 0.01 | 0.00 | 0.02 | 1.00 | 2.05 | 5.59 | 1.00 | 1 | 0.00 | 23.81 | 7.98 | 12.00 | 12.00 | 5.86 | 2.15 | 12.00 |
| ○Loop 103 | exec | ljForce.c:173-175 | ljForce.omp_outlined.2 | InBetween | 0.02 | 0.00 | 0.01 | 1.00 | 1.00 | 1.33 | 4.00 | 1 | 0.00 | 31.25 | 94.65 | 2.00 | 2.00 | 2.00 | 1.50 | 0.50 |
| ○Loop 82 | exec | initAtoms.c:154-162 | setTemperature.omp_outlined | Innermost | 0.01 | 0.00 | 0.01 | 3.23 - 3.34 | 2.00 | 4.00 - 2.00 | 2.00 - 1.00 | 1 | 0.00 | 23.61 | 43.69 | 7.00 - 7.24 | 2.17 | 3.50 - 3.62 | 1.75 - 3.62 | 3.50 - 7.24 |
| ○Loop 111 | exec | timestep.c:72-74 | advanceVelocity.omp_outlined | Outermost | 0.01 | 0.00 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 29.00 | NA | NA | NA | NA | NA |
| ○Loop 109 | exec | random.c:26-29,random.c:45-48 | gasdev | Single | 0.01 | 0.00 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 63.06 | NA | NA | NA | NA | NA |
| ○Loop 114 | exec | timestep.c:86-88 | advancePosition.omp_outlined | Outermost | 0.01 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 19.02 | NA | NA | NA | NA | NA |
| ○Loop 119 | exec | timestep.c:153-154 | redistributeAtoms.omp_outlined | Single | 0.01 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 72 | exec | initAtoms.c:88-100 | createFccLattice | Single | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 18.67 | NA | NA | NA | NA | NA |
| ○Loop 48 | exec | haloExchange.c:376-380 | loadAtomsBuffer | Innermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 46.68 | NA | NA | NA | NA | NA |
| ○Loop 92 | exec | linkCells.c:291-295 | updateLinkCells | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 116.69 | NA | NA | NA | NA | NA |
| ○Loop 51 | exec | haloExchange.c:414-424 | unloadAtomsBuffer | Single | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 42.79 | NA | NA | NA | NA | NA |
| ○Loop 91 | exec | linkCells.c:150-153,linkCells.c:209-218,linkCells.c:246-247 | getNeighborBoxes | Innermost | 0.01 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 117 | exec | timestep.c:108-110 | kineticEnergy.omp_outlined | Outermost | 0.00 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 90 | exec | linkCells.c:221-224,linkCells.c:227-229,linkCells.c:232-234,linkCells.c:237-239,linkCells.c:244-246 | getNeighborBoxes | Outermost | 0.00 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |