| ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ○Loop 95 | exec | ljForce.c:191-191,ljForce.c:197-216 | ljForce._omp_fn.1 | Innermost | 11.26 | 12.24 | 83.60 | 2.02 | 1.98 | 3.86 | 1.20 | 9 | 11.59 | 27.90 | 314.08 | 13.50 | 6.67 | 6.83 | 3.50 | 11.25 |
| ○Loop 94 | exec | ljForce.c:187-187,ljForce.c:191-191,ljForce.c:197-198,ljForce.c:201-201,ljForce.c:206-210,ljForce.c:213-216 | ljForce._omp_fn.1 | InBetween | 0.92 | 0.96 | 6.54 | 2.04 | 1.99 | 4.07 | 1.30 | 40 | 10.26 | 27.24 | 289.65 | 7.13 | 3.50 | 3.58 | 1.75 | 5.50 |
| ○Loop 90 | exec | mytype.h:22-24,ljForce.c:161-161 | ljForce._omp_fn.0 | Single | 0.30 | 0.29 | 1.95 | 1.00 | 1.00 | 1.50 | 12.00 | 1 | 100.00 | 66.67 | 0.00 | 12.00 | 12.00 | 12.00 | 8.00 | 1.00 |
| ○Loop 106 | exec | timestep.c:74-76 | advanceVelocity._omp_fn.0 | Innermost | 0.25 | 0.22 | 1.51 | 1.00 | 1.33 | 1.33 | 2.67 | 1 | 100.00 | 66.67 | 17.24 | 8.00 | 8.00 | 6.00 | 6.00 | 3.00 |
| ○Loop 103 | exec | timestep.c:88-94 | advancePosition._omp_fn.0 | Innermost | 0.18 | 0.18 | 1.20 | 1.10 | 1.00 | 4.19 - 2.10 | 2.09 - 1.05 | 1 | 0.00 | 24.31 | 22.35 | 7.33 | 6.67 | 7.33 | 1.75 - 3.50 | 3.50 - 6.99 |
| ○Loop 63 | exec | haloExchange.c:623-628 | sortAtomsInCell | Single | 0.17 | 0.14 | 0.97 | 1.00 | 1.00 | 2.80 | 7.00 | 1 | 30.00 | 35.00 | 0.00 | 14.00 | 14.00 | 14.00 | 5.00 | 2.00 |
| ○Loop 104 | exec | timestep.c:71-71,timestep.c:74-78 | advanceVelocity._omp_fn.0 | Outermost | 0.12 | 0.10 | 0.70 | 1.18 | 1.32 | 1.67 | 3.74 | 22 | 29.81 | 35.70 | 15.76 | 19.63 | 16.67 | 14.88 | 11.78 | 5.25 |
| ○Loop 86 | exec | linkCells.c:211-247,linkCells.c:295-301,linkCells.c:352-373 | updateLinkCells | Innermost | 3.20 | 0.07 | 0.46 | 4.38 | 1.39 | 7.13 | 4.38 | 56 | 6.94 | 18.06 | 74.87 | 17.50 | 4.00 | 12.63 | 2.45 | 4.00 |
| ○Loop 93 | exec | ljForce.c:178-184,ljForce.c:187-187,ljForce.c:191-191 | ljForce._omp_fn.1 | InBetween | 0.08 | 0.06 | 0.41 | 1.00 | 1.00 | 5.87 | 5.50 | 41 | 0.00 | 18.06 | 240.17 | 2.75 | 2.75 | 2.75 | 0.47 | 0.50 |
| ○Loop 61 | exec | haloExchange.c:633-642 | sortAtomsInCell | Single | 0.04 | 0.02 | 0.15 | 1.33 | 1.00 | 4.57 | 10.67 | 1 | 0.00 | 21.88 | 0.00 | 10.67 | 8.00 | 10.67 | 2.33 | 1.00 |
| ○Loop 102 | exec | timestep.c:88-88,timestep.c:91-94 | advancePosition._omp_fn.0 | Outermost | 0.03 | 0.01 | 0.10 | 1.38 | 1.22 | 5.07 - 3.64 | 3.00 - 1.83 | 4 | 0.00 | 22.09 | 19.42 | 4.13 | 3.00 | 3.38 | 0.81 - 1.13 | 1.38 - 2.25 |
| ○Loop 109 | exec | timestep.c:110-116 | kineticEnergy._omp_fn.0 | Innermost | 0.03 | 0.01 | 0.10 | 1.00 | 1.00 | 1.05 - 1.00 | 1.00 | 1 | 81.82 | 93.18 | 24.22 | 8.00 - 14.08 | 8.00 - 14.08 | 8.00 - 14.08 | 7.60 - 14.08 | 8.00 - 14.08 |
| ○Loop 34 | exec | haloExchange.c:380-389 | loadAtomsBuffer | Innermost | 0.50 | 0.01 | 0.07 | 1.13 | 1.13 | 4.69 | 5.67 | 1 | 0.00 | 22.37 | 9.69 | 5.67 | 5.00 | 5.00 | 1.21 | 1.00 |
| ○Loop 91 | exec | ljForce.c:175-175,ljForce.c:178-180,ljForce.c:187-187 | ljForce._omp_fn.1 | Outermost | 0.02 | 0.00 | 0.03 | 1.00 | 1.00 | 1.96 | 1.75 | 2 | 0.00 | 24.31 | 89.18 | 1.75 | 1.75 | 1.75 | 0.89 | 1.00 |
| ○Loop 101 | exec | random.c:26-29,random.c:45-48 | gasdev | Single | 0.01 | 0.00 | 0.01 | 1.00 | 1.25 | 4.00 - 2.50 | 1.00 | 1 | 0.00 | 25.00 | 27.84 | 20.00 - 50.00 | 20.00 - 50.00 | 14.55 - 40.00 | 5.00 - 20.00 | 20.00 - 50.00 |
| ○Loop 74 | exec | mytype.h:22-24,initAtoms.c:39-46 | initAtoms | Single | 0.05 | 0.00 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 65 | exec | initAtoms.c:126-133 | setVcm._omp_fn.0 | Innermost | 0.00 | 0.00 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 2.98 | NA | NA | NA | NA | NA |
| ○Loop 69 | exec | initAtoms.c:179-181 | setTemperature._omp_fn.1 | Innermost | 0.01 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 33 | exec | haloExchange.c:376-383,haloExchange.c:390-390 | loadAtomsBuffer | Outermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 29.83 | NA | NA | NA | NA | NA |
| ○Loop 85 | exec | linkCells.c:291-295 | updateLinkCells | Outermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 140.80 | NA | NA | NA | NA | NA |
| ○Loop 110 | exec | timestep.c:154-154 | redistributeAtoms._omp_fn.0 | Single | 0.00 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 81 | exec | linkCells.c:152-153,linkCells.c:209-247 | getNeighborBoxes | Innermost | 0.01 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 76 | exec | initAtoms.c:90-100 | createFccLattice | Innermost | 0.01 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 77.56 | NA | NA | NA | NA | NA |
| ○Loop 68 | exec | initAtoms.c:177-181 | setTemperature._omp_fn.1 | Outermost | 0.01 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 45 | exec | haloExchange.c:414-414,haloExchange.c:424-424 | unloadAtomsBuffer | Single | 0.01 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 143.19 | NA | NA | NA | NA | NA |
| ○Loop 108 | exec | timestep.c:110-110,timestep.c:114-114 | kineticEnergy._omp_fn.0 | Outermost | 0.00 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 92 | exec | ljForce.c:172-172 | ljForce._omp_fn.1 | Single | 0.01 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |