ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
○Loop 88 | exec | ljForce.c:191-191,ljForce.c:197-216 | ljForce.extracted | Innermost | 16.93 | 11.11 | 42.24 | 1.00 | 2.59 | 5.18 | 1.00 | 3 | 35.93 | 16.99 | 181.75 | 5.17 | 5.17 | 1.99 | 1.00 | 5.17 |
○Loop 57 | exec | haloExchange.c:621-629 | sortAtomsInCell | Single | 0.89 | 0.68 | 2.6 | 1.60 | 1.00 | 5.55 | 2.67 | 1 | 50.00 | 22.92 | 0.00 | 2.67 | 1.67 | 2.67 | 0.48 | 1.00 |
○Loop 95 | exec | timestep.c:74-78 | advanceVelocity.extracted | Innermost | 0.93 | 0.62 | 2.36 | 1.00 | 1.07 | 2.09 | 5.11 | 1 | 94.44 | 43.75 | 5.84 | 15.33 | 15.33 | 14.33 | 7.33 | 3.00 |
○Loop 90 | exec | mytype.h:23-23,ljForce.c:157-158 | ljForce.extracted.27 | Single | 0.67 | 0.49 | 1.85 | 2.00 | 1.00 | 6.40 | 8.00 | 1 | 50.00 | 15.63 | 0.00 | 8.00 | 4.00 | 8.00 | 1.25 | 1.00 |
○Loop 99 | exec | timestep.c:88-94 | advancePosition.extracted | Innermost | 0.52 | 0.38 | 1.46 | 1.05 | 1.09 | 1.27 | 1.27 | 1 | 95.12 | 43.29 | 5.07 | 10.17 | 9.67 | 9.33 | 8.00 | 8.00 |
○Loop 87 | exec | ljForce.c:187-187,ljForce.c:197-197 | ljForce.extracted | InBetween | 0.17 | 0.1 | 0.38 | 1.00 | 1.00 | 8.00 | 3.33 | 3 | 0.00 | 12.50 | 291.11 | 1.67 | 1.67 | 1.67 | 0.21 | 0.50 |
○Loop 94 | exec | timestep.c:74-78 | advanceVelocity.extracted | Innermost | 0.24 | 0.09 | 0.32 | 1.11 | 1.25 | 6.00 | 1.67 | 1 | 50.00 | 18.75 | 10.99 | 1.67 | 1.50 | 1.33 | 0.28 | 1.00 |
○Loop 82 | exec | linkCells.c:209-221,linkCells.c:227-247,linkCells.c:258-269,linkCells.c:295-301,linkCells.c:327-334,linkCells.c:352-353,linkCells.c:359-365,linkCells.c:371-371 | updateLinkCells | Innermost | 2.92 | 0.05 | 0.2 | 2.65 | 2.55 | 11.22 | 3.35 | 70 | 34.43 | 14.45 | 47.42 | 31.83 | 12.00 | 12.49 | 2.84 | 9.50 |
○Loop 86 | exec | ljForce.c:178-184,ljForce.c:187-187,ljForce.c:191-191,ljForce.c:197-197 | ljForce.extracted | InBetween | 0.09 | 0.04 | 0.17 | 1.00 | 1.00 | 13.89 | 1.89 | 4 | 0.00 | 10.16 | 348.50 | 3.54 | 3.54 | 3.54 | 0.25 | 1.88 |
○Loop 105 | exec | timestep.c:110-116 | kineticEnergy.extracted | Innermost | 0.05 | 0.03 | 0.11 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 45.65 | 10.30 | 8.00 | 8.00 | 8.00 | 8.00 | 8.00 |
○Loop 98 | exec | timestep.c:88-94 | advancePosition.extracted | Innermost | 0.05 | 0.02 | 0.06 | 1.00 | 2.00 | 2.00 | 1.00 | 1 | 37.50 | 17.19 | 8.78 | 4.00 | 4.00 | 2.00 | 2.00 | 4.00 |
○Loop 93 | exec | timestep.c:71-78 | advanceVelocity.extracted | Outermost | 0.05 | 0.01 | 0.03 | 1.00 | 1.00 | 15.16 | 2.25 | 8 | 0.00 | 8.93 | 46.45 | 9.00 | 9.00 | 9.00 | 0.59 | 4.00 |
○Loop 45 | exec | haloExchange.c:380-389 | loadAtomsBuffer | Innermost | 0.36 | 0.01 | 0.02 | 1.58 | 1.06 | 7.24 | 3.17 | 1 | 30.77 | 13.94 | 13.10 | 3.17 | 2.00 | 3.00 | 0.44 | 1.00 |
○Loop 58 | exec | haloExchange.c:633-642 | sortAtomsInCell | Single | 0.05 | 0.01 | 0.02 | 1.50 | 1.00 | 6.86 | 3.00 | 1 | 33.33 | 14.58 | 0.00 | 3.00 | 2.00 | 3.00 | 0.44 | 1.00 |
○Loop 97 | exec | timestep.c:85-94 | advancePosition.extracted | Outermost | 0.03 | 0.01 | 0.02 | 1.00 | 1.00 | 15.09 | 3.14 | 8 | 0.00 | 10.71 | 30.55 | 11.00 | 11.00 | 11.00 | 0.73 | 3.50 |
○Loop 70 | exec | initAtoms.c:177-181 | setTemperature.extracted | Innermost | 0.01 | 0 | 0.01 | 1.00 | 1.07 | 2.03 | 3.56 | 1 | 95.83 | 46.35 | NA | 10.67 | 10.67 | 10.00 | 5.25 | 3.00 |
○Loop 85 | exec | ljForce.c:172-175,ljForce.c:180-182,ljForce.c:187-187 | ljForce.extracted | Outermost | 0.02 | 0 | 0.01 | 1.00 | 1.00 | 15.70 | 1.07 | 2 | 0.00 | 7.70 | NA | 8.83 | 8.83 | 8.83 | 0.56 | 8.25 |
○Loop 67 | exec | initAtoms.c:221-228 | computeVcm.extracted | Innermost | 0.01 | 0 | 0.01 | 1.04 | 1.19 | 2.58 | 2.61 | 1 | 88.00 | 36.50 | NA | 7.83 | 7.50 | 6.58 | 3.04 | 3.00 |
○Loop 104 | exec | timestep.c:110-116 | kineticEnergy.extracted | Innermost | 0.01 | 0 | 0.01 | 1.00 | 2.00 | 2.00 | 1.00 | 1 | 0.00 | 12.50 | NA | 4.00 | 4.00 | 2.00 | 2.00 | 4.00 |
○Loop 74 | exec | initAtoms.c:197-202 | randomDisplacements.extracted | Innermost | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 102 | exec | timestep.c:152-154 | redistributeAtoms.extracted | Single | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 64 | exec | initAtoms.c:126-133 | setVcm.extracted | Innermost | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 91 | exec | random.c:27-29,random.c:45-48 | gasdev | Single | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 81 | exec | linkCells.c:229-229,linkCells.c:291-297,linkCells.c:329-329,linkCells.c:353-353,linkCells.c:359-359,linkCells.c:365-365 | updateLinkCells | Outermost | 0.06 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 46 | exec | haloExchange.c:414-424 | unloadAtomsBuffer | Single | 0.05 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 72 | exec | initAtoms.c:154-162 | setTemperature.extracted.30 | Innermost | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 103 | exec | timestep.c:107-114 | kineticEnergy.extracted | Outermost | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 44 | exec | haloExchange.c:376-384,haloExchange.c:387-387 | loadAtomsBuffer | Outermost | 0.02 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 69 | exec | initAtoms.c:177-181 | setTemperature.extracted | Innermost | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 80 | exec | linkCells.c:151-153,linkCells.c:209-215,linkCells.c:221-221,linkCells.c:227-237,linkCells.c:246-247 | getNeighborBoxes | Innermost | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 66 | exec | initAtoms.c:221-228 | computeVcm.extracted | Innermost | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 68 | exec | initAtoms.c:174-179 | setTemperature.extracted | Outermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 61 | exec | initAtoms.c:90-100 | createFccLattice | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 65 | exec | initAtoms.c:218-228 | computeVcm.extracted | Outermost | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 79 | exec | linkCells.c:150-150,linkCells.c:215-215,linkCells.c:229-229 | getNeighborBoxes | Outermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 63 | exec | initAtoms.c:126-133 | setVcm.extracted | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 73 | exec | initAtoms.c:194-199 | randomDisplacements.extracted | Outermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 71 | exec | initAtoms.c:151-157 | setTemperature.extracted.30 | Outermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |