ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
○Loop 92 | exec | ljForce.c:191-191,ljForce.c:197-216 | ljForce.extracted | Innermost | 12.12 | 9.12 | 43.96 | 1.00 | 2.51 | 5.18 | 1.00 | 3 | 35.93 | 16.99 | 222.82 | 5.17 | 5.17 | 2.06 | 1.00 | 5.17 |
○Loop 88 | exec | mytype.h:23-23,ljForce.c:158-162 | ljForce.extracted.25 | Single | 1.06 | 0.79 | 3.83 | 2.00 | 1.00 | 6.40 | 8.00 | 1 | 50.00 | 15.63 | 0.00 | 8.00 | 4.00 | 8.00 | 1.25 | 1.00 |
○Loop 98 | exec | timestep.c:74-78 | advanceVelocity.extracted | Innermost | 0.96 | 0.64 | 3.1 | 1.00 | 1.08 | 2.31 | 6.22 | 1 | 92.31 | 40.71 | 5.11 | 18.67 | 18.67 | 17.33 | 8.08 | 3.00 |
○Loop 58 | exec | haloExchange.c:621-629 | sortAtomsInCell | Single | 0.83 | 0.6 | 2.91 | 1.60 | 1.00 | 5.55 | 2.67 | 1 | 50.00 | 22.92 | 0.00 | 2.67 | 1.67 | 2.67 | 0.48 | 1.00 |
○Loop 102 | exec | timestep.c:88-94 | advancePosition.extracted | Innermost | 0.55 | 0.39 | 1.88 | 1.06 | 1.11 | 1.42 | 1.42 | 1 | 84.44 | 38.06 | 6.42 | 11.33 | 10.67 | 10.17 | 8.00 | 8.00 |
○Loop 97 | exec | timestep.c:74-78 | advanceVelocity.extracted | Innermost | 0.17 | 0.08 | 0.4 | 1.11 | 1.25 | 6.00 | 1.67 | 1 | 50.00 | 18.75 | 11.57 | 1.67 | 1.50 | 1.33 | 0.28 | 1.00 |
○Loop 91 | exec | ljForce.c:187-187,ljForce.c:191-191,ljForce.c:197-197 | ljForce.extracted | InBetween | 0.14 | 0.08 | 0.39 | 1.00 | 1.00 | 8.00 | 3.00 | 3 | 0.00 | 12.50 | 414.24 | 2.00 | 2.00 | 2.00 | 0.25 | 0.67 |
○Loop 84 | exec | linkCells.c:295-301,linkCells.c:352-365,linkCells.c:371-371,linkCells.c:378-378 | updateLinkCells | Innermost | 2.14 | 0.04 | 0.18 | 1.19 | 1.45 | 9.54 | 2.83 | 24 | 0.00 | 10.99 | 35.99 | 11.33 | 9.50 | 7.81 | 1.19 | 4.00 |
○Loop 108 | exec | timestep.c:110-116 | kineticEnergy.extracted | Innermost | 0.06 | 0.03 | 0.15 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 84.00 | 40.00 | 8.87 | 8.00 | 8.00 | 8.00 | 8.00 | 8.00 |
○Loop 90 | exec | ljForce.c:178-184,ljForce.c:187-187,ljForce.c:191-191,ljForce.c:197-197 | ljForce.extracted | InBetween | 0.06 | 0.02 | 0.11 | 1.00 | 1.00 | 14.08 | 1.83 | 5 | 0.00 | 9.38 | 440.16 | 3.67 | 3.67 | 3.67 | 0.26 | 2.00 |
○Loop 101 | exec | timestep.c:88-94 | advancePosition.extracted | Innermost | 0.05 | 0.02 | 0.11 | 1.00 | 2.00 | 2.00 | 1.00 | 1 | 37.50 | 17.19 | 13.30 | 4.00 | 4.00 | 2.00 | 2.00 | 4.00 |
○Loop 96 | exec | timestep.c:71-80 | advanceVelocity.extracted | Outermost | 0.04 | 0.01 | 0.04 | 2.67 | 1.00 | 14.93 | 2.67 | 8 | 0.00 | 10.16 | 44.20 | 9.33 | 3.50 | 9.33 | 0.63 | 3.50 |
○Loop 48 | exec | haloExchange.c:380-389 | loadAtomsBuffer | Innermost | 0.46 | 0.01 | 0.04 | 1.58 | 1.06 | 7.24 | 3.17 | 1 | 30.77 | 13.94 | 14.55 | 3.17 | 2.00 | 3.00 | 0.44 | 1.00 |
○Loop 100 | exec | timestep.c:85-96 | advancePosition.extracted | Outermost | 0.03 | 0.01 | 0.03 | 1.00 | 1.00 | 14.62 | 3.52 | 8 | 0.00 | 11.11 | 25.60 | 12.33 | 12.33 | 12.33 | 0.84 | 3.50 |
○Loop 59 | exec | haloExchange.c:633-642 | sortAtomsInCell | Single | 0.02 | 0.01 | 0.03 | 1.50 | 1.00 | 6.86 | 3.00 | 1 | 33.33 | 14.58 | 0.00 | 3.00 | 2.00 | 3.00 | 0.44 | 1.00 |
○Loop 79 | exec | random.c:45-48,random.c:68-70,initAtoms.c:197-202 | randomDisplacements.extracted | Innermost | 0.01 | 0 | 0.02 | 9.45 | 1.45 | 8.00 | 18.90 | 1 | 2.78 | 12.85 | NA | 37.80 | 4.00 | 26.07 | 4.72 | 2.00 |
○Loop 107 | exec | timestep.c:110-116 | kineticEnergy.extracted | Innermost | 0.01 | 0 | 0.01 | 1.00 | 2.00 | 2.00 | 1.00 | 1 | 0.00 | 12.50 | NA | 4.00 | 4.00 | 2.00 | 2.00 | 4.00 |
○Loop 69 | exec | initAtoms.c:126-133 | setVcm.extracted | Innermost | 0.01 | 0 | 0.01 | 1.08 | 1.12 | 2.67 | 5.78 | 1 | 81.82 | 37.12 | NA | 8.67 | 8.00 | 7.75 | 3.25 | 1.50 |
○Loop 89 | exec | ljForce.c:173-175,ljForce.c:178-180,ljForce.c:187-187,ljForce.c:213-213,ljForce.c:222-222 | ljForce.extracted | Outermost | 0.01 | 0 | 0.01 | 1.00 | 1.00 | 14.48 - 14.48 | 1.06 - 1.00 | 1 | 0.00 | 8.75 | NA | 3.17 - 5.00 | 3.17 - 5.00 | 3.17 - 5.00 | 0.22 - 0.35 | 3.00 - 5.00 |
○Loop 77 | exec | initAtoms.c:177-181 | setTemperature.extracted | Innermost | 0 | 0 | 0.01 | 1.00 | 1.09 | 2.37 | 4.67 | 1 | 92.59 | 41.67 | NA | 14.00 | 14.00 | 12.83 | 5.92 | 3.00 |
○Loop 49 | exec | haloExchange.c:414-424 | unloadAtomsBuffer | Single | 0.09 | 0 | 0.01 | 1.33 | 1.00 | 9.14 | 2.67 | 1 | 0.00 | 11.25 | NA | 2.67 | 2.00 | 2.67 | 0.29 | 1.00 |
○Loop 105 | exec | timestep.c:153-154 | redistributeAtoms.extracted | Single | 0.01 | 0 | 0.01 | 1.00 | 1.00 | 10.50 | 7.00 | 1 | 0.00 | 12.13 | NA | 7.00 | 7.00 | 7.00 | 0.67 | 1.00 |
○Loop 106 | exec | timestep.c:107-114,timestep.c:118-118 | kineticEnergy.extracted | Outermost | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 83 | exec | linkCells.c:291-295,linkCells.c:353-354,linkCells.c:359-359,linkCells.c:365-365,linkCells.c:371-371 | updateLinkCells | Outermost | 0.04 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 71 | exec | random.c:27-29,random.c:45-48 | setTemperature.extracted.30 | Innermost | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 74 | exec | random.c:29-31,random.c:46-46,random.c:70-70,initAtoms.c:154-162 | setTemperature.extracted.30 | InBetween | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 68 | exec | initAtoms.c:126-133 | setVcm.extracted | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 76 | exec | initAtoms.c:177-181 | setTemperature.extracted | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 78 | exec | initAtoms.c:195-200,initAtoms.c:204-204 | randomDisplacements.extracted | Outermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 47 | exec | haloExchange.c:376-384,haloExchange.c:387-387 | loadAtomsBuffer | Outermost | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 63 | exec | initAtoms.c:91-100 | createFccLattice | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 72 | exec | random.c:27-29,random.c:45-48 | setTemperature.extracted.30 | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 75 | exec | initAtoms.c:174-179,initAtoms.c:183-183 | setTemperature.extracted | Outermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 66 | exec | initAtoms.c:218-228 | computeVcm.extracted | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 67 | exec | initAtoms.c:123-135 | setVcm.extracted | Outermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 73 | exec | random.c:27-29,random.c:45-48 | setTemperature.extracted.30 | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 70 | exec | random.c:27-27,initAtoms.c:152-160,initAtoms.c:164-164 | setTemperature.extracted.30 | Outermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |