ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Loop 100 | exec | ljForce.c:191-191,ljForce.c:197-216 | ljForce._omp_fn.1 | Innermost | 37.57 | 37.57 | 92.4 | 2.02 | 1.00 | 3.86 | 1.20 | 1 | 38.10 | 34.52 | 13.50 | 6.67 | 13.50 | 3.50 | 11.25 |
Loop 99 | exec | ljForce.c:172-172,ljForce.c:178-178,ljForce.c:187-187,ljForce.c:191-191,ljForce.c:197-197 | ljForce._omp_fn.1 | Outermost | 0.9 | 0.9 | 2.21 | 4.87 | 1.00 | 4.95 | 6.50 | 1 | 0.00 | 22.92 | 3.25 | 0.67 | 3.25 | 0.66 | 0.50 |
Loop 126 | exec | timestep.c:74-78 | advanceVelocity._omp_fn.0 | Innermost | 0.19 | 0.2 | 0.48 | 1.00 | 1.00 | 3.89 | 3.89 | 1 | 2.86 | 25.71 | 11.67 | 11.67 | 11.67 | 3.00 | 3.00 |
Loop 66 | exec | haloExchange.c:621-630 | sortAtomsInCell | Innermost | 0.17 | 0.17 | 0.43 | 1.33 | 1.00 | 4.00 | 10.67 | 1 | 0.00 | 25.00 | 10.67 | 8.00 | 10.67 | 2.67 | 1.00 |
Loop 122 | exec | timestep.c:88-94 | advancePosition._omp_fn.0 | Innermost | 0.1 | 0.1 | 0.26 | 1.10 | 1.00 | 4.00 | 2.10 | 1 | 0.00 | 25.00 | 7.33 | 6.67 | 7.33 | 1.83 | 3.50 |
Loop 98 | exec | ljForce.c:161-161,mytype.h:22-24 | ljForce._omp_fn.0 | Innermost | 0.09 | 0.09 | 0.23 | 1.00 | NA | NA | 13.00 | 1 | NA | NA | 13.00 | 13.00 | NA | NA | 1.00 |
Loop 125 | exec | timestep.c:74-78 | advanceVelocity._omp_fn.0 | InBetween | 0.06 | 0.06 | 0.16 | 1.12 | 1.00 | 3.86 | 4.07 | 8 | 6.98 | 26.16 | 9.17 | 8.17 | 9.17 | 2.38 | 2.25 |
Loop 65 | exec | haloExchange.c:633-642 | sortAtomsInCell | Innermost | 0.06 | 0.06 | 0.16 | 1.33 | 1.00 | 4.00 | 10.67 | 1 | 0.00 | 25.00 | 10.67 | 8.00 | 10.67 | 2.67 | 1.00 |
Loop 37 | exec | haloExchange.c:380-389 | loadAtomsBuffer | Innermost | 0.05 | 0.05 | 0.12 | 1.13 | 1.00 | 4.69 | 5.67 | 1 | 0.00 | 23.21 | 5.67 | 5.00 | 5.67 | 1.21 | 1.00 |
Loop 64 | exec | haloExchange.c:617-642,haloExchange.c:645-645 | sortAtomsInCell | Outermost | 0.02 | 0.02 | 0.06 | 1.71 | 1.00 | 5.20 | 8.48 | 1 | 0.00 | 22.19 | 14.83 | 8.67 | 14.83 | 2.85 | 1.75 |
Loop 130 | exec | timestep.c:110-116 | kineticEnergy._omp_fn.0 | Innermost | 0.01 | 0.02 | 0.04 | 1.00 | 1.00 | 3.84 | 1.00 | 1 | 14.29 | 28.57 | 8.00 | 8.00 | 8.00 | 2.08 | 8.00 |
Loop 120 | exec | random.c:26-29,random.c:45-48 | gasdev | Single | 0.01 | 0.01 | 0.02 | 1.00 | 1.00 | 4.00 | 1.00 | 1 | 0.00 | 25.00 | 20.00 - 50.00 | 20.00 - 50.00 | 20.00 - 50.00 | 5.00 - 12.50 | 20.00 - 50.00 |
Loop 101 | exec | ljForce.c:175-178 | ljForce._omp_fn.1 | Innermost | 0 | 0 | 0.01 | 1.00 | 1.00 | 8.00 | 4.00 | 1 | 0.00 | 12.50 | 1.00 | 1.00 | 1.00 | 0.13 | 0.25 |
Loop 93 | exec | linkCells.c:295-301 | updateLinkCells | Innermost | 0 | 0 | 0.01 | 1.00 | 1.00 | 6.40 | 3.00 | 2 | 0.00 | 16.93 | 3.00 | 3.00 | 3.00 | 0.47 | 1.00 |
Loop 121 | exec | timestep.c:85-85,timestep.c:88-88,timestep.c:91-92 | advancePosition._omp_fn.0 | Outermost | 0 | 0 | 0.01 | 10.00 | 1.00 | 5.33 | 10.00 | 1 | 0.00 | 21.67 | 5.00 | 0.50 | 5.00 | 0.94 | 0.50 |
Loop 92 | exec | linkCells.c:291-297 | updateLinkCells | Outermost | 0 | 0 | 0.01 | 1.00 | 1.00 | 8.00 | 1.75 | 3 | 0.00 | 12.50 | 1.75 | 1.75 | 1.75 | 0.22 | 1.00 |
Loop 82 | exec | initAtoms.c:39-39,mytype.h:22-24 | initAtoms | Single | 0 | 0 | 0.01 | 1.00 | NA | NA | 16.50 | 1 | NA | NA | 16.50 | 16.50 | NA | NA | 1.00 |
Loop 36 | exec | haloExchange.c:376-383,haloExchange.c:390-390 | loadAtomsBuffer | Outermost | 0 | 0 | 0.01 | 1.00 | 1.00 | 6.78 | 1.00 | 2 | 0.00 | 17.97 | 3.00 | 3.00 | 3.00 | 0.44 | 3.00 |
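
The speedup columns in the table above appear to follow from the CQA cycle columns: each projected speedup matches the baseline "CQA cycles" value divided by the "CQA cycles if ..." value for the same scenario, rounded to two decimals. The sketch below checks that reading against the Loop 100 row. It is an inference from the numbers shown, not a documented formula, and `projected_speedup` is a hypothetical helper name that does not belong to any tool.

```python
def projected_speedup(baseline_cycles: float, scenario_cycles: float) -> float:
    """Return baseline/scenario cycles, rounded to two decimals.

    Assumption: this ratio is how the table's speedup columns relate to
    its CQA cycle columns. It is inferred from the data, not documented.
    """
    if scenario_cycles <= 0:
        raise ValueError("scenario_cycles must be positive")
    return round(baseline_cycles / scenario_cycles, 2)


# Values copied from the Loop 100 row of the table.
LOOP_100_CQA_CYCLES = 13.50
LOOP_100_SCENARIO_CYCLES = {
    "no scalar integer": 6.67,
    "FP arith vectorized": 13.50,
    "fully vectorized": 3.50,
    "FP only": 11.25,
}

loop_100_speedups = {
    scenario: projected_speedup(LOOP_100_CQA_CYCLES, cycles)
    for scenario, cycles in LOOP_100_SCENARIO_CYCLES.items()
}

for scenario, speedup in loop_100_speedups.items():
    print(f"Speedup if {scenario}: {speedup:.2f}")
```

For Loop 100 the computed ratios agree with the reported speedups (2.02, 1.00, 3.86, 1.20). On some other rows the ratio of the displayed values differs in the last digit (for Loop 99, 3.25 / 0.67 is about 4.85 against a reported 4.87), which is consistent with the cycle counts being rounded for display.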