ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Loop 25 | exec | ljForce.c:191-191,ljForce.c:197-216 | ljForce._omp_fn.1 | Innermost | 7.42 | 8.74 | 56.34 | 1.00 | 2.67 | 5.00 | 1.00 | 9 | 0.00 | 12.50 | 72.91 | 20.00 | 20.00 | 7.50 | 4.00 | 20.00 |
Loop 16 | exec | mytype.h:22-24,ljForce.c:161-161 | ljForce._omp_fn.0 | Single | 1.26 | 1.35 | 8.69 | 1.00 | 1.00 | 16.00 | 16.00 | 1 | 0.00 | 6.25 | 0.00 | 16.00 | 16.00 | 16.00 | 1.00 | 1.00 |
Loop 24 | exec | ljForce.c:187-187,ljForce.c:191-191,ljForce.c:197-198,ljForce.c:201-201,ljForce.c:206-210,ljForce.c:213-213,ljForce.c:216-216 | ljForce._omp_fn.1 | InBetween | 0.75 | 0.73 | 4.74 | 1.00 | 2.74 | 5.25 | 1.05 | 40 | 0.00 | 12.32 | 77.75 | 10.50 | 10.50 | 3.83 | 2.00 | 10.00 |
Loop 18 | exec | timestep.c:74-78 | advanceVelocity._omp_fn.0 | Innermost | 0.53 | 0.43 | 2.77 | 1.00 | 1.00 | 8.00 | 1.33 | 1 | 0.00 | 12.50 | 4.68 | 16.00 | 16.00 | 16.00 | 2.00 | 12.00 |
Loop 61 | exec | haloExchange.c:621-630 | sortAtomsInCell | Single | 0.51 | 0.39 | 2.5 | 1.33 | 1.00 | 9.14 | 8.00 | 1 | 0.00 | 10.94 | 0.00 | 8.00 | 6.00 | 8.00 | 0.88 | 1.00 |
Loop 17 | exec | timestep.c:74-78 | advanceVelocity._omp_fn.0 | Outermost | 0.58 | 0.39 | 2.49 | 1.29 | 1.31 | 9.79 | 1.48 | 16 | 0.00 | 12.41 | 2.64 | 18.50 | 14.33 | 14.17 | 1.89 | 12.50 |
Loop 20 | exec | timestep.c:88-94 | advancePosition._omp_fn.0 | Innermost | 0.36 | 0.35 | 2.28 | 1.00 | 1.68 | 2.00 | 1.00 | 1 | 0.00 | 12.50 | 6.45 | 16.00 | 16.00 | 9.50 | 8.00 | 16.00 |
Loop 19 | exec | timestep.c:88-94 | advancePosition._omp_fn.0 | Outermost | 0.3 | 0.2 | 1.26 | 1.13 | 1.83 | 2.25 | 1.13 | 8 | 0.00 | 12.15 | 3.71 | 13.50 | 12.00 | 7.38 | 6.00 | 12.00 |
Loop 23 | exec | ljForce.c:178-184,ljForce.c:187-187,ljForce.c:191-191,ljForce.c:201-201 | ljForce._omp_fn.1 | InBetween | 0.06 | 0.04 | 0.27 | 2.56 | 1.00 | 14.77 | 2.56 | 41 | 20.00 | 12.50 | 111.15 | 3.83 | 1.50 | 3.83 | 0.26 | 1.50 |
Loop 28 | exec | timestep.c:110-116 | kineticEnergy._omp_fn.0 | Innermost | 0.05 | 0.03 | 0.19 | 1.00 | 2.00 | 2.00 | 1.00 | 1 | 0.00 | 12.50 | 15.03 | 16.00 | 16.00 | 8.00 | 8.00 | 16.00 |
Loop 27 | exec | timestep.c:110-116 | kineticEnergy._omp_fn.0 | Outermost | 0.04 | 0.02 | 0.1 | 1.00 | 2.00 | 2.00 | 1.00 | 8 | 0.00 | 12.08 | 3.03 | 12.00 | 12.00 | 6.00 | 6.00 | 12.00 |
Loop 2 | exec | haloExchange.c:380-390 | loadAtomsBuffer | Innermost | 0.44 | 0.01 | 0.07 | 1.33 | 1.00 | 9.14 | 2.67 | 1 | 0.00 | 11.25 | 16.30 | 4.00 | 3.00 | 4.00 | 0.44 | 1.50 |
Loop 60 | exec | haloExchange.c:633-642 | sortAtomsInCell | Single | 0.02 | 0.01 | 0.05 | 1.33 | 1.00 | 9.14 | 8.00 | 1 | 0.00 | 10.94 | 0.00 | 8.00 | 6.00 | 8.00 | 0.88 | 1.00 |
Loop 15 | exec | random.c:45-48,random.c:68-70,initAtoms.c:197-202 | randomDisplacements._omp_fn.0 | Innermost | 0.01 | 0 | 0.03 | 1.00 | 1.32 | 9.28 - 9.28 | 1.00 | 1 | 0.00 | 12.36 | NA | 64.00 - 74.00 | 64.00 - 74.00 | 48.38 - 55.93 | 6.90 - 7.97 | 64.00 - 74.00 |
Loop 21 | exec | ljForce.c:175-175,ljForce.c:178-180,ljForce.c:187-187,ljForce.c:213-213 | ljForce._omp_fn.1 | Outermost | 0.01 | 0 | 0.02 | 2.27 | 1.00 | 11.83 | 2.83 | 2 | 0.00 | 8.13 | NA | 2.83 | 1.25 | 2.83 | 0.24 | 1.00 |
Loop 83 | exec | linkCells.c:295-301 | updateLinkCells | Innermost | 0.13 | 0 | 0.02 | 1.00 | 1.00 | 15.09 | 3.00 | 2 | 0.00 | 9.06 | NA | 3.00 | 3.00 | 3.00 | 0.20 | 1.00 |
Loop 94 | exec | random.c:45-46,random.c:68-70,initAtoms.c:154-162 | setTemperature._omp_fn.0 | Innermost | 0.01 | 0 | 0.01 | 1.00 | 3.09 | 10.35 - 11.76 | 1.00 | 1 | 0.00 | 12.07 | NA | 44.00 - 50.00 | 44.00 - 50.00 | 14.85 - 16.21 | 4.25 | 44.00 - 50.00 |
Loop 13 | exec | initAtoms.c:177-181 | setTemperature._omp_fn.1 | Innermost | 0.01 | 0 | 0.01 | 1.00 | 1.00 | 8.00 | 1.00 | 1 | 0.00 | 12.50 | NA | 6.00 | 6.00 | 6.00 | 0.75 | 6.00 |
Loop 81 | exec | haloExchange.c:414-414,haloExchange.c:424-424 | unloadAtomsBuffer | Single | 0.08 | 0 | 0.01 | 1.33 | 1.00 | 9.14 | 5.33 | 1 | 0.00 | 11.25 | NA | 5.33 | 4.00 | 5.33 | 0.58 | 1.00 |
Loop 92 | exec | random.c:26-29,random.c:45-48 | gasdev | Single | 0.01 | 0 | 0.01 | 1.00 | 2.06 | 8.00 | 1.00 | 1 | 0.00 | 12.50 | NA | 13.00 - 21.00 | 13.00 - 21.00 | 6.31 - 10.19 | 1.63 - 2.63 | 13.00 - 21.00 |
Loop 52 | exec | initAtoms.c:221-228 | computeVcm._omp_fn.0 | Innermost | 0.01 | 0 | 0.01 | 1.00 | 2.18 | 8.00 | 1.00 | 1 | 0.00 | 12.50 | NA | 12.00 | 12.00 | 5.50 | 1.50 | 12.00 |
Loop 62 | exec | timestep.c:154-154 | redistributeAtoms._omp_fn.0 | Single | 0.01 | 0 | 0.01 | 1.00 | 1.00 | 8.00 | 4.22 | 1 | 0.00 | 12.50 | NA | 7.17 | 7.17 | 7.17 | 0.90 | 1.70 |
Loop 11 | exec | initAtoms.c:126-133 | setVcm._omp_fn.0 | Innermost | 0.01 | 0 | 0.01 | 1.00 | 1.12 | 8.00 | 1.78 | 1 | 0.00 | 12.50 | NA | 10.67 | 10.67 | 9.50 | 1.33 | 6.00 |
Loop 63 | exec | initAtoms.c:39-39,mytype.h:22-24 | initAtoms | Single | 0.04 | 0 | 0.01 | 1.00 | 1.00 | 16.00 | 18.00 | 1 | 0.00 | 6.25 | NA | 18.00 | 18.00 | 18.00 | 1.13 | 1.00 |
Loop 51 | exec | initAtoms.c:221-228 | computeVcm._omp_fn.0 | Outermost | 0.01 | 0 | 0.01 | 1.61 | 2.23 | 12.44 | 1.61 | 8 | 0.00 | 11.81 | NA | 9.67 | 6.00 | 4.33 | 0.78 | 6.00 |
Loop 12 | exec | initAtoms.c:177-181 | setTemperature._omp_fn.1 | Outermost | 0 | 0 | 0.01 | 1.36 | 1.67 | 10.00 | 1.36 | 8 | 0.00 | 12.05 | NA | 7.50 | 5.50 | 4.50 | 0.75 | 5.50 |
Loop 1 | exec | haloExchange.c:376-383 | loadAtomsBuffer | Outermost | 0.04 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 82 | exec | linkCells.c:291-295 | updateLinkCells | Outermost | 0.03 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 10 | exec | initAtoms.c:126-133 | setVcm._omp_fn.0 | Outermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 88 | exec | initAtoms.c:90-100 | createFccLattice | Innermost | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 77 | exec | linkCells.c:151-153 | getNeighborBoxes | InBetween | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
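
In the rows above, each "Speedup if ..." value appears to equal the baseline `CQA cycles` divided by the matching `CQA cycles if ...` value. For Loop 25, 20.00 / 7.50 ≈ 2.67 and 20.00 / 4.00 = 5.00. The sketch below checks that reading against the Loop 25 row. The relationship is inferred from the table itself rather than from the profiler's documentation, and the function name and dictionary keys are illustrative only.

```python
def projected_speedup(cqa_cycles: float, cqa_cycles_if: float) -> float:
    """Ratio of baseline CQA cycles to the cycles of a what-if scenario.

    Assumption: this ratio is how the "Speedup if ..." columns are derived;
    it matches the rows in the table but is not taken from tool documentation.
    """
    if cqa_cycles_if <= 0:
        raise ValueError("what-if cycle count must be positive")
    return cqa_cycles / cqa_cycles_if


# Values copied from the Loop 25 row (ljForce.c:197-216).
loop25 = {
    "cqa": 20.00,            # CQA cycles
    "no_scalar_int": 20.00,  # CQA cycles if no scalar integer
    "fp_vec": 7.50,          # CQA cycles if FP arith vectorized
    "full_vec": 4.00,        # CQA cycles if fully vectorized
    "fp_only": 20.00,        # CQA cycles if FP only
}

# Recompute every speedup column from the cycle columns.
speedups = {
    scenario: round(projected_speedup(loop25["cqa"], cycles), 2)
    for scenario, cycles in loop25.items()
    if scenario != "cqa"
}
print(speedups)
```

Recomputed values can differ slightly from the reported ones because the cycle columns are themselves rounded: for Loop 61, 8.00 / 0.88 ≈ 9.09, whereas the table reports 9.14.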