ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
○Loop 25 | exec | ljForce.c:191-191,ljForce.c:197-216 | ljForce._omp_fn.1 | Innermost | 9.37 | 11.77 | 57.76 | 1.00 | 3.16 | 4.88 | 1.03 | 9 | 12.50 | 14.06 | 152.77 | 19.50 | 19.50 | 6.17 | 4.00 | 19.00 |
○Loop 16 | exec | mytype.h:22-24,ljForce.c:161-161 | ljForce._omp_fn.0 | Single | 1.2 | 1.45 | 7.1 | 3.00 | 1.00 | 8.00 | 12.00 | 1 | 33.33 | 12.50 | 0.00 | 12.00 | 4.00 | 12.00 | 1.50 | 1.00 |
○Loop 24 | exec | ljForce.c:187-187,ljForce.c:191-191,ljForce.c:197-198,ljForce.c:201-201,ljForce.c:206-210,ljForce.c:213-213,ljForce.c:216-216 | ljForce._omp_fn.1 | InBetween | 0.89 | 0.98 | 4.83 | 1.00 | 3.29 | 5.25 | 1.11 | 40 | 11.76 | 13.79 | 154.31 | 10.50 | 10.50 | 3.19 | 2.00 | 9.50 |
○Loop 18 | exec | timestep.c:74-78 | advanceVelocity._omp_fn.0 | Innermost | 0.6 | 0.55 | 2.72 | 1.00 | 1.00 | 8.00 | 1.33 | 1 | 0.00 | 12.50 | 4.68 | 16.00 | 16.00 | 16.00 | 2.00 | 12.00 |
○Loop 61 | exec | haloExchange.c:621-630 | sortAtomsInCell | Single | 0.49 | 0.49 | 2.41 | 1.33 | 1.00 | 9.14 | 8.00 | 1 | 0.00 | 10.94 | 0.00 | 8.00 | 6.00 | 8.00 | 0.88 | 1.00 |
○Loop 17 | exec | timestep.c:74-78 | advanceVelocity._omp_fn.0 | Outermost | 0.59 | 0.45 | 2.22 | 1.29 | 1.31 | 9.79 | 1.48 | 16 | 0.00 | 12.41 | 3.06 | 18.50 | 14.33 | 14.17 | 1.89 | 12.50 |
○Loop 20 | exec | timestep.c:88-94 | advancePosition._omp_fn.0 | Innermost | 0.4 | 0.39 | 1.92 | 1.00 | 1.68 | 2.00 | 1.00 | 1 | 0.00 | 12.50 | 8.31 | 16.00 | 16.00 | 9.50 | 8.00 | 16.00 |
○Loop 19 | exec | timestep.c:88-94 | advancePosition._omp_fn.0 | Outermost | 0.29 | 0.21 | 1.05 | 1.13 | 1.83 | 2.25 | 1.13 | 8 | 0.00 | 12.15 | 4.81 | 13.50 | 12.00 | 7.38 | 6.00 | 12.00 |
○Loop 23 | exec | ljForce.c:178-184,ljForce.c:187-187,ljForce.c:191-191,ljForce.c:201-201 | ljForce._omp_fn.1 | InBetween | 0.09 | 0.06 | 0.3 | 2.89 | 1.00 | 14.90 | 2.89 | 41 | 16.67 | 11.46 | 208.27 | 4.33 | 1.50 | 4.33 | 0.29 | 1.50 |
○Loop 28 | exec | timestep.c:110-116 | kineticEnergy._omp_fn.0 | Innermost | 0.04 | 0.03 | 0.15 | 1.00 | 2.00 | 2.00 | 1.00 | 1 | 0.00 | 12.50 | 11.35 | 16.00 | 16.00 | 8.00 | 8.00 | 16.00 |
○Loop 27 | exec | timestep.c:110-116 | kineticEnergy._omp_fn.0 | Outermost | 0.04 | 0.02 | 0.07 | 1.00 | 2.00 | 2.00 | 1.00 | 8 | 0.00 | 12.08 | 2.30 | 12.00 | 12.00 | 6.00 | 6.00 | 12.00 |
○Loop 2 | exec | haloExchange.c:380-390 | loadAtomsBuffer | Innermost | 0.45 | 0.01 | 0.06 | 1.33 | 1.00 | 9.14 | 2.67 | 1 | 0.00 | 11.25 | 12.30 | 4.00 | 3.00 | 4.00 | 0.44 | 1.50 |
○Loop 60 | exec | haloExchange.c:633-642 | sortAtomsInCell | Single | 0.03 | 0.01 | 0.06 | 1.33 | 1.00 | 9.14 | 8.00 | 1 | 0.00 | 10.94 | 0.00 | 8.00 | 6.00 | 8.00 | 0.88 | 1.00 |
○Loop 15 | exec | initAtoms.c:197-202,random.c:45-48,random.c:68-70 | randomDisplacements._omp_fn.0 | Innermost | 0.01 | 0.01 | 0.04 | 11.09 | 1.00 | 8.23 | 19.40 | 1 | 2.31 | 12.64 | 4.15 | 38.80 | 3.50 | 38.80 | 4.71 | 2.00 |
○Loop 83 | exec | linkCells.c:295-301 | updateLinkCells | Innermost | 0.26 | 0.01 | 0.03 | 1.00 | 1.00 | 15.09 | 3.00 | 2 | 0.00 | 9.06 | 29.20 | 3.00 | 3.00 | 3.00 | 0.20 | 1.00 |
○Loop 21 | exec | ljForce.c:175-175,ljForce.c:178-180,ljForce.c:187-187,ljForce.c:213-213 | ljForce._omp_fn.1 | Outermost | 0.02 | 0 | 0.02 | 2.27 | 1.00 | 10.46 | 2.83 | 2 | 0.00 | 8.17 | NA | 2.83 | 1.25 | 2.83 | 0.27 | 1.00 |
○Loop 52 | exec | initAtoms.c:221-221,initAtoms.c:228-228 | computeVcm._omp_fn.0 | Innermost | 0.01 | 0 | 0.02 | 1.00 | 1.00 | 5.54 | 1.00 | 1 | 33.33 | 20.83 | NA | 12.00 | 12.00 | 12.00 | 2.17 | 12.00 |
○Loop 94 | exec | initAtoms.c:154-162,random.c:45-46,random.c:68-70 | setTemperature._omp_fn.0 | Innermost | 0.01 | 0 | 0.01 | 1.00 | 3.09 | 10.35 - 11.76 | 1.00 | 1 | 0.00 | 12.07 | NA | 44.00 - 50.00 | 44.00 - 50.00 | 14.85 - 16.21 | 4.25 | 44.00 - 50.00 |
○Loop 92 | exec | random.c:26-29,random.c:45-48 | gasdev | Single | 0.01 | 0 | 0.01 | 1.00 | 2.06 | 8.00 | 1.00 | 1 | 0.00 | 12.50 | NA | 13.00 - 21.00 | 13.00 - 21.00 | 6.31 - 10.19 | 1.63 - 2.63 | 13.00 - 21.00 |
○Loop 81 | exec | haloExchange.c:414-414,haloExchange.c:424-424 | unloadAtomsBuffer | Single | 0.12 | 0 | 0.01 | 1.33 | 1.00 | 9.14 | 5.33 | 1 | 0.00 | 11.25 | NA | 5.33 | 4.00 | 5.33 | 0.58 | 1.00 |
○Loop 51 | exec | initAtoms.c:221-223,initAtoms.c:227-228 | computeVcm._omp_fn.0 | Outermost | 0.01 | 0 | 0.01 | 1.13 | 1.13 | 6.78 | 3.39 | 8 | 25.00 | 18.23 | NA | 10.17 | 9.00 | 9.00 | 1.50 | 3.00 |
○Loop 62 | exec | timestep.c:154-154 | redistributeAtoms._omp_fn.0 | Single | 0.01 | 0 | 0.01 | 1.00 | 1.00 | 8.00 | 4.22 | 1 | 0.00 | 12.50 | NA | 7.17 | 7.17 | 7.17 | 0.90 | 1.70 |
○Loop 82 | exec | linkCells.c:291-295 | updateLinkCells | Outermost | 0.03 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 88 | exec | initAtoms.c:90-100 | createFccLattice | Innermost | 0.03 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 1 | exec | haloExchange.c:376-383 | loadAtomsBuffer | Outermost | 0.02 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 13 | exec | initAtoms.c:177-181 | setTemperature._omp_fn.1 | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 11 | exec | initAtoms.c:126-133 | setVcm._omp_fn.0 | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 12 | exec | initAtoms.c:177-181 | setTemperature._omp_fn.1 | Outermost | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 63 | exec | initAtoms.c:39-39,mytype.h:22-22 | initAtoms | Single | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 79 | exec | linkCells.c:118-120 | initLinkCells | Single | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 77 | exec | linkCells.c:151-153 | getNeighborBoxes | InBetween | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 10 | exec | initAtoms.c:126-133 | setVcm._omp_fn.0 | Outermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 84 | exec | linkCells.c:384-385 | updateLinkCells | Single | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |