ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
○Loop 103 | exec | ljForce.c:191-191,ljForce.c:197-216 | ljForce._omp_fn.1 | Innermost | 11.19 | 9.4 | 54.06 | 2.02 | 1.00 | 3.86 | 1.20 | 1 | 38.10 | 34.52 | 13.50 | 6.67 | 13.50 | 3.50 | 11.25 |
○Loop 102 | exec | ljForce.c:172-172,ljForce.c:178-178,ljForce.c:187-187,ljForce.c:191-191,ljForce.c:197-197 | ljForce._omp_fn.1 | Outermost | 0.37 | 0.24 | 1.38 | 4.87 | 1.00 | 4.95 | 6.50 | 1 | 0.00 | 22.92 | 3.25 | 0.67 | 3.25 | 0.66 | 0.50 |
○Loop 118 | exec | timestep.c:74-78 | advanceVelocity._omp_fn.0 | Innermost | 0.26 | 0.22 | 1.29 | 1.00 | 1.00 | 3.89 | 3.89 | 1 | 2.86 | 25.71 | 11.67 | 11.67 | 11.67 | 3.00 | 3.00 |
○Loop 101 | exec | mytype.h:22-24,ljForce.c:161-161 | ljForce._omp_fn.0 | Innermost | 0.23 | 0.21 | 1.22 | 1.00 | NA | NA | 13.00 | 1 | NA | NA | 13.00 | 13.00 | NA | NA | 1.00 |
○Loop 69 | exec | haloExchange.c:621-630 | sortAtomsInCell | Innermost | 0.2 | 0.12 | 0.71 | 1.33 | 1.00 | 4.00 | 10.67 | 1 | 0.00 | 25.00 | 10.67 | 8.00 | 10.67 | 2.67 | 1.00 |
○Loop 114 | exec | timestep.c:88-94 | advancePosition._omp_fn.0 | Innermost | 0.17 | 0.12 | 0.71 | 1.10 | 1.00 | 4.00 | 2.10 | 1 | 0.00 | 25.00 | 7.33 | 6.67 | 7.33 | 1.83 | 3.50 |
○Loop 67 | exec | haloExchange.c:617-642,haloExchange.c:645-645 | sortAtomsInCell | Outermost | 0.1 | 0.06 | 0.35 | 1.71 | 1.00 | 5.20 | 8.48 | 1 | 0.00 | 22.19 | 14.83 | 8.67 | 14.83 | 2.85 | 1.75 |
○Loop 117 | exec | timestep.c:74-78 | advanceVelocity._omp_fn.0 | InBetween | 0.09 | 0.04 | 0.24 | 1.12 | 1.00 | 3.86 | 4.07 | 8 | 6.98 | 26.16 | 9.17 | 8.17 | 9.17 | 2.38 | 2.25 |
○Loop 68 | exec | haloExchange.c:633-642 | sortAtomsInCell | Innermost | 0.05 | 0.02 | 0.12 | 1.33 | 1.00 | 4.00 | 10.67 | 1 | 0.00 | 25.00 | 10.67 | 8.00 | 10.67 | 2.67 | 1.00 |
○Loop 40 | exec | haloExchange.c:380-389 | loadAtomsBuffer | Innermost | 0.59 | 0.01 | 0.05 | 1.13 | 1.00 | 4.69 | 5.67 | 1 | 0.00 | 23.21 | 5.67 | 5.00 | 5.67 | 1.21 | 1.00 |
○Loop 122 | exec | timestep.c:110-116 | kineticEnergy._omp_fn.0 | Innermost | 0.02 | 0.01 | 0.05 | 1.00 | 1.00 | 3.84 | 1.00 | 1 | 14.29 | 28.57 | 8.00 | 8.00 | 8.00 | 2.08 | 8.00 |
○Loop 94 | exec | linkCells.c:179-195,linkCells.c:354-354,linkCells.c:373-378 | putAtomInBox | Single | 0.31 | 0 | 0.03 | 2.93 | 1.00 | 4.51 | 7.33 | 1 | 38.46 | 28.85 | 7.33 | 2.50 | 7.33 | 1.63 | 1.00 |
○Loop 104 | exec | ljForce.c:175-178 | ljForce._omp_fn.1 | Innermost | 0.01 | 0 | 0.02 | 1.00 | 1.00 | 8.00 | 4.00 | 1 | 0.00 | 12.50 | 1.00 | 1.00 | 1.00 | 0.13 | 0.25 |
○Loop 121 | exec | timestep.c:110-116 | kineticEnergy._omp_fn.0 | InBetween | 0.01 | 0 | 0.01 | 2.63 | 1.00 | 5.25 - 5.73 | 2.10 | 8 | 15.00 | 28.13 | 7.88 | 3.00 | 7.88 | 1.50 - 1.38 | 3.75 |
○Loop 85 | exec | mytype.h:22-24,initAtoms.c:39-46 | initAtoms | Single | 0.09 | 0 | 0.01 | 1.00 | NA | NA | 23.00 | 1 | NA | NA | 23.00 | 23.00 | NA | NA | 1.00 |
○Loop 112 | exec | random.c:26-29,random.c:45-48 | gasdev | Single | 0 | 0 | 0.01 | 1.00 | 1.00 | 4.00 | 1.00 | 1 | 0.00 | 25.00 | 20.00 - 50.00 | 20.00 - 50.00 | 20.00 - 50.00 | 5.00 - 12.50 | 20.00 - 50.00 |
○Loop 113 | exec | timestep.c:85-85,timestep.c:88-88,timestep.c:91-92 | advancePosition._omp_fn.0 | Outermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 39 | exec | haloExchange.c:376-383,haloExchange.c:390-390 | loadAtomsBuffer | Outermost | 0.04 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 96 | exec | linkCells.c:291-295 | updateLinkCells | Single | 0.02 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 87 | exec | initAtoms.c:89-90,initAtoms.c:93-100 | createFccLattice | InBetween | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 100 | exec | mytype.h:22-24,ljForce.c:157-161 | ljForce._omp_fn.0 | Outermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 50 | exec | haloExchange.c:414-414,haloExchange.c:424-424 | unloadAtomsBuffer | Single | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 90 | exec | linkCells.c:149-153,linkCells.c:209-247,linkCells.c:455-461 | getNeighborBoxes | Single | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 81 | exec | initAtoms.c:154-162 | setTemperature._omp_fn.0 | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 73 | exec | initAtoms.c:126-133 | setVcm._omp_fn.0 | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
○Loop 124 | exec | timestep.c:154-154 | redistributeAtoms._omp_fn.0 | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |