Loop id | Source Location | Source Function | Level | Coverage 2x1 (%) | Coverage 2x2 (%) | Coverage 2x4 (%) | Coverage 2x8 (%) | Coverage 2x16 (%) | Coverage 2x32 (%) | Coverage 2x56 (%) | Max Time Over Threads 2x1 (s) | Max Time Over Threads 2x2 (s) | Max Time Over Threads 2x4 (s) | Max Time Over Threads 2x8 (s) | Max Time Over Threads 2x16 (s) | Max Time Over Threads 2x32 (s) | Max Time Over Threads 2x56 (s) | Time w.r.t. Wall Time 2x1 (s) | Time w.r.t. Wall Time 2x2 (s) | Time w.r.t. Wall Time 2x4 (s) | Time w.r.t. Wall Time 2x8 (s) | Time w.r.t. Wall Time 2x16 (s) | Time w.r.t. Wall Time 2x32 (s) | Time w.r.t. Wall Time 2x56 (s) | Nb Threads 2x1 | Nb Threads 2x2 | Nb Threads 2x4 | Nb Threads 2x8 | Nb Threads 2x16 | Nb Threads 2x32 | Nb Threads 2x56 | GFLOPS 2x1 | GFLOPS 2x2 | GFLOPS 2x4 | GFLOPS 2x8 | GFLOPS 2x16 | GFLOPS 2x32 | GFLOPS 2x56 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing 2x1 | Speedup If Perfect Load Balancing 2x2 | Speedup If Perfect Load Balancing 2x4 | Speedup If Perfect Load Balancing 2x8 | Speedup If Perfect Load Balancing 2x16 | Speedup If Perfect Load Balancing 2x32 | Speedup If Perfect Load Balancing 2x56 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | (2x1) Efficiency | (2x1) Potential Speed-Up (%) | (2x2) Efficiency | (2x2) Potential Speed-Up (%) | (2x4) Efficiency | (2x4) Potential Speed-Up (%) | (2x8) Efficiency | (2x8) Potential Speed-Up (%) | (2x16) Efficiency | (2x16) Potential Speed-Up (%) | (2x32) Efficiency | (2x32) Potential Speed-Up (%) | (2x56) Efficiency | (2x56) Potential Speed-Up (%) |
---|
233 | exec - flux_calc.cpp:39-40 | flux_calc_kernel(int, int, int, int, double, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double... | Innermost | 8.82 | 8.37 | 8.42 | 8.08 | 7.3 | 5.13 | 4.45 | 135.37 | 69.7 | 36.36 | 18.11 | 9.22 | 4.92 | 3.43 | 135.18 | 67.77 | 34.35 | 17.25 | 8.63 | 4.6 | 3.34 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 1.52 | 3.01 | 5.98 | 11.91 | 23.81 | 44.70 | 61.39 | 100 | 100 | 1 | 1 | 1 | 1 | 1.03 | 1.06 | 1.06 | 1.07 | 1.08 | 1.03 | 1 | 12 | 0 | 0 | 0 | 1 | 0 | 1 | 0.02 | 0.98 | 0.14 | 0.98 | 0.17 | 0.98 | 0.15 | 0.92 | 0.42 | 0.72 | 1.23 |
589 | exec - viscosity.cpp:39-64 [...] | viscosity_kernel(int, int, int, int, clover::Buffer1D<double>&, clover::Buffer1D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&... | Innermost | 8.53 | 8.35 | 8.27 | 7.94 | 7.1 | 4.73 | 3.13 | 130.7 | 67.36 | 33.82 | 16.98 | 8.47 | 4.39 | 2.49 | 130.75 | 67.61 | 33.74 | 16.94 | 8.4 | 4.24 | 2.35 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 6.60 | 12.76 | 25.56 | 50.91 | 102.66 | 203.38 | 367.07 | 10.59 | 13.82 | 1 | 1.8 | 1.8 | 1 | 1 | 1 | 1.01 | 1.01 | 1.05 | 1.06 | NA | NA | NA | NA | NA | 1 | 0 | 0.97 | 0.28 | 0.97 | 0.26 | 0.96 | 0.28 | 0.97 | 0.19 | 0.96 | 0.17 | 0.99 | 0.02 |
220 | exec - calc_dt.cpp:52-75 [...] | calc_dt_kernel(int, int, int, int, double, double, double, double, double, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer1D<double>&, clover::Buffer1D<double>&, clover::Buffer1D<double>&am... | Innermost | 6.47 | 6.75 | 6.69 | 6.4 | 5.76 | 3.85 | 3.35 | 99.23 | 54.73 | 27.49 | 13.85 | 6.97 | 3.57 | 2.84 | 99.19 | 54.68 | 27.27 | 13.65 | 6.81 | 3.45 | 2.52 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 7.87 | 14.28 | 28.69 | 57.29 | 114.84 | 226.55 | 309.44 | 99.21 | 98.12 | 1 | 1 | 1 | 1 | 1.01 | 1.01 | 1.02 | 1.02 | 1.05 | 1.13 | 2 | 0 | 0 | 0 | 14 | 1 | 0 | 0.91 | 0.63 | 0.91 | 0.61 | 0.91 | 0.59 | 0.91 | 0.52 | 0.9 | 0.39 | 0.7 | 1 |
177 | exec - advec_cell.cpp:163-202 [...] | advec_cell_kernel(int, int, int, int, int, int, clover::Buffer1D<double>&, clover::Buffer1D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<dou... | Innermost | 5.25 | 5.65 | 5.7 | 5.47 | 4.92 | 3.42 | 3.28 | 80.54 | 47.41 | 24.35 | 12.18 | 6.14 | 3.19 | 2.54 | 80.45 | 45.8 | 23.24 | 11.67 | 5.83 | 3.06 | 2.47 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 8.93 | 15.68 | 30.89 | 61.53 | 123.14 | 234.63 | 290.73 | 98.87 | 94.35 | 1.09 | 1 | 1 | 1 | 1.04 | 1.05 | 1.05 | 1.05 | 1.05 | 1.03 | 3 | 2 | 1 | 0 | 15 | 1 | 0 | 0.88 | 0.69 | 0.87 | 0.77 | 0.86 | 0.76 | 0.86 | 0.68 | 0.82 | 0.61 | 0.58 | 1.37 |
203 | exec - advec_mom.cpp:186-211 [...] | advec_mom_kernel(int, int, int, int, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&... | Innermost | 4.83 | 5.32 | 5.35 | 5.14 | 4.72 | 3.85 | 4.31 | 74.06 | 44.04 | 21.99 | 11.16 | 5.76 | 3.74 | 3.48 | 74.02 | 43.13 | 21.81 | 10.97 | 5.59 | 3.46 | 3.24 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 10.57 | 18.12 | 35.84 | 71.25 | 139.87 | 226.05 | 241.43 | 96.15 | 91.11 | 1 | 1 | 1 | 1 | 1.03 | 1.01 | 1.03 | 1.03 | 1.09 | 1.08 | 3 | 2 | 0 | 0 | 6 | 1 | 0 | 0.86 | 0.75 | 0.85 | 0.81 | 0.84 | 0.8 | 0.83 | 0.81 | 0.67 | 1.28 | 0.41 | 2.55 |
276 | exec - PdV.cpp:72-83 [...] | PdV_kernel(bool, int, int, int, int, double, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double... | Innermost | 4.52 | 4.38 | 4.35 | 4.47 | 4.85 | 5.49 | 6.34 | 69.38 | 35.4 | 17.94 | 9.9 | 5.81 | 5.43 | 4.84 | 69.28 | 35.48 | 17.75 | 9.53 | 5.74 | 4.92 | 4.77 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 8.58 | 16.78 | 33.54 | 62.49 | 103.72 | 121.01 | 124.64 | 100 | 100 | 1 | 1 | 1 | 1 | 1 | 1.01 | 1.05 | 1.01 | 1.11 | 1.02 | 1 | 11 | 1 | 1 | 0 | 1 | 0 | 0.98 | 0.1 | 0.98 | 0.11 | 0.91 | 0.41 | 0.75 | 1.19 | 0.44 | 3.07 | 0.26 | 4.7 |
200 | exec - advec_mom.cpp:114-139 | advec_mom_kernel(int, int, int, int, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&... | Innermost | 4.52 | 4.78 | 4.78 | 4.54 | 4.25 | 3.76 | 4.3 | 69.35 | 38.96 | 19.66 | 9.8 | 5.13 | 3.69 | 3.43 | 69.22 | 38.7 | 19.48 | 9.69 | 5.03 | 3.37 | 3.23 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 11.85 | 21.19 | 42.05 | 84.53 | 162.85 | 243.08 | 253.86 | 100 | 94.02 | 1 | 1 | 1 | 1 | 1.01 | 1.01 | 1.02 | 1.02 | 1.11 | 1.06 | 2 | 3 | 0 | 0 | 6 | 1 | 0 | 0.89 | 0.51 | 0.89 | 0.53 | 0.89 | 0.49 | 0.86 | 0.59 | 0.64 | 1.35 | 0.38 | 2.65 |
273 | exec - PdV.cpp:51-63 [...] | PdV_kernel(bool, int, int, int, int, double, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double... | Innermost | 3.85 | 3.77 | 3.76 | 3.82 | 4.13 | 4.64 | 5.35 | 59.11 | 30.46 | 15.54 | 8.68 | 4.96 | 4.6 | 4.14 | 58.98 | 30.56 | 15.34 | 8.14 | 4.89 | 4.16 | 4.02 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 8.69 | 16.75 | 33.37 | 62.89 | 104.69 | 123.11 | 127.62 | 100 | 100 | 1 | 1 | 1 | 1 | 1 | 1.02 | 1.08 | 1.02 | 1.12 | 1.03 | 1 | 11 | 1 | 1 | 0 | 1 | 0 | 0.96 | 0.13 | 0.96 | 0.15 | 0.91 | 0.36 | 0.75 | 1.02 | 0.44 | 2.58 | 0.26 | 3.95 |
243 | exec - ideal_gas.cpp:40-45 | ideal_gas_kernel(int, int, int, int, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&) [clone ._omp_fn.0] | Innermost | 3.82 | 4.53 | 4.49 | 4.34 | 4.38 | 4.29 | 4.78 | 58.66 | 37.34 | 18.84 | 9.5 | 5.36 | 4.22 | 3.64 | 58.61 | 36.72 | 18.33 | 9.27 | 5.18 | 3.85 | 3.59 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 5.96 | 9.51 | 19.04 | 37.70 | 67.49 | 90.79 | 97.08 | 100 | 100 | 1 | 1 | 1 | 1 | 1.02 | 1.03 | 1.03 | 1.04 | 1.11 | 1.01 | 0 | 3 | 1 | 0 | 0 | 1 | 0 | 0.8 | 0.91 | 0.8 | 0.9 | 0.79 | 0.91 | 0.71 | 1.28 | 0.48 | 2.25 | 0.29 | 3.39 |
157 | exec - accelerate.cpp:43-53 | accelerate_kernel(int, int, int, int, double, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<doubl... | Innermost | 3.57 | 3.75 | 3.72 | 3.74 | 4.03 | 4.65 | 5.44 | 54.91 | 30.46 | 15.28 | 8.03 | 4.86 | 4.62 | 4.18 | 54.74 | 30.34 | 15.18 | 7.97 | 4.77 | 4.17 | 4.09 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 13.49 | 24.35 | 48.57 | 92.48 | 154.48 | 176.71 | 180.39 | 100 | 100 | 1.06 | 1 | 1 | 1 | 1.01 | 1.01 | 1.02 | 1.02 | 1.12 | 1.02 | 1 | 0 | 11 | 2 | 0 | 1 | 0 | 0.9 | 0.37 | 0.9 | 0.37 | 0.86 | 0.53 | 0.72 | 1.14 | 0.41 | 2.74 | 0.24 | 4.14 |
193 | exec - advec_mom.cpp:170-172 [...] | advec_mom_kernel(int, int, int, int, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&... | Innermost | 3.51 | 2.81 | 2.66 | 2.71 | 2.7 | 2.73 | 3.01 | 53.95 | 25.54 | 11.63 | 7.48 | 3.92 | 2.86 | 2.34 | 53.75 | 22.73 | 10.85 | 5.79 | 3.19 | 2.45 | 2.27 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 3.81 | 9.04 | 18.94 | 35.50 | 64.34 | 83.74 | 89.92 | 100 | 100 | 1 | 1 | 1 | 1 | 1.13 | 1.07 | 1.3 | 1.23 | 1.18 | 1.04 | 1 | 12 | 0 | 0 | 0 | 1 | 0 | 1.18 | 0 | 1.24 | 0 | 1.16 | 0 | 1.05 | 0 | 0.69 | 0.86 | 0.42 | 1.74 |
187 | exec - advec_mom.cpp:98-100 [...] | advec_mom_kernel(int, int, int, int, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&... | Innermost | 3.47 | 2.73 | 2.68 | 2.73 | 2.68 | 2.71 | 3.01 | 53.37 | 23.38 | 11.44 | 7.47 | 3.88 | 2.86 | 2.32 | 53.19 | 22.12 | 10.92 | 5.82 | 3.17 | 2.43 | 2.26 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 3.85 | 9.26 | 18.82 | 35.31 | 64.84 | 84.47 | 91.08 | 100 | 100 | 1 | 1 | 1 | 1 | 1.06 | 1.05 | 1.29 | 1.22 | 1.19 | 1.03 | 1 | 12 | 0 | 0 | 0 | 1 | 0 | 1.2 | 0 | 1.22 | 0 | 1.14 | 0 | 1.05 | 0 | 0.68 | 0.86 | 0.42 | 1.74 |
174 | exec - advec_cell.cpp:71-110 [...] | advec_cell_kernel(int, int, int, int, int, int, clover::Buffer1D<double>&, clover::Buffer1D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<dou... | Innermost | 3.36 | 3.56 | 3.56 | 3.45 | 3.24 | 2.98 | 3.42 | 51.56 | 29.02 | 14.78 | 7.51 | 4.01 | 2.97 | 2.65 | 51.53 | 28.8 | 14.52 | 7.36 | 3.83 | 2.67 | 2.57 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 13.94 | 24.94 | 49.46 | 97.56 | 187.47 | 268.94 | 279.51 | 100 | 94.44 | 1.05 | 1 | 1.01 | 1 | 1.01 | 1.02 | 1.03 | 1.05 | 1.13 | 1.03 | 2 | 3 | 1 | 0 | 16 | 1 | 0 | 0.89 | 0.38 | 0.89 | 0.4 | 0.88 | 0.43 | 0.84 | 0.52 | 0.6 | 1.18 | 0.36 | 2.2 |
179 | exec - advec_mom.cpp:47-48 | advec_mom_kernel(int, int, int, int, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&... | Innermost | 3.01 | 2.61 | 2.57 | 2.56 | 2.65 | 2.68 | 3.02 | 46.36 | 21.08 | 10.69 | 6 | 3.43 | 2.67 | 2.33 | 46.2 | 21.11 | 10.49 | 5.46 | 3.13 | 2.41 | 2.27 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 1.79 | 3.93 | 7.77 | 14.97 | 26.16 | 34.05 | 36.59 | 100 | 100 | 1 | 1 | 1 | 1 | 1 | 1.02 | 1.11 | 1.1 | 1.12 | 1.03 | 0 | 7 | 0 | 0 | 0 | 1 | 0 | 1.09 | 0 | 1.1 | 0 | 1.06 | 0 | 0.92 | 0.21 | 0.6 | 1.07 | 0.36 | 1.92 |
181 | exec - advec_mom.cpp:56-57 | advec_mom_kernel(int, int, int, int, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&... | Innermost | 2.93 | 2.57 | 2.52 | 2.51 | 2.59 | 2.63 | 2.95 | 45.12 | 21.27 | 10.42 | 5.92 | 3.36 | 2.62 | 2.3 | 44.99 | 20.86 | 10.26 | 5.34 | 3.06 | 2.36 | 2.22 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 1.80 | 3.88 | 7.89 | 15.19 | 26.51 | 34.30 | 36.62 | 100 | 100 | 1 | 1 | 1 | 1 | 1.03 | 1.02 | 1.12 | 1.1 | 1.12 | 1.04 | 0 | 7 | 0 | 0 | 0 | 1 | 0 | 1.08 | 0 | 1.1 | 0 | 1.05 | 0 | 0.92 | 0.21 | 0.6 | 1.06 | 0.36 | 1.88 |
197 | exec - advec_mom.cpp:221-221 [...] | advec_mom_kernel(int, int, int, int, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&... | Innermost | 2.91 | 2.58 | 2.55 | 2.65 | 3.06 | 3.78 | 4.42 | 44.6 | 20.95 | 10.45 | 5.73 | 3.69 | 3.76 | 3.39 | 44.62 | 20.87 | 10.39 | 5.65 | 3.62 | 3.39 | 3.32 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 3.68 | 7.86 | 15.53 | 28.51 | 44.56 | 47.60 | 49.35 | 100 | 100 | 1 | 1 | 1 | 1 | 1.01 | 1.01 | 1.02 | 1.02 | 1.12 | 1.02 | 0 | 5 | 0 | 0 | 0 | 1 | 0 | 1.07 | 0 | 1.07 | 0 | 0.99 | 0.03 | 0.77 | 0.7 | 0.41 | 2.23 | 0.24 | 3.36 |
191 | exec - advec_mom.cpp:149-149 [...] | advec_mom_kernel(int, int, int, int, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&... | Innermost | 2.86 | 2.54 | 2.51 | 2.64 | 3.05 | 3.78 | 4.42 | 43.77 | 20.72 | 10.34 | 5.66 | 3.66 | 3.79 | 3.4 | 43.78 | 20.59 | 10.25 | 5.62 | 3.61 | 3.39 | 3.32 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 3.75 | 7.96 | 16.04 | 29.26 | 45.42 | 48.47 | 49.24 | 100 | 100 | 1 | 1 | 1 | 1 | 1.01 | 1.01 | 1.01 | 1.02 | 1.13 | 1.02 | 0 | 5 | 0 | 0 | 0 | 1 | 0 | 1.06 | 0 | 1.07 | 0 | 0.97 | 0.07 | 0.76 | 0.74 | 0.4 | 2.25 | 0.24 | 3.38 |
185 | exec - advec_mom.cpp:74-75 | advec_mom_kernel(int, int, int, int, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&... | Innermost | 2.28 | 2.43 | 2.41 | 2.34 | 2.32 | 2.32 | 2.59 | 35.04 | 19.79 | 9.9 | 5.02 | 2.79 | 2.29 | 1.99 | 34.94 | 19.68 | 9.84 | 4.99 | 2.75 | 2.08 | 1.95 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 1.18 | 2.07 | 4.14 | 8.16 | 14.82 | 19.80 | 21.03 | 100 | 100 | 1 | 1 | 1 | 1 | 1.01 | 1.01 | 1.01 | 1.02 | 1.11 | 1.03 | 1 | 5 | 0 | 0 | 0 | 1 | 0 | 0.89 | 0.27 | 0.89 | 0.27 | 0.88 | 0.29 | 0.79 | 0.48 | 0.52 | 1.1 | 0.32 | 1.76 |
183 | exec - advec_mom.cpp:65-66 | advec_mom_kernel(int, int, int, int, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&... | Innermost | 2.19 | 2.28 | 2.27 | 2.22 | 2.25 | 2.27 | 2.52 | 33.58 | 18.59 | 9.34 | 4.77 | 2.71 | 2.24 | 1.96 | 33.52 | 18.51 | 9.25 | 4.74 | 2.66 | 2.04 | 1.9 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 1.21 | 2.22 | 4.44 | 8.63 | 15.40 | 19.82 | 21.09 | 100 | 100 | 1 | 1 | 1 | 1 | 1.01 | 1.01 | 1.01 | 1.02 | 1.11 | 1.04 | 0 | 5 | 0 | 0 | 0 | 1 | 0 | 0.91 | 0.22 | 0.91 | 0.21 | 0.88 | 0.26 | 0.79 | 0.48 | 0.51 | 1.1 | 0.32 | 1.73 |
171 | exec - advec_cell.cpp:211-216 | advec_cell_kernel(int, int, int, int, int, int, clover::Buffer1D<double>&, clover::Buffer1D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<dou... | Innermost | 2.18 | 2.01 | 1.99 | 2.11 | 2.54 | 3.04 | 3.55 | 33.43 | 16.28 | 8.15 | 4.51 | 3.03 | 3.03 | 2.71 | 33.43 | 16.31 | 8.13 | 4.5 | 3 | 2.73 | 2.67 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 6.14 | 12.58 | 25.33 | 45.76 | 68.60 | 75.24 | 77.32 | 100 | 100 | 1 | 1 | 1 | 1 | 1 | 1 | 1.01 | 1.01 | 1.12 | 1.02 | 1 | 9 | 0 | 0 | 0 | 1 | 0 | 1.02 | 0 | 1.03 | 0 | 0.93 | 0.15 | 0.7 | 0.77 | 0.38 | 1.88 | 0.22 | 2.76 |
164 | exec - advec_cell.cpp:120-125 | advec_cell_kernel(int, int, int, int, int, int, clover::Buffer1D<double>&, clover::Buffer1D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<dou... | Innermost | 2.08 | 1.97 | 1.96 | 2.09 | 2.53 | 3.04 | 3.55 | 31.97 | 16.03 | 8.02 | 4.46 | 3.03 | 3.02 | 2.74 | 31.9 | 15.98 | 7.98 | 4.46 | 3 | 2.72 | 2.67 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 6.44 | 12.80 | 25.81 | 46.17 | 68.65 | 75.73 | 76.36 | 100 | 100 | 1 | 1 | 1 | 1 | 1.01 | 1.01 | 1.01 | 1.01 | 1.12 | 1.03 | 1 | 9 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.89 | 0.22 | 0.66 | 0.85 | 0.37 | 1.93 | 0.21 | 2.79 |
280 | exec - reset_field.cpp:47-48 | reset_field_kernel(int, int, int, int, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&a... | Innermost | 1.96 | 2.32 | 2.3 | 2.22 | 2.22 | 2.29 | 2.6 | 30.14 | 18.8 | 9.44 | 4.81 | 2.68 | 2.24 | 2.03 | 30.05 | 18.8 | 9.4 | 4.75 | 2.63 | 2.05 | 1.95 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 100 | 100 | 1 | 1 | 1 | 1 | 1.01 | 1.01 | 1.02 | 1.02 | 1.1 | 1.04 | 1 | 4 | 0 | 0 | 0 | 1 | 0 | 0.8 | 0.47 | 0.8 | 0.46 | 0.79 | 0.46 | 0.71 | 0.63 | 0.46 | 1.24 | 0.28 | 1.88 |
285 | exec - revert.cpp:37-38 | revert_kernel(int, int, int, int, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&) [clone ._omp_fn.0] | Innermost | 1.84 | 2.07 | 2.06 | 2 | 1.97 | 2 | 2.26 | 28.27 | 17.15 | 8.6 | 4.38 | 2.4 | 1.97 | 1.73 | 28.28 | 16.78 | 8.39 | 4.26 | 2.33 | 1.79 | 1.7 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 100 | 100 | 1 | 1 | 1 | 1 | 1.03 | 1.03 | 1.04 | 1.03 | 1.11 | 1.02 | 1 | 4 | 0 | 0 | 0 | 1 | 0 | 0.84 | 0.33 | 0.84 | 0.32 | 0.83 | 0.34 | 0.76 | 0.48 | 0.49 | 1.01 | 0.3 | 1.59 |
282 | exec - reset_field.cpp:37-38 | reset_field_kernel(int, int, int, int, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&a... | Innermost | 1.84 | 2.06 | 2.05 | 1.98 | 1.97 | 2 | 2.27 | 28.3 | 17.12 | 8.62 | 4.37 | 2.42 | 1.99 | 1.74 | 28.26 | 16.7 | 8.38 | 4.23 | 2.33 | 1.8 | 1.7 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 100 | 100 | 1 | 1 | 1 | 1 | 1.03 | 1.03 | 1.04 | 1.04 | 1.12 | 1.02 | 1 | 4 | 0 | 0 | 0 | 1 | 0 | 0.85 | 0.32 | 0.84 | 0.32 | 0.84 | 0.33 | 0.76 | 0.48 | 0.49 | 1.02 | 0.3 | 1.6 |
160 | exec - advec_cell.cpp:47-48 | advec_cell_kernel(int, int, int, int, int, int, clover::Buffer1D<double>&, clover::Buffer1D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<dou... | Innermost | 1.72 | 1.37 | 1.35 | 1.33 | 1.36 | 1.35 | 1.51 | 26.59 | 11.47 | 6.03 | 3.18 | 1.8 | 1.36 | 1.19 | 26.41 | 11.12 | 5.5 | 2.84 | 1.61 | 1.21 | 1.13 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 2.35 | 5.62 | 11.25 | 21.77 | 38.35 | 51.09 | 55.49 | 100 | 100 | 1 | 1 | 1 | 1.01 | 1.04 | 1.1 | 1.13 | 1.13 | 1.13 | 1.05 | 0 | 5 | 2 | 0 | 0 | 1 | 0 | 1.19 | 0 | 1.2 | 0 | 1.16 | 0 | 1.03 | 0 | 0.68 | 0.43 | 0.42 | 0.88 |
166 | exec - advec_cell.cpp:139-140 | advec_cell_kernel(int, int, int, int, int, int, clover::Buffer1D<double>&, clover::Buffer1D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<dou... | Innermost | 1.68 | 1.34 | 1.31 | 1.3 | 1.33 | 1.32 | 1.47 | 25.87 | 11.15 | 5.92 | 3.08 | 1.77 | 1.33 | 1.16 | 25.81 | 10.83 | 5.34 | 2.77 | 1.57 | 1.18 | 1.11 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 2.35 | 5.64 | 11.32 | 21.81 | 38.50 | 51.13 | 54.79 | 100 | 100 | 1 | 1 | 1 | 1 | 1.03 | 1.11 | 1.12 | 1.13 | 1.14 | 1.05 | 0 | 5 | 2 | 0 | 0 | 1 | 0 | 1.19 | 0 | 1.21 | 0 | 1.16 | 0 | 1.03 | 0 | 0.68 | 0.42 | 0.42 | 0.86 |
168 | exec - advec_cell.cpp:149-150 | advec_cell_kernel(int, int, int, int, int, int, clover::Buffer1D<double>&, clover::Buffer1D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<dou... | Innermost | 1.28 | 1.19 | 1.18 | 1.16 | 1.17 | 1.16 | 1.28 | 19.64 | 9.64 | 4.81 | 2.52 | 1.44 | 1.15 | 1.03 | 19.6 | 9.63 | 4.8 | 2.47 | 1.39 | 1.04 | 0.96 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 1.05 | 2.15 | 4.41 | 8.58 | 15.20 | 20.34 | 21.57 | 100 | 100 | 1 | 1 | 1 | 1 | 1.01 | 1 | 1.03 | 1.04 | 1.12 | 1.07 | 0 | 4 | 1 | 0 | 0 | 1 | 0 | 1.02 | 0 | 1.02 | 0 | 0.99 | 0.01 | 0.88 | 0.14 | 0.59 | 0.48 | 0.36 | 0.81 |
162 | exec - advec_cell.cpp:57-58 | advec_cell_kernel(int, int, int, int, int, int, clover::Buffer1D<double>&, clover::Buffer1D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<dou... | Innermost | 1.23 | 1.19 | 1.18 | 1.16 | 1.15 | 1.13 | 1.26 | 18.97 | 9.68 | 4.84 | 2.5 | 1.43 | 1.14 | 1 | 18.93 | 9.66 | 4.82 | 2.47 | 1.36 | 1.02 | 0.95 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 1.07 | 2.13 | 4.25 | 8.29 | 14.92 | 19.74 | 21.35 | 100 | 100 | 1 | 1 | 1 | 1 | 1.01 | 1.01 | 1.02 | 1.06 | 1.13 | 1.06 | 0 | 4 | 1 | 0 | 0 | 1 | 0 | 0.98 | 0.02 | 0.98 | 0.02 | 0.96 | 0.05 | 0.87 | 0.15 | 0.58 | 0.47 | 0.36 | 0.81 |
195 | exec - advec_mom.cpp:160-160 | advec_mom_kernel(int, int, int, int, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&... | Innermost | 0.94 | 0.97 | 0.96 | 0.93 | 0.9 | 0.85 | 0.95 | 14.45 | 8.2 | 4.14 | 2.08 | 1.13 | 0.84 | 0.8 | 14.42 | 7.82 | 3.92 | 1.98 | 1.06 | 0.76 | 0.72 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 5.68 | 10.48 | 21.25 | 42.04 | 78.63 | 109.56 | 112.96 | 100 | 100 | 1 | 1 | 1 | 1 | 1.05 | 1.06 | 1.06 | 1.07 | 1.12 | 1.11 | 0 | 5 | 0 | 0 | 0 | 1 | 0 | 0.92 | 0.08 | 0.92 | 0.08 | 0.91 | 0.08 | 0.85 | 0.13 | 0.59 | 0.35 | 0.36 | 0.61 |
189 | exec - advec_mom.cpp:88-88 | advec_mom_kernel(int, int, int, int, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&... | Innermost | 0.94 | 0.98 | 0.98 | 0.95 | 0.92 | 0.87 | 0.96 | 14.47 | 8.29 | 4.22 | 2.14 | 1.17 | 0.86 | 0.77 | 14.45 | 7.9 | 3.98 | 2.02 | 1.09 | 0.78 | 0.72 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 5.67 | 10.36 | 20.59 | 40.50 | 75.32 | 105.36 | 114.25 | 100 | 100 | 1 | 1 | 1 | 1 | 1.05 | 1.06 | 1.06 | 1.07 | 1.12 | 1.07 | 0 | 5 | 0 | 0 | 0 | 1 | 0 | 0.91 | 0.08 | 0.91 | 0.09 | 0.89 | 0.1 | 0.83 | 0.16 | 0.58 | 0.37 | 0.36 | 0.62 |
229 | exec - context.h:69-69 [...] | field_summary(global_variables&, parallel_&) [clone ._omp_fn.0] | Single | 0.73 | 0.75 | 0.75 | 0.72 | 0.64 | 0.43 | 0.29 | 11.25 | 6.07 | 3.07 | 1.54 | 0.78 | 0.4 | 0.24 | 11.24 | 6.08 | 3.05 | 1.53 | 0.76 | 0.38 | 0.22 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 6.51 | 12.03 | 23.98 | 47.79 | 96.26 | 192.62 | 331.44 | 46.15 | 17.79 | 1.99 | 2.65 | 10.6 | 1 | 1 | 1.01 | 1.01 | 1.03 | 1.05 | 1.09 | 1 | 0 | 0 | 7 | 0 | 1 | 0 | 0.92 | 0.06 | 0.92 | 0.06 | 0.92 | 0.06 | 0.92 | 0.05 | 0.92 | 0.03 | 0.91 | 0.03 |
236 | exec - generate_chunk.cpp:77-80 | generate_chunk(int, global_variables&) [clone ._omp_fn.0] | Innermost | 0.05 | 0.06 | 0.06 | 0.06 | 0.06 | 0.04 | 0.04 | 0.83 | 0.53 | 0.27 | 0.14 | 0.07 | 0.04 | 0.04 | 0.82 | 0.51 | 0.26 | 0.13 | 0.07 | 0.04 | 0.03 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 50 | 56.25 | 1 | 1 | 1 | 1.01 | 1.04 | 1.04 | 1.08 | 1 | 1 | 1.33 | 4 | 4 | 0 | 0 | 0 | 1 | 0 | 0.8 | 0.01 | 0.79 | 0.01 | 0.79 | 0.01 | 0.73 | 0.02 | 0.64 | 0.01 | 0.49 | 0.02 |
249 | exec - initialise_chunk.cpp:80-82 | initialise_chunk(int, global_variables&) [clone ._omp_fn.4] | Innermost | 0.04 | 0.05 | 0.05 | 0.05 | 0.05 | 0.04 | 0.03 | 0.65 | 0.43 | 0.23 | 0.11 | 0.06 | 0.04 | 0.03 | 0.65 | 0.38 | 0.2 | 0.1 | 0.05 | 0.03 | 0.02 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 80 | 82.5 | 1 | 1 | 1 | 1 | 1.13 | 1.15 | 1.1 | 1.2 | 1.33 | 1.5 | 2 | 4 | 0 | 0 | 0 | 1 | 0 | 0.86 | 0.01 | 0.81 | 0.01 | 0.81 | 0.01 | 0.81 | 0.01 | 0.68 | 0.01 | 0.58 | 0.01 |
255 | exec - pack_kernel.cpp:57-59 [...] | clover_pack_message_left(global_variables&, int, int, int, int, clover::Buffer2D<double>&, clover::Buffer1D<double>&, int, int, int, int, int, int, int) [clone ._omp_fn.0] | Outermost | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.67 | 0.33 | 0.17 | 0.11 | 0.07 | 0.05 | 0.06 | 0.34 | 0.16 | 0.08 | 0.04 | 0.03 | 0.02 | 0.01 | 1 | 2 | 4 | 8 | 16 | 32 | 56 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 11.96 | 1.93 | 1 | 11.75 | 1 | 1.03 | 1.06 | 1.57 | 1.4 | 1.67 | 2 | NA | NA | NA | NA | NA | 1 | 0 | 1.06 | -0 | 1.06 | -0 | 1.06 | -0 | 0.71 | 0.01 | 0.53 | 0.01 | 0.61 | 0.01 |
239 | exec - context.h:46-69 [...] | generate_chunk(int, global_variables&) [clone ._omp_fn.1] | Innermost | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.58 | 0.46 | 0.36 | 0.22 | 0.11 | 0.06 | 0.04 | 0.38 | 0.2 | 0.1 | 0.05 | 0.02 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 72 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 12.5 | 2.98 | 1 | 8 | 1.53 | 2.3 | 3.6 | 4.4 | 5.5 | 6 | 4 | NA | NA | NA | NA | NA | 1 | 0 | 0.95 | 0 | 0.95 | 0 | 0.95 | 0 | 1.19 | -0 | 1.19 | -0 | 0.68 | 0 |
232 | exec - flux_calc.cpp:38-40 [...] | flux_calc_kernel(int, int, int, int, double, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double... | Outermost | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.24 | 0.2 | 0.13 | 0.07 | 0.03 | 0.02 | 0.02 | 0.23 | 0.17 | 0.1 | 0.04 | 0.02 | 0.01 | 0 | 2 | 4 | 8 | 16 | 32 | 63 | 105 | 2.13 | 3.14 | 4.91 | 14.50 | 29.05 | 49.15 | 0.00 | 26.45 | 28.27 | 2.62 | 1.06 | 1.31 | 1.04 | 1.18 | 1.3 | 1.75 | 1.5 | 2 | 2 | NA | NA | NA | NA | NA | 1 | 0 | 0.68 | 0.01 | 0.57 | 0.01 | 0.72 | 0.01 | 0.72 | 0.01 | 0.72 | 0 | 1 | 0 |
259 | exec - pack_kernel.cpp:122-124 [...] | clover_pack_message_right(global_variables&, int, int, int, int, clover::Buffer2D<double>&, clover::Buffer1D<double>&, int, int, int, int, int, int, int) [clone ._omp_fn.0] | Outermost | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.54 | 0.35 | 0.15 | 0.11 | 0.09 | 0.08 | 0.08 | 0.27 | 0.17 | 0.06 | 0.04 | 0.02 | 0.02 | 0.02 | 1 | 2 | 4 | 8 | 16 | 32 | 56 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 11.72 | 2.06 | 1 | 12.22 | 1 | 1.03 | 1.25 | 1.22 | 1.8 | 2.67 | 2.67 | NA | NA | NA | NA | NA | 1 | 0 | 0.79 | 0 | 1.13 | -0 | 0.84 | 0 | 0.84 | 0 | 0.42 | 0.01 | 0.24 | 0.02 |
257 | exec - pack_kernel.cpp:90-92 [...] | clover_unpack_message_left(global_variables&, int, int, int, int, clover::Buffer2D<double>&, clover::Buffer1D<double>&, int, int, int, int, int, int, int) [clone ._omp_fn.0] | Outermost | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.54 | 0.34 | 0.14 | 0.09 | 0.04 | 0.04 | 0.05 | 0.27 | 0.15 | 0.06 | 0.02 | 0.01 | 0.01 | 0.01 | 1 | 2 | 4 | 8 | 16 | 32 | 55 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 11.72 | 2.09 | 1 | 12.27 | 1 | 1.1 | 1.17 | 1.8 | 1.33 | 2 | 2.5 | NA | NA | NA | NA | NA | 1 | 0 | 0.9 | 0 | 1.13 | -0 | 1.69 | 0 | 1.69 | 0 | 0.84 | 0 | 0.48 | 0.01 |
175 | exec - advec_cell.cpp:159-202 [...] | advec_cell_kernel(int, int, int, int, int, int, clover::Buffer1D<double>&, clover::Buffer1D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<double>&, clover::Buffer2D<dou... | Outermost | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.12 | 0.11 | 0.05 | 0.04 | 0.03 | 0.04 | 0.02 | 0.12 | 0.07 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 61 | 87 | 2.00 | 4.91 | 9.87 | 13.20 | 32.40 | 38.00 | 30.00 | 13.21 | 19.1 | 4.05 | 1 | 1.39 | 1 | 1.57 | 1.67 | 2 | 3 | 4 | 2 | NA | NA | NA | NA | NA | 1 | 0 | 0.86 | 0 | 1 | 0 | 0.75 | 0 | 0.75 | 0 | 0.38 | 0.01 | 0.21 | 0.01 |
261 | exec - pack_kernel.cpp:158-160 [...] | clover_unpack_message_right(global_variables&, int, int, int, int, clover::Buffer2D<double>&, clover::Buffer1D<double>&, int, int, int, int, int, int, int) [clone ._omp_fn.0] | Outermost | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.38 | 0.23 | 0.16 | 0.08 | 0.06 | 0.05 | 0.04 | 0.19 | 0.11 | 0.06 | 0.03 | 0.01 | 0.01 | 0.01 | 1 | 2 | 4 | 8 | 16 | 30 | 54 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 11.96 | 1.93 | 1 | 11.75 | 1 | 1 | 1.33 | 1.6 | 3 | 2.5 | 2 | NA | NA | NA | NA | NA | 1 | 0 | 0.86 | 0 | 0.79 | 0 | 0.79 | 0 | 1.19 | -0 | 0.59 | 0 | 0.34 | 0.01 |