Loops
▶advec_mom_kernel.f90: 81 - 44.27%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 166 | 2.68 | 2.68 | 6.28 | 98.21 | 42.08 | 3.35 | 165 | 1.78 | 1.78 | 4.9 | 94.77 | 81.02 | 2.92 | 166 | 2.62 | 2.62 | 5.77 | 98.21 | 42.08 | 4.02 | 165 | 1.76 | 1.76 | 4.36 | 94.77 | 81.02 | 2.94 | ||||||||||||||||||||||||||||
| 175 | 3.36 | 3.36 | 7.88 | 91.86 | 42.13 | 3.07 | 157 | 1.48 | 1.48 | 4.08 | 97.82 | 77.38 | 3.53 | 175 | 3.29 | 3.29 | 7.23 | 91.86 | 42.13 | 3.11 | 157 | 1.52 | 1.52 | 3.77 | 97.82 | 77.38 | 3.4 | ||||||||||||||||||||||||||||
▶PdV_kernel.f90: 111 - 31.81%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 105 | 2.24 | 2.24 | 5.24 | 100 | 50 | 4.08 | 154 | 0.01 | 0.01 | 0.03 | 100 | 98.21 | 4.4 | 109 | 2.19 | 2.19 | 6.05 | 100 | 100 | 4.39 | 105 | 2.16 | 2.16 | 4.75 | 100 | 50 | 4.04 | 206 | 2.11 | 2.1 | 4.9 | 100 | 50 | 4.05 | 109 | 2.15 | 2.15 | 5.31 | 100 | 100 | 4.43 | ||||||||||||||
| 153 | 2.14 | 2.14 | 5.5 | 100 | 100 | 4.7 | 207 | 0.01 | 0.01 | 0.02 | 0 | 12.5 | 1.05 | ||||||||||||||||||||||||||||||||||||||||||
| 205 | 0 | 0 | 0.01 | 0 | 12.5 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶flux_calc_kernel.f90: 56 - 28.28%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 267 | 1.47 | 1.47 | 3.44 | 100 | 50 | 2.16 | 342 | 2.63 | 2.63 | 6.78 | 100 | 100 | 1.21 | 235 | 1.7 | 1.7 | 4.7 | 100 | 100 | 1.87 | 267 | 1.69 | 1.69 | 3.71 | 100 | 50 | 1.88 | 447 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 235 | 2.3 | 2.3 | 5.69 | 100 | 100 | 1.4 | ||||||||||||||
| 343 | 0.02 | 0.02 | 0.05 | 100 | 95.83 | 0.6 | 446 | 1.67 | 1.67 | 3.9 | 100 | 50 | 1.88 | ||||||||||||||||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 83 - 25.37%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 118 | 1.95 | 1.95 | 4.55 | 97.67 | 43.24 | 4.04 | 121 | 1.24 | 1.24 | 3.43 | 97.54 | 82.07 | 5.33 | 118 | 1.91 | 1.91 | 4.21 | 97.67 | 43.24 | 4.01 | 121 | 1.25 | 1.25 | 3.1 | 97.54 | 82.07 | 5.24 | ||||||||||||||||||||||||||||
| 129 | 1.71 | 1.71 | 4.73 | 98.01 | 83.06 | 3.99 | 129 | 2.16 | 2.16 | 5.35 | 98.01 | 83.06 | 3.16 | ||||||||||||||||||||||||||||||||||||||||||
▶calc_dt_kernel.f90: 92 - 25.24%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 244 | 2.64 | 2.65 | 6.19 | 95.32 | 48.25 | 3.04 | 211 | 2.55 | 2.55 | 7.03 | 95.25 | 95.84 | 3.24 | 244 | 2.61 | 2.61 | 5.75 | 95.32 | 48.25 | 3.03 | 211 | 2.54 | 2.54 | 6.27 | 95.25 | 95.84 | 3.24 | ||||||||||||||||||||||||||||
▶accelerate_kernel.f90: 62 - 24.59%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 111 | 1.77 | 1.77 | 4.14 | 100 | 50 | 4.16 | 165 | 1.59 | 1.59 | 4.09 | 100 | 100 | 6.34 | 115 | 1.68 | 1.68 | 4.64 | 100 | 100 | 6.23 | 111 | 1.66 | 1.66 | 3.65 | 100 | 50 | 4.14 | 221 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 115 | 1.74 | 1.74 | 4.28 | 100 | 100 | 6.11 | ||||||||||||||
| 222 | 1.62 | 1.62 | 3.78 | 100 | 50 | 4.2 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶viscosity_kernel.f90: 56 - 23.03%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 679 | 3.64 | 3.64 | 8.45 | 95.4 | 48.28 | 4.11 | 1221 | 3.47 | 3.47 | 8.94 | 100 | 100 | 5.36 | 1351 | 2.4 | 2.4 | 5.6 | 93.86 | 47.7 | 4.1 | |||||||||||||||||||||||||||||||||||
| 1352 | 0.01 | 0.01 | 0.03 | 8.22 | 13.53 | 1.45 | |||||||||||||||||||||||||||||||||||||||||||||||||
| 1350 | 0 | 0 | 0.01 | 8.22 | 13.53 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶ideal_gas_kernel.f90: 50 - 22.51%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 393 | 1.77 | 1.77 | 4.56 | 100 | 100 | 3.47 | 280 | 1.84 | 1.84 | 5.07 | 100 | 100 | 2.96 | 321 | 1.82 | 1.81 | 3.99 | 100 | 50 | 2.74 | 503 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 280 | 1.87 | 1.87 | 4.63 | 100 | 100 | 2.88 | |||||||||||||||||||||
| 504 | 1.82 | 1.83 | 4.25 | 100 | 50 | 3.02 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 202 - 22.34%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 36 | 2.49 | 2.49 | 5.76 | 14.04 | 14.25 | 1.05 | 175 | 0 | 0 | 0.01 | 98.29 | 89.5 | NA | 234 | 4.54 | 4.54 | 10.57 | 79.3 | 21.98 | 2.03 | |||||||||||||||||||||||||||||||||||
| 174 | 2.33 | 2.33 | 6 | 97.62 | 84.75 | 4.79 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶calc_dt_kernel.f90: 99 - 20.57%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 222 | 4.63 | 4.63 | 10.73 | 8.72 | 13.59 | 1.04 | 201 | 4.6 | 4.6 | 9.84 | 8.8 | 13.6 | 1.03 | ||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 247 - 19.19%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 173 | 1.31 | 1.31 | 3.07 | 100 | 50 | 1.97 | 228 | 0 | 0 | 0.01 | 100 | 90 | NA | 163 | 1.32 | 1.32 | 3.64 | 100 | 100 | 1.93 | 173 | 1.33 | 1.33 | 2.92 | 100 | 50 | 1.92 | 303 | 1.35 | 1.35 | 3.15 | 100 | 50 | 1.88 | 163 | 1.3 | 1.3 | 3.21 | 100 | 100 | 2 | ||||||||||||||
| 230 | 0.01 | 0.01 | 0.04 | 100 | 90 | 0.8 | 302 | 0 | 0 | 0.01 | 100 | 50 | NA | ||||||||||||||||||||||||||||||||||||||||||
| 229 | 1.21 | 1.21 | 3.12 | 100 | 100 | 2.09 | 301 | 0.01 | 0.01 | 0.02 | 0 | 12.5 | 0 | ||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 183 - 19.16%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 163 | 1.31 | 1.31 | 3.07 | 100 | 50 | 1.9 | 242 | 0.01 | 0.01 | 0.04 | 100 | 90 | 1.6 | 155 | 1.31 | 1.31 | 3.6 | 100 | 100 | 1.94 | 163 | 1.31 | 1.31 | 2.88 | 100 | 50 | 1.94 | 319 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 155 | 1.3 | 1.31 | 3.22 | 100 | 100 | 1.94 | ||||||||||||||
| 243 | 1.29 | 1.29 | 3.33 | 100 | 100 | 1.94 | 320 | 0.01 | 0.01 | 0.02 | 100 | 50 | 1.2 | ||||||||||||||||||||||||||||||||||||||||||
| 321 | 1.28 | 1.28 | 2.99 | 100 | 50 | 1.96 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 152 - 19.03%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 165 | 0 | 0 | 0.01 | 12.26 | 13.78 | NA | 94 | 1.86 | 1.86 | 4.3 | 8.33 | 13.54 | 1.04 | 246 | 2.76 | 2.76 | 7.11 | 100 | 96.21 | 4.61 | 150 | 1.86 | 1.86 | 3.97 | 8.33 | 13.54 | 1 | 325 | 1.55 | 1.55 | 3.61 | 97.44 | 46.79 | 3 | |||||||||||||||||||||
| 247 | 0.01 | 0.01 | 0.03 | 100 | 87.74 | 5.2 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶PdV_kernel.f90: 69 - 18.6%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 102 | 1.9 | 1.9 | 4.45 | 100 | 50 | 3.15 | 107 | 1.87 | 1.87 | 5.15 | 100 | 100 | 3.79 | 102 | 1.91 | 1.9 | 4.19 | 100 | 50 | 3.02 | 107 | 1.95 | 1.95 | 4.81 | 100 | 100 | 3.66 | ||||||||||||||||||||||||||||
▶viscosity_kernel.f90: 53 - 17.73%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1008 | 2.55 | 2.55 | 5.98 | 95.54 | 47.88 | 4.13 | 783 | 1.23 | 1.23 | 3.4 | 95.69 | 90.18 | 8.36 | 1008 | 2.44 | 2.44 | 5.36 | 95.54 | 47.88 | 4.18 | 783 | 1.21 | 1.21 | 2.99 | 95.69 | 90.18 | 8.21 | ||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 110 - 17.73%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 28 | 2.11 | 2.11 | 4.88 | 14.55 | 14.2 | 1.05 | 188 | 1.17 | 1.17 | 3.01 | 100 | 93.15 | 8.15 | 117 | 2.11 | 2.11 | 4.51 | 14.29 | 14.17 | 1.02 | 252 | 2.28 | 2.28 | 5.31 | 99.01 | 46.14 | 4.01 | ||||||||||||||||||||||||||||
| 189 | 0 | 0 | 0.01 | 100 | 92.26 | NA | 253 | 0 | 0 | 0.01 | 17.54 | 14.58 | NA | ||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 203 - 17.59%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 177 | 1.06 | 1.06 | 2.49 | 100 | 50 | 2.81 | 235 | 0.76 | 0.76 | 1.97 | 100 | 100 | 4.6 | 167 | 1.19 | 1.18 | 3.27 | 100 | 100 | 2.68 | 177 | 1.31 | 1.31 | 2.89 | 100 | 50 | 2.4 | 311 | 1.3 | 1.3 | 3.04 | 100 | 50 | 2.6 | 167 | 1.58 | 1.58 | 3.91 | 100 | 100 | 2.03 | ||||||||||||||
| 312 | 0 | 0 | 0.01 | 0 | 12.5 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
| 310 | 0 | 0 | 0.01 | 0 | 12.5 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 204 - 17.18%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 127 | 3.79 | 3.79 | 8.86 | 98.35 | 45.25 | 2.41 | 127 | 3.79 | 3.78 | 8.32 | 98.35 | 45.25 | 2.41 | ||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 138 - 16.79%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 168 | 1.09 | 1.09 | 2.54 | 100 | 50 | 2.75 | 249 | 0.72 | 0.72 | 1.86 | 100 | 100 | 4.81 | 159 | 1.17 | 1.17 | 3.24 | 100 | 100 | 2.74 | 168 | 1.29 | 1.29 | 2.85 | 100 | 50 | 2.43 | 329 | 1.11 | 1.11 | 2.58 | 100 | 50 | 2.98 | 159 | 1.49 | 1.49 | 3.69 | 100 | 100 | 2.13 | ||||||||||||||
| 250 | 0.01 | 0.01 | 0.03 | 100 | 95.24 | 0 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 86 - 15%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 256 | 0 | 0 | 0.01 | 100 | 92.31 | NA | 177 | 1.23 | 1.23 | 3.39 | 100 | 100 | 1 | 188 | 1.31 | 1.31 | 2.89 | 100 | 50 | 1 | 338 | 0 | 0 | 0.01 | 100 | 50 | NA | 177 | 1.5 | 1.5 | 3.72 | 100 | 100 | 0.87 | |||||||||||||||||||||
| 257 | 0.83 | 0.83 | 2.13 | 100 | 100 | 1.56 | 339 | 1.21 | 1.21 | 2.81 | 100 | 50 | 1.07 | ||||||||||||||||||||||||||||||||||||||||||
| 258 | 0.01 | 0.01 | 0.03 | 100 | 92.31 | 0 | 337 | 0 | 0 | 0.01 | 0 | 12.5 | NA | ||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 96 - 14.68%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 260 | 0 | 0 | 0.01 | 100 | 92.31 | NA | 175 | 1.19 | 1.18 | 3.27 | 100 | 100 | 1.05 | 186 | 1.27 | 1.27 | 2.8 | 100 | 50 | 0.99 | 344 | 1.26 | 1.26 | 2.93 | 100 | 50 | 1 | 175 | 1.53 | 1.53 | 3.78 | 100 | 100 | 0.82 | |||||||||||||||||||||
| 261 | 0.73 | 0.73 | 1.88 | 100 | 100 | 1.7 | |||||||||||||||||||||||||||||||||||||||||||||||||
| 262 | 0 | 0 | 0.01 | 100 | 92.31 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 256 - 12.67%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 171 | 1.08 | 1.08 | 2.78 | 100 | 100 | 3.26 | 127 | 1.03 | 1.03 | 2.85 | 100 | 100 | 3.1 | 125 | 1 | 1 | 2.21 | 100 | 50 | 3.06 | 230 | 1.04 | 1.04 | 2.42 | 100 | 50 | 3.01 | 127 | 0.97 | 0.97 | 2.41 | 100 | 100 | 3.29 | |||||||||||||||||||||
▶calc_dt_kernel.f90: 94 - 12.55%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 327 | 0 | 0 | 0.01 | 100 | 98.39 | NA | 425 | 0 | 0 | 0.01 | 8.42 | 13.55 | NA | ||||||||||||||||||||||||||||||||||||||||||
| 326 | 2.54 | 2.54 | 6.54 | 100 | 100 | 4.94 | 427 | 0.02 | 0.02 | 0.05 | 8.42 | 13.55 | 0.55 | ||||||||||||||||||||||||||||||||||||||||||
| 426 | 2.55 | 2.55 | 5.94 | 100 | 50 | 4.07 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶ideal_gas_kernel.f90: 49 - 12.47%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 321 | 1.87 | 1.87 | 4.36 | 100 | 50 | 2.69 | 235 | 1.82 | 1.82 | 4.21 | 100 | 50 | 2.76 | 241 | 1.83 | 1.83 | 3.9 | 100 | 50 | 2.76 | |||||||||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 165 - 12.09%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 185 | 0.97 | 0.97 | 2.49 | 100 | 100 | 3.58 | 119 | 1.01 | 1.01 | 2.8 | 100 | 100 | 3.16 | 116 | 0.99 | 0.99 | 2.18 | 100 | 50 | 3.03 | 247 | 0.01 | 0.01 | 0.02 | 0 | 12.5 | 0.45 | 119 | 0.94 | 0.94 | 2.32 | 100 | 100 | 3.4 | |||||||||||||||||||||
| 248 | 0.97 | 0.98 | 2.27 | 100 | 50 | 3.07 | |||||||||||||||||||||||||||||||||||||||||||||||||
| 249 | 0 | 0 | 0.01 | 0 | 12.5 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 213 - 12.06%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 232 | 2.87 | 2.87 | 7.39 | 98.15 | 83.91 | 4.44 | 158 | 2.17 | 2.17 | 4.65 | 7.69 | 13.46 | 1.01 | 308 | 0 | 0 | 0.01 | 18.75 | 14.84 | NA | |||||||||||||||||||||||||||||||||||
| 233 | 0 | 0 | 0.01 | 98.18 | 84.66 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 106 - 11.7%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 269 | 0.59 | 0.59 | 1.52 | 100 | 100 | 1 | 173 | 0.86 | 0.86 | 2.37 | 100 | 100 | 0.74 | 184 | 1.19 | 1.19 | 2.62 | 100 | 50 | 0.53 | 352 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 173 | 1.06 | 1.06 | 2.62 | 100 | 100 | 0.6 | |||||||||||||||||||||
| 355 | 0 | 0 | 0.01 | 0 | 12.5 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
| 354 | 1.1 | 1.1 | 2.55 | 100 | 50 | 0.58 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 116 - 11.45%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 264 | 0 | 0 | 0.01 | 100 | 88.89 | NA | 171 | 0.86 | 0.86 | 2.38 | 100 | 100 | 0.78 | 181 | 1.14 | 1.14 | 2.52 | 100 | 50 | 0.57 | 350 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 171 | 1.07 | 1.07 | 2.64 | 100 | 100 | 0.62 | |||||||||||||||||||||
| 265 | 0.59 | 0.59 | 1.53 | 100 | 100 | 1.09 | 349 | 1.01 | 1.02 | 2.36 | 100 | 50 | 0.62 | ||||||||||||||||||||||||||||||||||||||||||
▶reset_field_kernel.f90: 62 - 10.99%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 535 | 0.02 | 0.02 | 0.05 | 100 | 83.33 | 0 | 364 | 0.77 | 0.77 | 2.12 | 100 | 100 | 0 | 432 | 1.19 | 1.19 | 2.63 | 100 | 50 | 0 | 655 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 364 | 0.95 | 0.95 | 2.35 | 100 | 100 | 0 | |||||||||||||||||||||
| 534 | 0.54 | 0.54 | 1.39 | 100 | 100 | 0 | 653 | 0 | 0 | 0.01 | 0 | 12.5 | NA | ||||||||||||||||||||||||||||||||||||||||||
| 654 | 1.04 | 1.04 | 2.43 | 100 | 50 | 0 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶reset_field_kernel.f90: 52 - 10.92%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 537 | 0.56 | 0.56 | 1.44 | 100 | 100 | 0 | 366 | 0.77 | 0.77 | 2.12 | 100 | 100 | 0 | 435 | 1.13 | 1.13 | 2.48 | 100 | 50 | 0 | 659 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 366 | 0.94 | 0.94 | 2.33 | 100 | 100 | 0 | |||||||||||||||||||||
| 538 | 0.02 | 0.02 | 0.05 | 100 | 83.33 | 0 | 658 | 1.07 | 1.07 | 2.49 | 100 | 50 | 0 | ||||||||||||||||||||||||||||||||||||||||||
▶revert_kernel.f90: 47 - 10.55%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 552 | 0.55 | 0.55 | 1.41 | 100 | 100 | 0 | 376 | 0.76 | 0.76 | 2.09 | 100 | 100 | 0 | 446 | 1.03 | 1.03 | 2.26 | 100 | 50 | 0 | 678 | 1.03 | 1.04 | 2.41 | 100 | 50 | 0 | 376 | 0.96 | 0.96 | 2.38 | 100 | 100 | 0 | |||||||||||||||||||||
▶PdV_kernel.f90: 114 - 10.33%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5 | 2.35 | 2.35 | 5.45 | 95 | 48.13 | 3.17 | 99 | 2.28 | 2.28 | 4.88 | 95 | 48.13 | 3.51 | ||||||||||||||||||||||||||||||||||||||||||
▶PdV_kernel.f90: 82 - 8.7%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7 | 1.95 | 1.95 | 4.52 | 94.44 | 47.92 | 3.4 | 101 | 1.95 | 1.95 | 4.18 | 94.44 | 47.92 | 3.93 | ||||||||||||||||||||||||||||||||||||||||||
▶PdV_kernel.f90: 75 - 8.68%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 156 | 1.77 | 1.77 | 4.56 | 100 | 100 | 5.51 | 209 | 0 | 0 | 0.01 | 3.03 | 12.88 | NA | ||||||||||||||||||||||||||||||||||||||||||
| 211 | 0.01 | 0.01 | 0.02 | 3.03 | 12.88 | 1 | |||||||||||||||||||||||||||||||||||||||||||||||||
| 210 | 1.75 | 1.75 | 4.09 | 100 | 50 | 4.19 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 85 - 8.34%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 188 | 1.05 | 1.05 | 2.47 | 100 | 50 | 1.26 | 108 | 1.12 | 1.12 | 2.58 | 100 | 50 | 1.15 | 164 | 1.54 | 1.54 | 3.29 | 100 | 50 | 0.84 | |||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 95 - 8.25%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 186 | 1.03 | 1.03 | 2.41 | 100 | 50 | 1.23 | 110 | 1.1 | 1.1 | 2.55 | 100 | 50 | 1.15 | 166 | 1.54 | 1.54 | 3.29 | 100 | 50 | 0.82 | |||||||||||||||||||||||||||||||||||
▶accelerate_kernel.f90: 67 - 7.95%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 11 | 1.8 | 1.8 | 4.16 | 100 | 50 | 4.26 | 111 | 1.77 | 1.77 | 3.79 | 100 | 50 | 4.11 | ||||||||||||||||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 90 - 7.88%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 192 | 0.4 | 0.4 | 1.04 | 100 | 100 | 2.4 | 125 | 0.65 | 0.65 | 1.8 | 100 | 100 | 1.53 | 122 | 0.64 | 0.64 | 1.42 | 100 | 50 | 1.53 | 256 | 0 | 0 | 0.01 | 100 | 50 | NA | 125 | 0.84 | 0.84 | 2.07 | 100 | 100 | 1.14 | |||||||||||||||||||||
| 257 | 0.66 | 0.66 | 1.54 | 100 | 50 | 1.48 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 182 - 7.83%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 178 | 0.37 | 0.37 | 0.95 | 100 | 100 | 2.53 | 133 | 0.65 | 0.65 | 1.79 | 100 | 100 | 1.51 | 132 | 0.63 | 0.63 | 1.4 | 100 | 50 | 1.5 | 239 | 0.71 | 0.71 | 1.65 | 100 | 50 | 1.32 | 133 | 0.82 | 0.82 | 2.04 | 100 | 100 | 1.15 | |||||||||||||||||||||
▶viscosity_kernel.f90: 55 - 7.71%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 715 | 3.61 | 3.61 | 7.71 | 95.4 | 48.28 | 4.06 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 255 - 7.19%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 125 | 1.07 | 1.08 | 2.52 | 100 | 50 | 2.73 | 34 | 1.06 | 1.07 | 2.47 | 100 | 50 | 2.81 | 123 | 1.03 | 1.03 | 2.2 | 100 | 50 | 3.03 | |||||||||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 164 - 7%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 116 | 1.03 | 1.04 | 2.42 | 100 | 50 | 2.92 | 26 | 1.06 | 1.06 | 2.44 | 100 | 50 | 2.8 | 115 | 1 | 1 | 2.14 | 100 | 50 | 3.05 | |||||||||||||||||||||||||||||||||||
▶flux_calc_kernel.f90: 58 - 6.88%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 144 | 1.44 | 1.44 | 3.34 | 80 | 42.5 | 2.33 | 212 | 1.65 | 1.65 | 3.54 | 80 | 42.5 | 2.02 | ||||||||||||||||||||||||||||||||||||||||||
▶reset_field_kernel.f90: 61 - 6.67%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 432 | 0.87 | 0.87 | 2.04 | 100 | 50 | 0 | 203 | 0.89 | 0.89 | 2.06 | 100 | 50 | 0 | 322 | 1.2 | 1.21 | 2.57 | 100 | 50 | 0 | |||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 105 - 6.56%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 184 | 0.83 | 0.83 | 1.94 | 100 | 50 | 0.75 | 112 | 0.87 | 0.87 | 2 | 100 | 50 | 0.73 | 168 | 1.22 | 1.22 | 2.62 | 100 | 50 | 0.51 | |||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 115 - 6.56%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 181 | 0.85 | 0.85 | 1.99 | 100 | 50 | 0.74 | 114 | 0.88 | 0.87 | 2.03 | 100 | 50 | 0.72 | 170 | 1.19 | 1.19 | 2.54 | 100 | 50 | 0.55 | |||||||||||||||||||||||||||||||||||
▶reset_field_kernel.f90: 51 - 6.2%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 435 | 0.88 | 0.88 | 2.06 | 100 | 50 | 0 | 205 | 0.78 | 0.78 | 1.81 | 100 | 50 | 0 | 324 | 1.09 | 1.09 | 2.33 | 100 | 50 | 0 | |||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 193 - 6.02%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179 | 0.36 | 0.36 | 0.85 | 100 | 50 | 3.11 | 240 | 0 | 0 | 0.01 | 100 | 90.91 | NA | 169 | 0.33 | 0.33 | 0.92 | 100 | 100 | 3.92 | 179 | 0.48 | 0.48 | 1.07 | 100 | 50 | 2.53 | 317 | 0.01 | 0.01 | 0.02 | 0 | 12.5 | 0.2 | 169 | 0.46 | 0.46 | 1.14 | 100 | 100 | 2.73 | ||||||||||||||
| 239 | 0.34 | 0.34 | 0.89 | 100 | 100 | 3.71 | 316 | 0.47 | 0.47 | 1.11 | 100 | 50 | 2.61 | ||||||||||||||||||||||||||||||||||||||||||
| 315 | 0 | 0 | 0.01 | 100 | 50 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 192 - 6.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 182 | 0.37 | 0.37 | 0.95 | 100 | 100 | 0.86 | 131 | 0.44 | 0.44 | 1.21 | 100 | 100 | 0.68 | 130 | 0.63 | 0.63 | 1.37 | 100 | 50 | 0.53 | 244 | 0.48 | 0.48 | 1.12 | 100 | 50 | 0.68 | 131 | 0.54 | 0.54 | 1.33 | 100 | 100 | 0.59 | |||||||||||||||||||||
| 181 | 0.01 | 0.01 | 0.03 | 100 | 87.5 | 0.4 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶revert_kernel.f90: 46 - 5.97%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 446 | 0.83 | 0.83 | 1.95 | 100 | 50 | 0 | 215 | 0.79 | 0.79 | 1.83 | 100 | 50 | 0 | 334 | 1.02 | 1.02 | 2.19 | 100 | 50 | 0 | |||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 248 - 5.9%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 100 | 1.31 | 1.31 | 3.04 | 100 | 50 | 1.98 | 156 | 1.34 | 1.34 | 2.86 | 100 | 50 | 1.91 | ||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 207 - 5.85%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 104 | 1.13 | 1.13 | 2.63 | 100 | 50 | 2.75 | 160 | 1.5 | 1.5 | 3.22 | 100 | 50 | 2.12 | ||||||||||||||||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 100 - 5.84%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 196 | 0.41 | 0.41 | 1.05 | 100 | 100 | 0.71 | 123 | 0.42 | 0.42 | 1.17 | 100 | 100 | 0.74 | 120 | 0.55 | 0.55 | 1.2 | 100 | 50 | 0.57 | 262 | 0.44 | 0.44 | 1.04 | 100 | 50 | 0.7 | 123 | 0.56 | 0.56 | 1.38 | 100 | 100 | 0.57 | |||||||||||||||||||||
▶advec_mom_kernel.f90: 184 - 5.8%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 92 | 1.33 | 1.33 | 3.08 | 100 | 50 | 1.9 | 148 | 1.27 | 1.27 | 2.72 | 100 | 50 | 2 | ||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 128 - 5.65%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 170 | 0.35 | 0.35 | 0.81 | 100 | 50 | 3.25 | 253 | 0.28 | 0.28 | 0.73 | 100 | 100 | 4.5 | 161 | 0.29 | 0.29 | 0.81 | 100 | 100 | 4.43 | 170 | 0.45 | 0.45 | 0.99 | 100 | 50 | 2.76 | 334 | 0.5 | 0.5 | 1.16 | 100 | 50 | 2.47 | 161 | 0.46 | 0.46 | 1.14 | 100 | 100 | 2.77 | ||||||||||||||
| 335 | 0 | 0 | 0.01 | 0 | 12.5 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 143 - 5.48%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 96 | 1.11 | 1.11 | 2.57 | 100 | 50 | 2.74 | 152 | 1.36 | 1.36 | 2.91 | 100 | 50 | 2.34 | ||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 172 - 5.4%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 102 | 2.33 | 2.33 | 5.4 | 7.89 | 13.49 | 1.01 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 201 - 5.29%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 124 | 2.47 | 2.47 | 5.29 | 13.33 | 14.06 | 1.03 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 214 - 4.35%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 307 | 1.87 | 1.87 | 4.35 | 97.38 | 44.01 | 2.67 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 181 - 4%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 132 | 0.52 | 0.52 | 1.23 | 100 | 50 | 1.83 | 38 | 0.55 | 0.55 | 1.27 | 100 | 50 | 1.73 | 127 | 0.7 | 0.7 | 1.5 | 100 | 50 | 1.36 | |||||||||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 89 - 3.92%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122 | 0.51 | 0.52 | 1.21 | 100 | 50 | 1.87 | 30 | 0.56 | 0.56 | 1.29 | 100 | 50 | 1.73 | 119 | 0.66 | 0.66 | 1.42 | 100 | 50 | 1.48 | |||||||||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 191 - 3.27%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 130 | 0.44 | 0.44 | 1.03 | 100 | 50 | 0.75 | 40 | 0.42 | 0.42 | 0.98 | 100 | 50 | 0.74 | 129 | 0.59 | 0.59 | 1.26 | 100 | 50 | 0.55 | |||||||||||||||||||||||||||||||||||
▶advec_cell_kernel.f90: 99 - 3.19%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 120 | 0.42 | 0.42 | 0.98 | 100 | 50 | 0.72 | 32 | 0.43 | 0.43 | 1 | 100 | 50 | 0.73 | 121 | 0.56 | 0.56 | 1.21 | 100 | 50 | 0.56 | |||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 131 - 1.82%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 98 | 0.32 | 0.32 | 0.75 | 100 | 50 | 3.51 | 154 | 0.5 | 0.5 | 1.07 | 100 | 50 | 2.48 | ||||||||||||||||||||||||||||||||||||||||||
▶advec_mom_kernel.f90: 196 - 1.81%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 106 | 0.34 | 0.34 | 0.8 | 100 | 50 | 3.28 | 162 | 0.47 | 0.47 | 1.01 | 100 | 50 | 2.65 | ||||||||||||||||||||||||||||||||||||||||||
▶field_summary_kernel.f90: 62 - 1.37%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 255 | 0.09 | 0.09 | 0.21 | 90 | 43.75 | 4.62 | 255 | 0.16 | 0.16 | 0.37 | 0 | 12.5 | 1.07 | 222 | 0.05 | 0.05 | 0.14 | 90.91 | 78.41 | 14.17 | 255 | 0.1 | 0.1 | 0.22 | 90 | 43.75 | 3.52 | 208 | 0.13 | 0.13 | 0.28 | 0 | 12.5 | 1.1 | 222 | 0.06 | 0.06 | 0.15 | 90.91 | 78.41 | 10.41 | ||||||||||||||
▶field_summary_kernel.f90: 58 - 0.6%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 336 | 0.12 | 0.12 | 0.32 | 100 | 100 | 8.47 | 439 | 0.12 | 0.12 | 0.27 | 100 | 50 | 4 | ||||||||||||||||||||||||||||||||||||||||||
| 337 | 0 | 0 | 0.01 | 100 | 96.67 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶<unknown>: 0 - 0.24%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1902 | 0.02 | 0.02 | 0.06 | 100 | 50 | 0 | 154 | 0 | 0 | 0 | 0 | 0 | NA | 244 | 0 | 0 | 0 | 0 | 0 | NA | 1675 | 0.02 | 0.02 | 0.06 | 100 | 50 | 0 | 1902 | 0.01 | 0.01 | 0.03 | 100 | 50 | 0 | 353 | 0 | 0 | 0 | 0 | 0 | NA | 1675 | 0.03 | 0.03 | 0.07 | 100 | 50 | 0 | |||||||
| 162 | 0 | 0 | 0 | 0 | 0 | NA | 252 | 0 | 0 | 0 | 0 | 0 | NA | 1125 | 0 | 0 | 0.01 | 0 | 9.38 | NA | 165 | 0 | 0 | 0 | 0 | 0 | NA | 240 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||
| 172 | 0 | 0 | 0 | 0 | 0 | NA | 254 | 0 | 0 | 0 | 0 | 0 | NA | 172 | 0 | 0 | 0 | 0 | 0 | NA | 243 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||
| 268 | 0 | 0 | 0 | 0 | 0 | NA | 505 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||||||||||||||||
| 166 | 0 | 0 | 0 | 0 | 0 | NA | 229 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||||||||||||||||
| 172 | 0 | 0 | 0 | 0 | 0 | NA | 438 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||||||||||||||||
| 1222 | 0 | 0 | 0 | 0 | 0 | NA | 445 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||||||||||||||||
| 177 | 0 | 0 | 0 | 0 | 0 | NA | 223 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||||||||||||||||
| 236 | 0 | 0 | 0 | 0 | 0 | NA | 440 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||||||||||||||||
| 186 | 0 | 0 | 0 | 0 | 0 | NA | 261 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||||||||||||||||
| 193 | 0 | 0 | 0 | 0 | 0 | NA | 322 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||||||||||||||||
| 266 | 0 | 0 | 0 | 0 | 0 | NA | 251 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||||||||||||||||
| 270 | 0 | 0 | 0 | 0 | 0 | NA | 326 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||||||||||||||||
| 157 | 0 | 0 | 0 | 0 | 0 | NA | 328 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||||||||||||||||
| 238 | 0 | 0 | 0 | 0 | 0 | NA | 347 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||||||||||||||||
| 306 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
| 304 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
| 314 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
| 1632 | 0 | 0 | 0.01 | 0 | 9.38 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
| 324 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
| 231 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
| 332 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
| 333 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
| 330 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
| 342 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
| 340 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 263 - 0.15%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 584 | 0.02 | 0.02 | 0.06 | 0 | 10.94 | 0 | 715 | 0.01 | 0.01 | 0.03 | 0 | 12.5 | 0 | 584 | 0.02 | 0.02 | 0.05 | 0 | 10.94 | 0 | 848 | 0 | 0 | 0.01 | 33.33 | 16.67 | NA | ||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 648 - 0.12%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 479 | 0.01 | 0.02 | 0.04 | 33.33 | 16.67 | 0 | 593 | 0.01 | 0.01 | 0.03 | 33.33 | 16.67 | 0 | 479 | 0.01 | 0.01 | 0.02 | 33.33 | 16.67 | 0 | 722 | 0.01 | 0.01 | 0.03 | 33.33 | 16.67 | 0 | ||||||||||||||||||||||||||||
▶build_field.f90: 103 - 0.08%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 33 | 0.02 | 0.02 | 0.05 | 100 | 100 | 0 | 41 | 0.01 | 0.01 | 0.03 | 100 | 50 | 0 | ||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 558 - 0.08%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 503 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 621 | 0.01 | 0.01 | 0.03 | 0 | 12.5 | 0 | 503 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 751 | 0.01 | 0.01 | 0.03 | 0 | 12.5 | 0 | ||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 177 - 0.07%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 608 | 0.01 | 0.02 | 0.04 | 0 | 10.94 | 0 | 608 | 0 | 0 | 0.01 | 0 | 10.94 | NA | 876 | 0.01 | 0.01 | 0.02 | 33.33 | 16.67 | 0 | |||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 568 - 0.07%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 500 | 0.01 | 0.01 | 0.02 | 0 | 12.5 | 0 | 617 | 0.01 | 0.01 | 0.03 | 0 | 12.5 | 0 | 500 | 0.01 | 0.01 | 0.02 | 0 | 12.5 | 0 | |||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 253 - 0.07%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 587 | 0.01 | 0.02 | 0.04 | 0 | 12.5 | 0 | 719 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 587 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 851 | 0 | 0 | 0.01 | 0 | 12.5 | NA | ||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 658 - 0.07%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 476 | 0.01 | 0.01 | 0.02 | 14.29 | 12.5 | 0 | 589 | 0.01 | 0.01 | 0.04 | 87.5 | 69.53 | 0 | 476 | 0 | 0 | 0.01 | 14.29 | 12.5 | NA | |||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 525 - 0.06%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 512 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 631 | 0.01 | 0.01 | 0.03 | 0 | 12.5 | 0 | 512 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 761 | 0 | 0 | 0.01 | 0 | 12.5 | NA | ||||||||||||||||||||||||||||
▶initialise_chunk_kernel.f90: 119 - 0.06%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 332 | 0 | 0 | 0.01 | 100 | 50 | NA | 404 | 0 | 0 | 0.01 | 100 | 100 | NA | 288 | 0 | 0 | 0.01 | 100 | 100 | NA | 332 | 0 | 0 | 0.01 | 100 | 50 | NA | 516 | 0 | 0 | 0.01 | 100 | 50 | NA | 288 | 0 | 0 | 0.01 | 100 | 100 | NA | ||||||||||||||
▶update_halo_kernel.f90: 306 - 0.05%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 701 | 0.01 | 0.01 | 0.03 | 0 | 12.5 | 0 | 572 | 0.01 | 0.01 | 0.02 | 0 | 10.94 | 0 | ||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 615 - 0.05%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 488 | 0 | 0 | 0.01 | 14.29 | 12.5 | NA | 603 | 0.01 | 0.01 | 0.03 | 87.5 | 69.53 | 0 | 488 | 0 | 0 | 0.01 | 14.29 | 12.5 | NA | |||||||||||||||||||||||||||||||||||
▶generate_chunk_kernel.f90: 89 - 0.05%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 296 | 0 | 0 | 0.01 | 100 | 50 | NA | 256 | 0 | 0 | 0.01 | 100 | 100 | NA | 296 | 0 | 0 | 0.01 | 100 | 50 | NA | 477 | 0 | 0 | 0.01 | 100 | 50 | NA | 256 | 0 | 0 | 0.01 | 100 | 100 | NA | |||||||||||||||||||||
▶build_field.f90: 122 - 0.05%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 30 | 0.01 | 0.01 | 0.04 | 100 | 100 | 0 | 37 | 0 | 0 | 0.01 | 100 | 50 | NA | ||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 515 - 0.04%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 635 | 0.01 | 0.01 | 0.03 | 0 | 12.5 | 0 | 765 | 0 | 0 | 0.01 | 0 | 12.5 | NA | ||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 167 - 0.04%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 611 | 0.01 | 0.01 | 0.02 | 0 | 12.5 | 0 | 747 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 611 | 0 | 0 | 0.01 | 0 | 12.5 | NA | |||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 744 - 0.04%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 452 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 561 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 452 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 690 | 0 | 0 | 0.01 | 33.33 | 16.67 | NA | ||||||||||||||||||||||||||||
▶generate_chunk_kernel.f90: 105 - 0.04%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 290 | 0 | 0 | 0.01 | 100 | 50 | NA | 252 | 0 | 0 | 0.01 | 100 | 100 | NA | 469 | 0 | 0 | 0.01 | 100 | 50 | NA | 252 | 0 | 0 | 0.01 | 100 | 100 | NA | ||||||||||||||||||||||||||||
▶generate_chunk_kernel.f90: 97 - 0.04%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 367 | 0 | 0 | 0.01 | 100 | 100 | NA | 293 | 0 | 0 | 0.01 | 100 | 50 | NA | 473 | 0 | 0 | 0.01 | 100 | 50 | NA | 254 | 0 | 0 | 0.01 | 100 | 100 | NA | ||||||||||||||||||||||||||||
▶generate_chunk_kernel.f90: 113 - 0.04%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 359 | 0 | 0 | 0.01 | 100 | 100 | NA | 287 | 0 | 0 | 0.01 | 100 | 50 | NA | 465 | 0 | 0 | 0.01 | 100 | 50 | NA | 250 | 0 | 0 | 0.01 | 100 | 100 | NA | ||||||||||||||||||||||||||||
▶generate_chunk_kernel.f90: 129 - 0.04%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 225 | 0.01 | 0.01 | 0.02 | 0 | 12.13 | 0 | 355 | 0 | 0 | 0.01 | 61.54 | 66.06 | NA | 461 | 0 | 0 | 0.01 | 72.62 | 35.83 | NA | |||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 734 - 0.03%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 455 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 693 | 0.01 | 0.01 | 0.02 | 0 | 12.5 | 0 | ||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 134 - 0.03%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 757 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 620 | 0.01 | 0.01 | 0.02 | 0 | 10.94 | 0 | ||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 339 - 0.03%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 691 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 563 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 823 | 0 | 0 | 0.01 | 0 | 12.5 | NA | |||||||||||||||||||||||||||||||||||
▶initialise_chunk_kernel.f90: 110 - 0.03%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 335 | 0 | 0 | 0.01 | 100 | 50 | NA | 521 | 0 | 0 | 0.01 | 100 | 50 | NA | 290 | 0 | 0 | 0.01 | 100 | 100 | NA | |||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 605 - 0.03%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 491 | 0.01 | 0.01 | 0.02 | 33.33 | 16.67 | 0 | 491 | 0 | 0 | 0.01 | 33.33 | 16.67 | NA | ||||||||||||||||||||||||||||||||||||||||||
▶build_field.f90: 146 - 0.03%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 23 | 0.01 | 0.01 | 0.03 | 100 | 100 | 0 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 538 - 0.03%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 628 | 0.01 | 0.01 | 0.03 | 33.33 | 16.67 | 0 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶initialise_chunk_kernel.f90: 128 - 0.03%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 329 | 0 | 0 | 0.01 | 100 | 50 | NA | 512 | 0 | 0 | 0.01 | 100 | 50 | NA | 286 | 0 | 0 | 0.01 | 100 | 100 | NA | |||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 210 - 0.03%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 599 | 0.01 | 0.01 | 0.02 | 0 | 12.5 | 0 | 599 | 0 | 0 | 0.01 | 0 | 12.5 | NA | ||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 429 - 0.03%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 663 | 0.01 | 0.01 | 0.03 | 33.33 | 16.67 | 0 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 286 - 0.02%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 578 | 0.01 | 0.01 | 0.02 | 0 | 10.94 | 0 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶generate_chunk_kernel.f90: 128 - 0.02%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 219 | 0.01 | 0.01 | 0.02 | 0 | 12.23 | 0 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶generate_chunk_kernel.f90: 90 - 0.02%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 233 | 0 | 0 | 0.01 | 50 | 31.25 | NA | 227 | 0 | 0 | 0.01 | 50 | 31.25 | NA | ||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 439 - 0.02%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 536 | 0.01 | 0.01 | 0.02 | 33.33 | 16.67 | 0 | |||||||||||||||||||||||||||||||||||||||||||||||||
▶generate_chunk_kernel.f90: 106 - 0.02%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 229 | 0 | 0 | 0.01 | 50 | 31.25 | NA | 223 | 0 | 0 | 0.01 | 50 | 31.25 | NA | ||||||||||||||||||||||||||||||||||||||||||
▶initialise_chunk_kernel.f90: 120 - 0.02%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 152 | 0 | 0 | 0.01 | 50 | 31.25 | NA | 248 | 0 | 0 | 0.01 | 50 | 31.25 | NA | ||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 147 - 0.02%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 617 | 0 | 0 | 0.01 | 0 | 12.5 | NA | 887 | 0 | 0 | 0.01 | 0 | 12.5 | NA | ||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 701 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 575 | 0 | 0 | 0.01 | 0 | 12.5 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 349 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 687 | 0 | 0 | 0.01 | 0 | 12.5 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 190 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 740 | 0 | 0 | 0.01 | 0 | 12.5 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶advection.f90: 106 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5 | 0 | 0 | 0.01 | 0 | 11.72 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 691 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 707 | 0 | 0 | 0.01 | 0 | 12.5 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 482 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 524 | 0 | 0 | 0.01 | 33.33 | 16.67 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 296 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 575 | 0 | 0 | 0.01 | 0 | 12.5 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 671 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 714 | 0 | 0 | 0.01 | 33.33 | 16.67 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 157 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 614 | 0 | 0 | 0.01 | 0 | 10.94 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 276 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 844 | 0 | 0 | 0.01 | 0 | 12.5 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 452 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 438 | 0 | 0 | 0.01 | 0 | 12.5 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶generate_chunk_kernel.f90: 98 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 225 | 0 | 0 | 0.01 | 50 | 31.25 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 124 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 623 | 0 | 0 | 0.01 | 0 | 12.5 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 472 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 527 | 0 | 0 | 0.01 | 33.33 | 16.67 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 724 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 458 | 0 | 0 | 0.01 | 14.29 | 12.5 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶update_halo_kernel.f90: 714 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 461 | 0 | 0 | 0.01 | 33.33 | 16.67 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶initialise_chunk_kernel.f90: 111 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 250 | 0 | 0 | 0.01 | 25 | 21.88 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶initialise_chunk_kernel.f90: 129 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 245 | 0 | 0 | 0.01 | 100 | 50 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||
▶generate_chunk_kernel.f90: 114 - 0.01%
| Run orig_HBM | Run gcc_16_HBM | Run icc_6_HBM | Run icx_2_HBM | Run orig_DDR | Run gcc_4_DDR | Run icc_11_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 221 | 0 | 0 | 0.01 | 50 | 31.25 | NA | |||||||||||||||||||||||||||||||||||||||||||||||||

