Loops
MultiBsplineRef.hpp: 68 - 146.64%
Run orig_DDR | Run gcc_15_DDR | Run icx_3_DDR | Run orig_HBM | Run gcc_9_HBM | Run icx_1_HBM | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
887 | 26.87 | 25.34 | 24.28 | 100 | 25 | 207.08 | 559 | 26.21 | 24.97 | 22.46 | 0 | 12.5 | 202.27 | 858 | 26.56 | 24.71 | 23.89 | 100 | 25 | 212.36 | 887 | 20.6 | 19.72 | 27.34 | 100 | 25 | 266.1 | 690 | 24.59 | 20.9 | 21.04 | 100 | 50 | 251.04 | 864 | 22.13 | 20.26 | 27.63 | 100 | 25 | 259.01 |
SoaDistanceTableAAOMPTarget.h: 440 - 35.99%
Run orig_DDR | Run gcc_15_DDR | Run icx_3_DDR | Run orig_HBM | Run gcc_9_HBM | Run icx_1_HBM | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1871 | 9.96 | 9.4 | 9.01 | 54.55 | 15.91 | 0 | 224 | 7.77 | 7.6 | 6.84 | 20 | 15 | 0 | 1847 | 8.95 | 8.2 | 7.92 | 54.55 | 15.91 | 0 | 1871 | 3.41 | 3.13 | 4.33 | 54.55 | 15.91 | 0 | 226 | 4.19 | 3.55 | 3.57 | 27.27 | 15.91 | 0 | 1861 | 3.41 | 3.17 | 4.32 | 54.55 | 15.91 | 0 |
inner_product.hpp: 155 - 6.77%
Run orig_DDR | Run gcc_15_DDR | Run icx_3_DDR | Run orig_HBM | Run gcc_9_HBM | Run icx_1_HBM | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
992 | 0.17 | 0.1 | 0.09 | 33.33 | 16.67 | 108.46 | 595 | 0.76 | 0.6 | 0.54 | 36.36 | 17.05 | 90.48 | 980 | 0.79 | 0.58 | 0.56 | 33.33 | 16.67 | 93.78 | 992 | 0.07 | 0.05 | 0.08 | 33.33 | 16.67 | 216.99 | 672 | 0.19 | 0.12 | 0.12 | 100 | 50 | 447.81 | 984 | 0.33 | 0.28 | 0.39 | 33.33 | 16.67 | 194.26 |
995 | 0.35 | 0.27 | 0.25 | 33.33 | 16.67 | 200.44 | 599 | 0.62 | 0.49 | 0.44 | 29.41 | 16.18 | 111.36 | 967 | 0.66 | 0.54 | 0.53 | 33.33 | 16.67 | 101.12 | 994 | 0.42 | 0.29 | 0.41 | 33.33 | 16.67 | 188.79 | 971 | 0.35 | 0.3 | 0.41 | 33.33 | 16.67 | 181.99 | |||||||
1007 | 0.75 | 0.59 | 0.56 | 33.33 | 16.67 | 92.22 | 661 | 0.14 | 0.1 | 0.09 | 36.36 | 17.05 | 108.3 | 965 | 0.14 | 0.1 | 0.09 | 33.33 | 16.67 | 109.18 | 995 | 0.32 | 0.24 | 0.33 | 33.33 | 16.67 | 225 | 969 | 0.07 | 0.05 | 0.07 | 33.33 | 16.67 | 217.14 | |||||||
994 | 0.68 | 0.53 | 0.51 | 33.33 | 16.67 | 103.35 | 598 | 0.45 | 0.35 | 0.31 | 33.33 | 16.67 | 153.99 | 968 | 0.35 | 0.27 | 0.26 | 33.33 | 16.67 | 201.57 | 1007 | 0.31 | 0.28 | 0.39 | 33.33 | 16.67 | 194.23 | 972 | 0.32 | 0.25 | 0.34 | 33.33 | 16.67 | 216.44 |
einspline_spo_ref.hpp: 223 - 5.7%
Run orig_DDR | Run gcc_15_DDR | Run icx_3_DDR | Run orig_HBM | Run gcc_9_HBM | Run icx_1_HBM | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
889 | 1.12 | 0.93 | 0.89 | 31.58 | 15.46 | 0 | 565 | 1.24 | 1.06 | 0.96 | 11.11 | 13.89 | 0 | 860 | 1.23 | 0.98 | 0.95 | 0 | 11.93 | 0 | 889 | 0.94 | 0.7 | 0.97 | 31.58 | 15.46 | 0 | 696 | 1.4 | 0.97 | 0.98 | 11.11 | 13.89 | 0 | 866 | 0.88 | 0.69 | 0.95 | 20 | 13.13 | 0 |
inner_product.hpp: 82 - 4.62%
Run orig_DDR | Run gcc_15_DDR | Run icx_3_DDR | Run orig_HBM | Run gcc_9_HBM | Run icx_1_HBM | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
998 | 0.14 | 0.1 | 0.09 | 100 | 50 | 179.29 | 44 | 1.05 | 0.92 | 0.83 | 0 | 12.5 | 74.22 | 982 | 0.38 | 0.26 | 0.25 | 100 | 50 | 69.38 | 998 | 0.11 | 0.07 | 0.1 | 100 | 50 | 253.15 | 41 | 0.55 | 0.24 | 0.24 | 100 | 50 | 300.02 | 986 | 0.17 | 0.12 | 0.17 | 100 | 50 | 149.85 |
990 | 0.06 | 0.03 | 0.03 | 100 | 50 | 129.65 | 662 | 0.06 | 0.04 | 0.04 | 0 | 12.5 | 90.25 | 963 | 0.06 | 0.03 | 0.02 | 100 | 50 | 121.64 | 990 | 0.03 | 0.02 | 0.02 | 100 | 50 | 182.68 | 745 | 0.15 | 0.08 | 0.09 | 100 | 50 | 226.28 | 967 | 0.04 | 0.02 | 0.02 | 100 | 50 | 182.96 |
897 | 0.43 | 0.33 | 0.31 | 100 | 50 | 217.21 | 597 | 0.35 | 0.27 | 0.24 | 0 | 12.5 | 67.35 | 870 | 0.44 | 0.33 | 0.32 | 100 | 50 | 219.65 | 897 | 0.37 | 0.28 | 0.39 | 100 | 50 | 258.5 | 748 | 0.04 | 0.02 | 0.02 | 100 | 50 | 178.71 | 975 | 0.12 | 0.07 | 0.09 | 100 | 50 | 257.78 |
1009 | 0.38 | 0.25 | 0.24 | 100 | 50 | 72.26 | 594 | 0.44 | 0.32 | 0.29 | 0 | 12.5 | 56.66 | 971 | 0.15 | 0.1 | 0.09 | 100 | 50 | 180.01 | 1009 | 0.19 | 0.13 | 0.17 | 100 | 50 | 138.79 | 743 | 0.45 | 0.18 | 0.18 | 100 | 50 | 100.01 | 874 | 0.39 | 0.28 | 0.38 | 100 | 50 | 262 |
TwoBodyJastrowRef.h: 342 - 4.23%
Run orig_DDR | Run gcc_15_DDR | Run icx_3_DDR | Run orig_HBM | Run gcc_9_HBM | Run icx_1_HBM | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
364 | 0.37 | 0.26 | 0.25 | 100 | 50 | 208.82 | 401 | 1.07 | 1 | 0.9 | 0 | 12.5 | 163.21 | 347 | 0.36 | 0.26 | 0.25 | 100 | 50 | 208.8 | 364 | 0.25 | 0.16 | 0.23 | 100 | 50 | 335.83 | 494 | 0.68 | 0.54 | 0.55 | 100 | 50 | 301.66 | 365 | 0.21 | 0.15 | 0.21 | 100 | 50 | 359.96 |
366 | 0.38 | 0.26 | 0.25 | 100 | 50 | 208.55 | 345 | 0.34 | 0.26 | 0.25 | 100 | 50 | 205.35 | 368 | 0.31 | 0.16 | 0.23 | 100 | 50 | 334.79 | 369 | 0.2 | 0.15 | 0.2 | 100 | 50 | 361.08 | ||||||||||||||
368 | 0.33 | 0.25 | 0.24 | 100 | 50 | 214.84 | 349 | 0.38 | 0.26 | 0.25 | 100 | 50 | 208.22 | 366 | 0.27 | 0.16 | 0.22 | 100 | 50 | 337.88 | 367 | 0.21 | 0.15 | 0.2 | 100 | 50 | 362.86 |
<unknown>: 0 - 4.11%
Run orig_DDR | Run gcc_15_DDR | Run icx_3_DDR | Run orig_HBM | Run gcc_9_HBM | Run icx_1_HBM | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2349 | 1.41 | 1.16 | 1.11 | 100 | 50 | 0.22 | 69 | 0.03 | 0.01 | 0.01 | 30.95 | 14.96 | 5.25 | 114 | 0.01 | 0 | 0 | 0 | 0 | NA | 2349 | 0.82 | 0.68 | 0.95 | 100 | 50 | 0.3 | 94 | 0.01 | 0 | 0 | 0 | 0 | NA | 115 | 0 | 0 | 0 | 0 | 0 | NA |
1558 | 0.01 | 0 | 0 | 0 | 0 | NA | 86 | 0.01 | 0 | 0 | 0 | 0 | NA | 356 | 0.01 | 0 | 0 | 0 | 0 | NA | 354 | 0.01 | 0 | 0 | 0 | 0 | NA | 86 | 0 | 0 | 0 | 0 | 0 | NA | 355 | 0 | 0 | 0 | 0 | 0 | NA |
367 | 0 | 0 | 0 | 0 | 0 | NA | 30 | 0 | 0 | 0 | 0 | 0 | NA | 244 | 0.01 | 0 | 0 | 0 | 0 | NA | 1558 | 0.01 | 0 | 0 | 0 | 0 | NA | 80 | 0 | 0 | 0 | 0 | 0 | NA | 1560 | 0.01 | 0 | 0 | 0 | 0 | NA |
1074 | 0.01 | 0 | 0 | 0 | 0 | NA | 74 | 0.01 | 0 | 0 | 0 | 0 | NA | 1108 | 0 | 0 | 0 | 0 | 0 | NA | 367 | 0.01 | 0 | 0 | 0 | 0 | NA | 74 | 0.04 | 0.02 | 0.02 | 30.95 | 14.96 | 2.5 | 1564 | 0.09 | 0 | 0 | 0 | 0 | NA |
1550 | 0 | 0 | 0 | 0 | 0 | NA | 75 | 0.01 | 0 | 0 | 0 | 0 | NA | 354 | 0.02 | 0 | 0 | 0 | 0 | NA | 2100 | 0.01 | 0 | 0 | 0 | 0 | NA | 44 | 0 | 0 | 0 | 0 | 0 | NA | 2096 | 0.01 | 0 | 0 | 0 | 0 | NA |
362 | 0.01 | 0 | 0 | 0 | 0 | NA | 81 | 0 | 0 | 0 | 0 | 0 | NA | 367 | 0 | 0 | 0 | 0 | 0 | NA | 362 | 0.02 | 0 | 0 | 0 | 0 | NA | 84 | 0 | 0 | 0 | 0 | 0 | NA | 361 | 0.01 | 0 | 0 | 0 | 0 | NA |
372 | 0.01 | 0 | 0 | 0 | 0 | NA | 48 | 0.01 | 0 | 0 | 0 | 0 | NA | 1538 | 0 | 0 | 0 | 0 | 0 | NA | 372 | 0.01 | 0 | 0 | 0 | 0 | NA | 91 | 0 | 0 | 0 | 0 | 0 | NA | 373 | 0.01 | 0 | 0 | 0 | 0 | NA |
1570 | 0.01 | 0 | 0 | 0 | 0 | NA | 79 | 0 | 0 | 0 | 0 | 0 | NA | 1548 | 0.01 | 0 | 0 | 0 | 0 | NA | 1570 | 0.01 | 0 | 0 | 0 | 0 | NA | 490 | 0.01 | 0 | 0 | 0 | 0 | NA | 375 | 0.01 | 0 | 0 | 0 | 0 | NA |
1574 | 0.11 | 0 | 0 | 0 | 0 | NA | 49 | 0.01 | 0 | 0 | 0 | 0 | NA | 373 | 0 | 0 | 0 | 0 | 0 | NA | 1574 | 0.1 | 0 | 0 | 0 | 0 | NA | 363 | 0.01 | 0 | 0 | 0 | 0 | NA | 368 | 0.01 | 0 | 0 | 0 | 0 | NA |
1237 | 0.01 | 0 | 0 | 0 | 0 | NA | 418 | 0.01 | 0 | 0 | 0 | 0 | NA | 372 | 0.02 | 0 | 0 | 0 | 0 | NA | 376 | 0 | 0 | 0 | 0 | 0 | NA | 362 | 0.01 | 0 | 0 | 0 | 0 | NA | 381 | 0 | 0 | 0 | 0 | 0 | NA |
376 | 0 | 0 | 0 | 0 | 0 | NA | 260 | 0 | 0 | 0 | 0 | 0 | NA | 369 | 0 | 0 | 0 | 0 | 0 | NA | 379 | 0 | 0 | 0 | 0 | 0 | NA | 757 | 0 | 0 | 0 | 0 | 0 | NA | 383 | 0.01 | 0 | 0 | 0 | 0 | NA |
379 | 0 | 0 | 0 | 0 | 0 | NA | 370 | 0 | 0 | 0 | 0 | 0 | NA | 371 | 0 | 0 | 0 | 0 | 0 | NA | 389 | 0 | 0 | 0 | 0 | 0 | NA | 558 | 0.01 | 0 | 0 | 0 | 0 | NA | 2358 | 0.9 | 0.67 | 0.91 | 100 | 50 | 0.41 |
389 | 0 | 0 | 0 | 0 | 0 | NA | 568 | 0.01 | 0 | 0 | 0 | 0 | NA | 376 | 0.02 | 0 | 0 | 0 | 0 | NA | 265 | 0.02 | 0 | 0 | 0 | 0 | NA | 207 | 0.01 | 0 | 0 | 0 | 0 | NA | 1223 | 0.01 | 0 | 0 | 0 | 0 | NA |
265 | 0.01 | 0 | 0 | 0 | 0 | NA | 261 | 0 | 0 | 0 | 0 | 0 | NA | 1217 | 0.01 | 0 | 0 | 0 | 0 | NA | 1002 | 0.01 | 0 | 0 | 0 | 0 | NA | 204 | 0.01 | 0 | 0 | 0 | 0 | NA | 1219 | 0.01 | 0 | 0 | 0 | 0 | NA |
1002 | 0.01 | 0 | 0 | 0 | 0 | NA | 368 | 0 | 0 | 0 | 0 | 0 | NA | 60 | 0 | 0 | 0 | 0 | 0 | NA | 982 | 0.01 | 0 | 0 | 0 | 0 | NA | 707 | 0 | 0 | 0 | 0 | 0 | NA | 266 | 0.02 | 0 | 0 | 0 | 0 | NA |
982 | 0 | 0 | 0 | 0 | 0 | NA | 555 | 0.03 | 0 | 0 | 0 | 0 | NA | 1259 | 0 | 0 | 0 | 0 | 0 | NA | 395 | 0.01 | 0 | 0 | 0 | 0 | NA | 205 | 0.03 | 0 | 0 | 0 | 0 | NA | 387 | 0 | 0 | 0 | 0 | 0 | NA |
394 | 0.02 | 0 | 0 | 0 | 0 | NA | 375 | 0.01 | 0 | 0 | 0 | 0 | NA | 270 | 0 | 0 | 0 | 0 | 0 | NA | 970 | 0 | 0 | 0 | 0 | 0 | NA | 698 | 0 | 0 | 0 | 0 | 0 | NA | 979 | 0.01 | 0 | 0 | 0 | 0 | NA |
395 | 0 | 0 | 0 | 0 | 0 | NA | 374 | 0.01 | 0 | 0 | 0 | 0 | NA | 2077 | 0.01 | 0 | 0 | 0 | 0 | NA | 1866 | 0 | 0 | 0 | 0 | 0 | NA | 686 | 0.02 | 0 | 0 | 0 | 0 | NA | 977 | 0 | 0 | 0 | 0 | 0 | NA |
999 | 0 | 0 | 0 | 0 | 0 | NA | 652 | 0 | 0 | 0 | 0 | 0 | NA | 2081 | 0 | 0 | 0 | 0 | 0 | NA | 896 | 0 | 0 | 0 | 0 | 0 | NA | 48 | 0 | 0 | 0 | 0 | 0 | NA | 950 | 0.01 | 0 | 0 | 0 | 0 | NA |
1866 | 0.01 | 0 | 0 | 0 | 0 | NA | 64 | 0 | 0 | 0 | 0 | 0 | NA | 243 | 0.01 | 0 | 0 | 0 | 0 | NA | 360 | 0.01 | 0 | 0 | 0 | 0 | NA | 209 | 0 | 0 | 0 | 0 | 0 | NA | 1354 | 0 | 0 | 0 | 0 | 0 | NA |
896 | 0 | 0 | 0 | 0 | 0 | NA | 380 | 0 | 0 | 0 | 0 | 0 | NA | 280 | 0.01 | 0 | 0 | 0 | 0 | NA | 114 | 0 | 0 | 0 | 0 | 0 | NA | 443 | 0 | 0 | 0 | 0 | 0 | NA | 959 | 0 | 0 | 0 | 0 | 0 | NA |
1000 | 0 | 0 | 0 | 0 | 0 | NA | 567 | 0 | 0 | 0 | 0 | 0 | NA | 281 | 0.01 | 0 | 0 | 0 | 0 | NA | 1001 | 0.01 | 0 | 0 | 0 | 0 | NA | 705 | 0 | 0 | 0 | 0 | 0 | NA | 293 | 0 | 0 | 0 | 0 | 0 | NA |
1572 | 0 | 0 | 0 | 0 | 0 | NA | 377 | 0 | 0 | 0 | 0 | 0 | NA | 282 | 0.02 | 0 | 0 | 0 | 0 | NA | 378 | 0 | 0 | 0 | 0 | 0 | NA | 445 | 0 | 0 | 0 | 0 | 0 | NA | 292 | 0 | 0 | 0 | 0 | 0 | NA |
378 | 0 | 0 | 0 | 0 | 0 | NA | 406 | 0.01 | 0 | 0 | 0 | 0 | NA | 285 | 0.01 | 0 | 0 | 0 | 0 | NA | 385 | 0 | 0 | 0 | 0 | 0 | NA | 450 | 0 | 0 | 0 | 0 | 0 | NA | 289 | 0 | 0 | 0 | 0 | 0 | NA |
391 | 0 | 0 | 0 | 0 | 0 | NA | 379 | 0 | 0 | 0 | 0 | 0 | NA | 975 | 0.01 | 0 | 0 | 0 | 0 | NA | 387 | 0 | 0 | 0 | 0 | 0 | NA | 449 | 0 | 0 | 0 | 0 | 0 | NA | 947 | 0 | 0 | 0 | 0 | 0 | NA |
393 | 0 | 0 | 0 | 0 | 0 | NA | 378 | 0 | 0 | 0 | 0 | 0 | NA | 272 | 0 | 0 | 0 | 0 | 0 | NA | 391 | 0 | 0 | 0 | 0 | 0 | NA | 626 | 0 | 0 | 0 | 0 | 0 | NA | 303 | 0.01 | 0 | 0 | 0 | 0 | NA |
61 | 0 | 0 | 0 | 0 | 0 | NA | 265 | 0.01 | 0 | 0 | 0 | 0 | NA | 269 | 0 | 0 | 0 | 0 | 0 | NA | 393 | 0 | 0 | 0 | 0 | 0 | NA | 624 | 0 | 0 | 0 | 0 | 0 | NA | 976 | 0 | 0 | 0 | 0 | 0 | NA |
1321 | 0 | 0 | 0 | 0 | 0 | NA | 569 | 0.01 | 0 | 0 | 0 | 0 | NA | 970 | 0 | 0 | 0 | 0 | 0 | NA | 304 | 0.01 | 0 | 0 | 0 | 0 | NA | 621 | 0 | 0 | 0 | 0 | 0 | NA | 846 | 0 | 0 | 0 | 0 | 0 | NA |
2100 | 0.01 | 0 | 0 | 0 | 0 | NA | 267 | 0.01 | 0 | 0 | 0 | 0 | NA | 946 | 0 | 0 | 0 | 0 | 0 | NA | 292 | 0 | 0 | 0 | 0 | 0 | NA | 617 | 0 | 0 | 0 | 0 | 0 | NA | 1547 | 0 | 0 | 0 | 0 | 0 | NA |
1549 | 0 | 0 | 0 | 0 | 0 | NA | 348 | 0.03 | 0 | 0 | 0 | 0 | NA | 840 | 0 | 0 | 0 | 0 | 0 | NA | 291 | 0 | 0 | 0 | 0 | 0 | NA | 606 | 0.01 | 0 | 0 | 0 | 0 | NA | 1120 | 0 | 0 | 0 | 0 | 0 | NA |
292 | 0 | 0 | 0 | 0 | 0 | NA | 269 | 0 | 0 | 0 | 0 | 0 | NA | 348 | 0 | 0 | 0 | 0 | 0 | NA | 1000 | 0 | 0 | 0 | 0 | 0 | NA | 467 | 0 | 0 | 0 | 0 | 0 | NA | 385 | 0 | 0 | 0 | 0 | 0 | NA |
291 | 0 | 0 | 0 | 0 | 0 | NA | 566 | 0.02 | 0 | 0 | 0 | 0 | NA | 361 | 0 | 0 | 0 | 0 | 0 | NA | 288 | 0 | 0 | 0 | 0 | 0 | NA | 464 | 0.02 | 0 | 0 | 0 | 0 | NA | 389 | 0 | 0 | 0 | 0 | 0 | NA |
997 | 0 | 0 | 0 | 0 | 0 | NA | 392 | 0.01 | 0 | 0 | 0 | 0 | NA | 2331 | 1.29 | 1.15 | 1.11 | 100 | 50 | 0.19 | 303 | 0.01 | 0 | 0 | 0 | 0 | NA | 465 | 0.01 | 0 | 0 | 0 | 0 | NA | 1308 | 0 | 0 | 0 | 0 | 0 | NA |
1131 | 0 | 0 | 0 | 0 | 0 | NA | 262 | 0.01 | 0 | 0 | 0 | 0 | NA | 292 | 0 | 0 | 0 | 0 | 0 | NA | 302 | 0.01 | 0 | 0 | 0 | 0 | NA | 43 | 0 | 0 | 0 | 0 | 0 | NA | 110 | 0.01 | 0 | 0 | 0 | 0 | NA |
288 | 0 | 0 | 0 | 0 | 0 | NA | 263 | 0.02 | 0 | 0 | 0 | 0 | NA | 291 | 0.01 | 0 | 0 | 0 | 0 | NA | 1131 | 0 | 0 | 0 | 0 | 0 | NA | 468 | 0.01 | 0 | 0 | 0 | 0 | NA | 1576 | 0 | 0 | 0 | 0 | 0 | NA |
303 | 0.01 | 0 | 0 | 0 | 0 | NA | 349 | 0.01 | 0 | 0 | 0 | 0 | NA | 363 | 0 | 0 | 0 | 0 | 0 | NA | 999 | 0 | 0 | 0 | 0 | 0 | NA | 627 | 0 | 0 | 0 | 0 | 0 | NA | 308 | 0.01 | 0 | 0 | 0 | 0 | NA |
302 | 0.01 | 0 | 0 | 0 | 0 | NA | 299 | 0.01 | 0 | 0 | 0 | 0 | NA | 1553 | 0.03 | 0 | 0 | 0 | 0 | NA | 1328 | 0 | 0 | 0 | 0 | 0 | NA | 628 | 0 | 0 | 0 | 0 | 0 | NA | 391 | 0.01 | 0 | 0 | 0 | 0 | NA |
1364 | 0 | 0 | 0 | 0 | 0 | NA | 298 | 0.01 | 0 | 0 | 0 | 0 | NA | 365 | 0 | 0 | 0 | 0 | 0 | NA | 2104 | 0 | 0 | 0 | 0 | 0 | NA | 444 | 0 | 0 | 0 | 0 | 0 | NA | 305 | 0.01 | 0 | 0 | 0 | 0 | NA |
264 | 0 | 0 | 0 | 0 | 0 | NA | 296 | 0.03 | 0.01 | 0 | 0 | 0 | 0 | 58 | 0 | 0 | 0 | 0 | 0 | NA | 58 | 0 | 0 | 0 | 0 | 0 | NA | 697 | 0.02 | 0 | 0 | 0 | 0 | NA | 304 | 0.01 | 0 | 0 | 0 | 0 | NA |
899 | 0 | 0 | 0 | 0 | 0 | NA | 413 | 0 | 0 | 0 | 0 | 0 | NA | 108 | 0 | 0 | 0 | 0 | 0 | NA | 1364 | 0 | 0 | 0 | 0 | 0 | NA | 281 | 0 | 0 | 0 | 0 | 0 | NA | 314 | 0.01 | 0 | 0 | 0 | 0 | NA |
109 | 0.01 | 0 | 0 | 0 | 0 | NA | 203 | 0.01 | 0 | 0 | 0 | 0 | NA | 1238 | 0 | 0 | 0 | 0 | 0 | NA | 309 | 0.01 | 0 | 0 | 0 | 0 | NA | 309 | 0 | 0 | 0 | 0 | 0 | NA | 1565 | 0.02 | 0 | 0 | 0 | 0 | NA |
385 | 0.01 | 0 | 0 | 0 | 0 | NA | 252 | 0 | 0 | 0 | 0 | 0 | NA | 1552 | 0.04 | 0 | 0 | 0 | 0 | NA | 1575 | 0.02 | 0 | 0 | 0 | 0 | NA | 403 | 0 | 0 | 0 | 0 | 0 | NA | 1353 | 0 | 0 | 0 | 0 | 0 | NA |
66 | 0 | 0 | 0 | 0 | 0 | NA | 360 | 0 | 0 | 0 | 0 | 0 | NA | 109 | 0.02 | 0 | 0 | 0 | 0 | NA | 307 | 0.01 | 0 | 0 | 0 | 0 | NA | 288 | 0 | 0 | 0 | 0 | 0 | NA | 1226 | 0.01 | 0 | 0 | 0 | 0 | NA |
1001 | 0 | 0 | 0 | 0 | 0 | NA | 37 | 0.05 | 0 | 0 | 0 | 0 | NA | 1115 | 0 | 0 | 0 | 0 | 0 | NA | 997 | 0 | 0 | 0 | 0 | 0 | NA | 349 | 0 | 0 | 0 | 0 | 0 | NA | 974 | 0.01 | 0 | 0 | 0 | 0 | NA |
110 | 0.02 | 0 | 0 | 0 | 0 | NA | 38 | 0.09 | 0 | 0 | 0 | 0 | NA | 69 | 0 | 0 | 0 | 0 | 0 | NA | 1572 | 0.01 | 0 | 0 | 0 | 0 | NA | 225 | 0 | 0 | 0 | 0 | 0 | NA | 63 | 0 | 0 | 0 | 0 | 0 | NA |
309 | 0 | 0 | 0 | 0 | 0 | NA | 379 | 0.01 | 0 | 0 | 0 | 0 | NA | 110 | 0.02 | 0 | 0 | 0 | 0 | NA | 66 | 0.01 | 0 | 0 | 0 | 0 | NA | 301 | 0.01 | 0 | 0 | 0 | 0 | NA | 265 | 0 | 0 | 0 | 0 | 0 | NA |
1575 | 0.02 | 0 | 0 | 0 | 0 | NA | 126 | 0 | 0 | 0 | 0 | 0 | NA | 973 | 0 | 0 | 0 | 0 | 0 | NA | 1234 | 0.02 | 0 | 0 | 0 | 0 | NA | 302 | 0.02 | 0 | 0 | 0 | 0 | NA | 1856 | 0.01 | 0 | 0 | 0 | 0 | NA |
307 | 0.01 | 0 | 0 | 0 | 0 | NA | 380 | 0.01 | 0 | 0 | 0 | 0 | NA | 1842 | 0 | 0 | 0 | 0 | 0 | NA | 374 | 0 | 0 | 0 | 0 | 0 | NA | 34 | 0 | 0 | 0 | 0 | 0 | NA | 1270 | 0.01 | 0 | 0 | 0 | 0 | NA |
114 | 0 | 0 | 0 | 0 | 0 | NA | 223 | 0.01 | 0 | 0 | 0 | 0 | NA | 1341 | 0.01 | 0 | 0 | 0 | 0 | NA | 313 | 0.01 | 0 | 0 | 0 | 0 | NA | 39 | 0.1 | 0 | 0 | 0 | 0 | NA | 978 | 0 | 0 | 0 | 0 | 0 | NA |
1279 | 0.01 | 0 | 0 | 0 | 0 | NA | 240 | 0.01 | 0 | 0 | 0 | 0 | NA | 1295 | 0 | 0 | 0 | 0 | 0 | NA | 899 | 0.01 | 0 | 0 | 0 | 0 | NA | 38 | 0.04 | 0 | 0 | 0 | 0 | NA | 876 | 0.01 | 0 | 0 | 0 | 0 | NA |
304 | 0.01 | 0 | 0 | 0 | 0 | NA | 972 | 0 | 0 | 0 | 0 | 0 | NA | 869 | 0 | 0 | 0 | 0 | 0 | NA | 483 | 0 | 0 | 0 | 0 | 0 | NA | 873 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
973 | 0 | 0 | 0 | 0 | 0 | NA | 1053 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 1279 | 0.01 | 0 | 0 | 0 | 0 | NA | 227 | 0.01 | 0 | 0 | 0 | 0 | NA | 1562 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
387 | 0 | 0 | 0 | 0 | 0 | NA | 974 | 0 | 0 | 0 | 0 | 0 | NA | 264 | 0.01 | 0 | 0 | 0 | 0 | NA | 404 | 0.01 | 0 | 0 | 0 | 0 | NA | 310 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
1234 | 0.01 | 0 | 0 | 0 | 0 | NA | 1525 | 0 | 0 | 0 | 0 | 0 | NA | 1237 | 0.01 | 0 | 0 | 0 | 0 | NA | 316 | 0 | 0 | 0 | 0 | 0 | NA | 119 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
883 | 0.01 | 0 | 0 | 0 | 0 | NA | 955 | 0.01 | 0 | 0 | 0 | 0 | NA | 118 | 0.01 | 0 | 0 | 0 | 0 | NA | 339 | 0.01 | 0 | 0 | 0 | 0 | NA | ||||||||||||||
360 | 0 | 0 | 0 | 0 | 0 | NA | 1535 | 0 | 0 | 0 | 0 | 0 | NA | 338 | 0 | 0 | 0 | 0 | 0 | NA | 113 | 0.01 | 0 | 0 | 0 | 0 | NA | ||||||||||||||
374 | 0 | 0 | 0 | 0 | 0 | NA | 869 | 0 | 0 | 0 | 0 | 0 | NA | 112 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||
869 | 0 | 0 | 0 | 0 | 0 | NA | 287 | 0.01 | 0 | 0 | 0 | 0 | NA | 1365 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||
313 | 0 | 0 | 0 | 0 | 0 | NA | 1550 | 0.01 | 0 | 0 | 0 | 0 | NA | 57 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||
107 | 0 | 0 | 0 | 0 | 0 | NA | 116 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||
875 | 0 | 0 | 0 | 0 | 0 | NA | 1214 | 0.01 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||
118 | 0.01 | 0 | 0 | 0 | 0 | NA | 872 | 0.01 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||
112 | 0.01 | 0 | 0 | 0 | 0 | NA | 854 | 0.01 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||
1365 | 0 | 0 | 0 | 0 | 0 | NA | 117 | 0.01 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||
343 | 0.03 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||||||||||||||||
341 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||||||||||||||||
112 | 0.01 | 0 | 0 | 0 | 0 | NA |
MultiBsplineRef.hpp: 276 - 3.76%
Run orig_DDR | Run gcc_15_DDR | Run icx_3_DDR | Run orig_HBM | Run gcc_9_HBM | Run icx_1_HBM | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
891 | 0.51 | 0.37 | 0.36 | 100 | 50 | 265.4 | 562 | 1.53 | 1 | 0.9 | 0 | 12.5 | 97.85 | 863 | 0.44 | 0.36 | 0.35 | 100 | 50 | 275.82 | 891 | 0.48 | 0.32 | 0.44 | 100 | 50 | 311.04 | 691 | 1.86 | 1.26 | 1.27 | 0 | 12.5 | 77.76 | 868 | 0.49 | 0.33 | 0.44 | 100 | 50 | 294.02 |
BsplineFunctor.h: 291 - 3%
Run orig_DDR | Run gcc_15_DDR | Run icx_3_DDR | Run orig_HBM | Run gcc_9_HBM | Run icx_1_HBM | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
339 | 0.63 | 0.49 | 0.47 | 86.96 | 44.57 | 0.26 | 417 | 0.05 | 0.02 | 0.02 | 0 | 9.38 | 0.6 | 319 | 0.68 | 0.55 | 0.53 | 0 | 9.94 | 0.02 | 339 | 0.47 | 0.36 | 0.5 | 86.96 | 44.57 | 0.38 | 489 | 0.23 | 0.18 | 0.18 | 0 | 9.38 | 0.1 | 340 | 0.45 | 0.38 | 0.51 | 83.48 | 42.77 | 0.25 |
394 | 0.61 | 0.49 | 0.44 | 0 | 9.38 | 0.12 | 557 | 0.44 | 0.32 | 0.32 | 0 | 9.38 | 0.05 | ||||||||||||||||||||||||||||
466 | 0.07 | 0.03 | 0.03 | 0 | 9.38 | 0.28 |
inner_product.hpp: 211 - 2.07%
Run orig_DDR | Run gcc_15_DDR | Run icx_3_DDR | Run orig_HBM | Run gcc_9_HBM | Run icx_1_HBM | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
976 | 0.57 | 0.43 | 0.41 | 85.71 | 41.07 | 0 | 655 | 0.54 | 0.43 | 0.39 | 0 | 12.5 | 0 | 949 | 0.64 | 0.49 | 0.47 | 33.33 | 16.67 | 0 | 976 | 0.23 | 0.21 | 0.29 | 85.71 | 41.07 | 0 | 711 | 0.21 | 0.21 | 0.21 | 33.33 | 16.67 | 0 | 953 | 0.25 | 0.22 | 0.3 | 85.71 | 41.07 | 0 |
TwoBodyJastrowRef.h: 324 - 1.83%
Run orig_DDR | Run gcc_15_DDR | Run icx_3_DDR | Run orig_HBM | Run gcc_9_HBM | Run icx_1_HBM | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
370 | 0.41 | 0.31 | 0.3 | 100 | 50 | 261.94 | 402 | 0.52 | 0.39 | 0.35 | 0 | 12.5 | 208.14 | 352 | 0.43 | 0.33 | 0.31 | 100 | 50 | 247.23 | 370 | 0.26 | 0.2 | 0.27 | 100 | 50 | 408.83 | 495 | 0.43 | 0.35 | 0.35 | 0 | 12.5 | 232.69 | 371 | 0.24 | 0.19 | 0.25 | 100 | 50 | 427.88 |
BsplineFunctor.h: 246 - 0.66%
Run orig_DDR | Run gcc_15_DDR | Run icx_3_DDR | Run orig_HBM | Run gcc_9_HBM | Run icx_1_HBM | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
398 | 0.14 | 0.08 | 0.08 | 100 | 46.88 | 643.42 | 505 | 0.23 | 0.14 | 0.13 | 0 | 12.5 | 407.32 | 377 | 0.19 | 0.12 | 0.11 | 55.26 | 30.26 | 441.45 | 398 | 0.15 | 0.09 | 0.13 | 100 | 46.88 | 561.84 | 566 | 0.15 | 0.1 | 0.1 | 100 | 48.46 | 527.72 | 394 | 0.14 | 0.08 | 0.11 | 100 | 46.88 | 622.44 |
stl_numeric.h: 140 - 0.64%
Run orig_DDR | Run gcc_15_DDR | Run icx_3_DDR | Run orig_HBM | Run gcc_9_HBM | Run icx_1_HBM | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
361 | 0.11 | 0.07 | 0.07 | 100 | 50 | 259.61 | 416 | 0.03 | 0.01 | 0.01 | 0 | 12.5 | 180.91 | 342 | 0.11 | 0.07 | 0.06 | 100 | 50 | 261.49 | 361 | 0.11 | 0.06 | 0.08 | 100 | 50 | 306.18 | 555 | 0.12 | 0.08 | 0.08 | 100 | 50 | 227.99 | 362 | 0.11 | 0.06 | 0.08 | 100 | 50 | 298.44 |
362 | 0.04 | 0.02 | 0.01 | 0 | 12.5 | 75.03 | 394 | 0.02 | 0 | 0.01 | 100 | 50 | NA | 390 | 0.02 | 0 | 0.01 | 100 | 50 | NA | |||||||||||||||||||||
399 | 0.33 | 0.26 | 0.23 | 0 | 12.5 | 69.46 |
TwoBodyJastrowRef.h: 381 - 0.34%
Run orig_DDR | Run gcc_15_DDR | Run icx_3_DDR | Run orig_HBM | Run gcc_9_HBM | Run icx_1_HBM | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
388 | 0.04 | 0.01 | 0.01 | 100 | 50 | 351.21 | 414 | 0.14 | 0.09 | 0.08 | 0 | 12.5 | 120.54 | 364 | 0.04 | 0.01 | 0.01 | 100 | 50 | 368.61 | 390 | 0.03 | 0.01 | 0.02 | 100 | 50 | 362.96 | 463 | 0.1 | 0.05 | 0.05 | 100 | 50 | 216.9 | 382 | 0.04 | 0.01 | 0.02 | 100 | 50 | 352.76 |
386 | 0.05 | 0.02 | 0.02 | 100 | 50 | 181.36 | 366 | 0.04 | 0.01 | 0.01 | 100 | 50 | 344.46 | 386 | 0.04 | 0.01 | 0.02 | 100 | 50 | 361.56 | 384 | 0.03 | 0.01 | 0.02 | 100 | 50 | 371.76 | ||||||||||||||
390 | 0.04 | 0.02 | 0.02 | 100 | 50 | 187.41 | 368 | 0.04 | 0.02 | 0.02 | 100 | 50 | 182.78 | 388 | 0.03 | 0.01 | 0.02 | 100 | 50 | 364.16 | 386 | 0.03 | 0.01 | 0.02 | 100 | 50 | 343.86 |
TwoBodyJastrowRef.h: 388 - 0.09%
Run orig_DDR | Run gcc_15_DDR | Run icx_3_DDR | Run orig_HBM | Run gcc_9_HBM | Run icx_1_HBM | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
384 | 0.04 | 0.01 | 0.01 | 100 | 50 | 702.97 | 413 | 0.05 | 0.02 | 0.02 | 0 | 12.5 | 360.01 | 362 | 0.04 | 0.01 | 0.01 | 100 | 50 | 739.87 | 384 | 0.04 | 0.01 | 0.02 | 100 | 50 | 728.07 | 462 | 0.03 | 0.01 | 0.01 | 100 | 50 | 699.82 | 380 | 0.04 | 0.01 | 0.02 | 100 | 50 | 716.57 |
OneBodyJastrowRef.h: 214 - 0.06%
Run orig_DDR | Run gcc_15_DDR | Run icx_3_DDR | Run orig_HBM | Run gcc_9_HBM | Run icx_1_HBM | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
336 | 0.02 | 0.01 | 0.01 | 0 | 11.11 | 1.8 | 360 | 0.06 | 0.01 | 0.01 | 0 | 10.94 | 0.6 | 315 | 0.03 | 0.01 | 0.01 | 0 | 11.61 | 0.75 | 336 | 0.03 | 0.01 | 0.01 | 0 | 11.11 | 0.8 | 604 | 0.02 | 0.01 | 0.01 | 0 | 12.5 | 0 | 337 | 0.02 | 0.01 | 0.01 | 0 | 11.61 | 1.4 |