Loops
MultiBsplineRef.hpp: 68 - 93.89 %
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_1 | ||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 809 | 9.40 | 8.95 | 5.90 | 100 | 100 | 87.29 | 914 | 36.83 | 35.84 | 23.41 | 100 | 100 | 97.71 | 809 | 9.43 | 9.00 | 5.93 | 100 | 100 | 86.69 | 914 | 36.76 | 35.78 | 23.34 | 100 | 100 | 97.87 |
| 810 | 9.23 | 8.89 | 5.86 | 100 | 100 | 87.83 | 811 | 9.46 | 8.88 | 5.85 | 100 | 100 | 88.11 | ||||||||||||||
| 812 | 9.56 | 9.03 | 5.95 | 100 | 100 | 86.51 | 812 | 9.50 | 9.06 | 5.97 | 100 | 100 | 86.15 | ||||||||||||||
| 811 | 9.27 | 8.87 | 5.84 | 100 | 100 | 87.93 | 810 | 9.40 | 8.88 | 5.85 | 100 | 100 | 87.88 | ||||||||||||||
| Sum on 4 analyzed binary loops (exec - 809, exec - 810, exec - 812, exec - 811) | Sum on 1 analyzed binary loop (exec - 914) | Sum on 4 analyzed binary loops (exec - 809, exec - 811, exec - 812, exec - 810) | Sum on 1 analyzed binary loop (exec - 914) | ||||||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||
| Data Access Issues | Data Access Issues | Data Access Issues | Data Access Issues | ||||||||||||||||||||||||
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||
| Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||||||
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||
MultiBsplineRef.hpp: 242 - 35.15 %
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_1 | ||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 818 | 12.39 | 11.76 | 7.75 | 100 | 100 | 394.46 | 946 | 15.33 | 15.05 | 9.83 | 0 | 25 | 319.15 | 818 | 12.22 | 11.69 | 7.70 | 100 | 100 | 396.69 | 946 | 15.37 | 15.12 | 9.86 | 0 | 25 | 317.7 |
| Sum on 1 analyzed binary loop (exec - 818) | Sum on 1 analyzed binary loop (exec - 946) | Sum on 1 analyzed binary loop (exec - 818) | Sum on 1 analyzed binary loop (exec - 946) | ||||||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||
| Data Access Issues | Data Access Issues | Data Access Issues | Data Access Issues | ||||||||||||||||||||||||
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||
| Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||||||
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||
BsplineFunctor.h: 236 - 2.82 %
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_1 | ||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 277 | 0.10 | 0.06 | 0.04 | 0 | 20.83 | 6.45 | 295 | 2.29 | 2.02 | 1.32 | 0 | 20.19 | 35.06 | 277 | 0.08 | 0.05 | 0.04 | 0 | 20.83 | 6.66 | 295 | 2.32 | 2.07 | 1.35 | 0 | 20.19 | 34.25 |
| 202 | 0.09 | 0.06 | 0.04 | 0 | 20.19 | 13.8 | 202 | 0.09 | 0.06 | 0.04 | 0 | 20.19 | 14.13 | ||||||||||||||
| No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | Sum on 1 analyzed binary loop (exec - 295) | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | Sum on 1 analyzed binary loop (exec - 295) | ||||||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||
| Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||||||||
| Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||||||||||
| Control Flow Issues | Control Flow Issues | ||||||||||||||||||||||||||
| Presence of more than 4 paths | 1 | Presence of more than 4 paths | 1 | ||||||||||||||||||||||||
| Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||||||||
| Presence of more than 4 paths | 1 | Presence of more than 4 paths | 1 | ||||||||||||||||||||||||
TwoBodyJastrowRef.h: 342 - 2.53 %
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_1 | ||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 315 | 0.39 | 0.32 | 0.21 | 95.24 | 96.43 | 93.21 | 530 | 1.13 | 0.96 | 0.62 | 100 | 100 | 112.61 | 315 | 0.40 | 0.33 | 0.22 | 95.24 | 96.43 | 91.47 | 530 | 1.12 | 0.95 | 0.62 | 100 | 100 | 112.5 |
| 314 | 0.42 | 0.33 | 0.22 | 95.24 | 96.43 | 92.38 | 314 | 0.40 | 0.33 | 0.22 | 95.24 | 96.43 | 92.19 | ||||||||||||||
| 313 | 0.44 | 0.32 | 0.21 | 95.24 | 96.43 | 92.21 | 313 | 0.42 | 0.32 | 0.21 | 95.24 | 96.43 | 94.87 | ||||||||||||||
| No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | Sum on 1 analyzed binary loop (exec - 530) | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | Sum on 1 analyzed binary loop (exec - 530) | ||||||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||
| Data Access Issues | Data Access Issues | ||||||||||||||||||||||||||
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||||
| Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||||||||
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||||
inner_product.hpp: 155 - 2.43 %
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_1 | ||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 910 | 0.69 | 0.52 | 0.35 | 81.82 | 59.09 | 68.45 | 1107 | 0.63 | 0.52 | 0.34 | 100 | 60.87 | 68.99 | 910 | 0.73 | 0.53 | 0.35 | 81.82 | 59.09 | 68.11 | 1107 | 0.66 | 0.52 | 0.34 | 100 | 60.87 | 69.19 |
| 904 | 0.13 | 0.09 | 0.06 | 81.82 | 59.09 | 80.05 | 1118 | 0.11 | 0.07 | 0.05 | 100 | 60.87 | 97.03 | 904 | 0.14 | 0.09 | 0.06 | 81.82 | 59.09 | 84.49 | 1118 | 0.10 | 0.07 | 0.05 | 100 | 60.87 | 98.1 |
| 913 | 0.13 | 0.09 | 0.06 | 81.82 | 59.09 | 417.67 | 913 | 0.13 | 0.08 | 0.06 | 81.82 | 59.09 | 426.3 | ||||||||||||||
| 918 | 0.65 | 0.55 | 0.36 | 81.82 | 59.09 | 65.24 | 918 | 0.67 | 0.55 | 0.36 | 81.82 | 59.09 | 65.35 | ||||||||||||||
| No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | Sum on 1 analyzed binary loop (exec - 1107) | Sum on 1 analyzed binary loop (exec - 918) | Sum on 1 analyzed binary loop (exec - 1107) | ||||||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||
| Data Access Issues | Data Access Issues | Data Access Issues | |||||||||||||||||||||||||
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||
| Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | |||||||||||||||||||||||||
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||
inner_product.hpp: 82 - 2.32 %
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_1 | ||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 802 | 0.62 | 0.48 | 0.31 | 85.71 | 89.29 | 99.58 | 1102 | 0.55 | 0.46 | 0.30 | 100 | 100 | 103.7 | 802 | 0.58 | 0.48 | 0.32 | 85.71 | 89.29 | 97.96 | 1108 | 0.39 | 0.30 | 0.20 | 100 | 100 | 39.57 |
| 919 | 0.39 | 0.30 | 0.20 | 85.71 | 89.29 | 40.1 | 1108 | 0.40 | 0.30 | 0.20 | 100 | 100 | 40.23 | 905 | 0.06 | 0.02 | 0.01 | 85.71 | 89.29 | 84.62 | 1117 | 0.05 | 0.02 | 0.01 | 100 | 100 | 86.5 |
| 912 | 0.14 | 0.10 | 0.06 | 85.71 | 89.29 | 125.38 | 1117 | 0.04 | 0.02 | 0.01 | 100 | 100 | 91.59 | 919 | 0.39 | 0.30 | 0.20 | 85.71 | 89.29 | 40.01 | 1102 | 0.59 | 0.45 | 0.29 | 100 | 100 | 107.44 |
| 905 | 0.04 | 0.02 | 0.02 | 85.71 | 89.29 | 88.36 | 1097 | 0.21 | 0.09 | 0.06 | 100 | 100 | 127.68 | 912 | 0.15 | 0.09 | 0.06 | 85.71 | 89.29 | 128.67 | 1097 | 0.16 | 0.09 | 0.06 | 100 | 100 | 130.59 |
| No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | ||||||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||
BsplineFunctor.h: 291 - 1.37 %
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_1 | ||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 272 | 0.65 | 0.56 | 0.37 | 0 | 22.92 | 44.45 | 550 | 0.05 | 0.02 | 0.01 | 0 | 18.88 | 55.37 | 272 | 0.69 | 0.55 | 0.36 | 0 | 22.92 | 44.71 | 550 | 0.05 | 0.02 | 0.01 | 0 | 18.88 | 58.53 |
| 233 | 0.41 | 0.29 | 0.19 | 0 | 18.88 | 56.53 | 233 | 0.37 | 0.30 | 0.20 | 0 | 18.88 | 54.28 | ||||||||||||||
| 525 | 0.21 | 0.16 | 0.11 | 0 | 19.32 | 50.75 | 525 | 0.24 | 0.18 | 0.11 | 0 | 19.32 | 48.76 | ||||||||||||||
| Sum on 1 analyzed binary loop (exec - 272) | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | ||||||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||
| Loop Computation Issues | |||||||||||||||||||||||||||
| Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||||||||||||
| Control Flow Issues | |||||||||||||||||||||||||||
| Presence of 2 to 4 paths | 1 | ||||||||||||||||||||||||||
| Data Access Issues | |||||||||||||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||||||
| Vectorization Roadblocks | |||||||||||||||||||||||||||
| Presence of 2 to 4 paths | 1 | ||||||||||||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||||||
TwoBodyJastrowRef.h: 155 - 1.31 %
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_1 | ||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 302 | 0.22 | 0.16 | 0.11 | 85.71 | 89.29 | 148.52 | 229 | 0.60 | 0.49 | 0.32 | 100 | 100 | 146.34 | 302 | 0.26 | 0.15 | 0.10 | 85.71 | 89.29 | 156.47 | 229 | 0.69 | 0.49 | 0.32 | 100 | 100 | 147.29 |
| 301 | 0.22 | 0.17 | 0.11 | 100 | 100 | 142.6 | 301 | 0.25 | 0.17 | 0.11 | 100 | 100 | 143.35 | ||||||||||||||
| 300 | 0.25 | 0.18 | 0.12 | 85.71 | 89.29 | 136.16 | 300 | 0.25 | 0.18 | 0.12 | 85.71 | 89.29 | 136.36 | ||||||||||||||
| No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | ||||||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||
MultiBsplineRef.hpp: 276 - 1.12 %
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_1 | ||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 816 | 0.32 | 0.23 | 0.15 | 100 | 100 | 288.86 | 944 | 0.74 | 0.62 | 0.41 | 0 | 25 | 106.88 | 816 | 0.31 | 0.23 | 0.15 | 100 | 100 | 280.42 | 944 | 0.75 | 0.63 | 0.41 | 0 | 25 | 104.99 |
| No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | Sum on 1 analyzed binary loop (exec - 944) | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | Sum on 1 analyzed binary loop (exec - 944) | ||||||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||
| Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||||||||
| Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||||||||||
| Data Access Issues | Data Access Issues | ||||||||||||||||||||||||||
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||||
| Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||||||||
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||||
einspline_spo_ref.hpp: 223 - 0.90 %
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_1 | ||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 801 | 0.59 | 0.48 | 0.32 | 9.09 | 26.14 | 0 | 943 | 0.32 | 0.21 | 0.14 | 11.11 | 27.78 | 0 | 801 | 0.57 | 0.47 | 0.31 | 9.09 | 26.14 | 0 | 943 | 0.30 | 0.20 | 0.13 | 11.11 | 27.78 | 0 |
| No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | ||||||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||
TwoBodyJastrowRef.h: 324 - 0.75 %
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_1 | ||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 316 | 0.35 | 0.27 | 0.18 | 97.14 | 97.86 | 203.48 | 527 | 0.38 | 0.30 | 0.20 | 0 | 25 | 178.66 | 316 | 0.39 | 0.27 | 0.18 | 97.14 | 97.86 | 199.69 | 527 | 0.41 | 0.30 | 0.19 | 0 | 25 | 182.47 |
| No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | ||||||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||
inner_product.hpp: 211 - 0.24 %
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_1 | ||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 892 | 0.12 | 0.10 | 0.06 | 0 | 25 | 0 | 1126 | 0.10 | 0.09 | 0.06 | 14.29 | 28.57 | 0 | 892 | 0.11 | 0.09 | 0.06 | 0 | 25 | 0 | 1126 | 0.10 | 0.09 | 0.06 | 14.29 | 28.57 | 0 |
| No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | ||||||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||
OneBodyJastrowRef.h: 192 - 0.14 %
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_1 | ||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 225 | 0.04 | 0.02 | 0.01 | 100 | 100 | 79.47 | 752 | 0.10 | 0.06 | 0.04 | 100 | 100 | 101.6 | 224 | 0.06 | 0.03 | 0.02 | 85.71 | 89.29 | 74.28 | 752 | 0.13 | 0.07 | 0.04 | 100 | 100 | 100.12 |
| 224 | 0.05 | 0.03 | 0.02 | 85.71 | 89.29 | 72.76 | |||||||||||||||||||||
| 226 | 0.04 | 0.02 | 0.01 | 85.71 | 89.29 | 70.98 | |||||||||||||||||||||
| No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | ||||||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||
BsplineFunctor.h: 246 - 0.14 %
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_1 | ||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 326 | 0.09 | 0.05 | 0.03 | 75 | 93.75 | 797.91 | 296 | 0.09 | 0.05 | 0.03 | 75.56 | 100 | 775.7 | 326 | 0.10 | 0.06 | 0.04 | 75 | 93.75 | 690.67 | 296 | 0.09 | 0.05 | 0.03 | 75.56 | 100 | 783.89 |
| No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | ||||||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||
stl_numeric.h: 140 - 0.06 %
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_1 | ||||||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 307 | 0.06 | 0.02 | 0.02 | 80 | 85 | 415.32 | 231 | 0.05 | 0.02 | 0.01 | 100 | 100 | 377.05 | 307 | 0.05 | 0.02 | 0.02 | 80 | 85 | 374.94 | 231 | 0.05 | 0.02 | 0.01 | 100 | 100 | 367.54 |
| No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | ||||||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||

