Loops
MultiBsplineRef.hpp: 68 - 91.13 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_4 | ||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
823 | 8.52 | 8.09 | 5.91 | 90 | 95 | 1111 | 31.39 | 30.93 | 22.33 | 100 | 100 | 866 | 8.66 | 8.17 | 5.91 | 90 | 95 | 961 | 31.02 | 30.32 | 21.98 | 2.33 | 51.16 |
822 | 8.58 | 8.15 | 5.95 | 90 | 95 | | | | | | | 869 | 8.44 | 7.99 | 5.77 | 90 | 95 | | | | | | |
824 | 8.35 | 7.98 | 5.83 | 90 | 95 | | | | | | | 867 | 8.51 | 8.11 | 5.86 | 90 | 95 | | | | | | |
825 | 8.41 | 7.96 | 5.81 | 90 | 95 | | | | | | | 868 | 8.44 | 8.00 | 5.79 | 90 | 95 | | | | | | |
Sum on 4 analyzed binary loops (exec - 823, exec - 822, exec - 824, exec - 825) | Sum on 1 analyzed binary loop (exec - 1111) | Sum on 4 analyzed binary loops (exec - 866, exec - 869, exec - 867, exec - 868) | Sum on 1 analyzed binary loop (exec - 961) | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||
Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | ||||||||||||||||||
Data Access Issues | Data Access Issues | Data Access Issues | Data Access Issues | ||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 |
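The "constant non-unit stride data access" entry above is reported for every run of this loop. As a reading aid, the sketch below shows in general terms what such an access pattern looks like and how a structure-of-arrays layout turns it into a unit-stride one. It is not taken from MultiBsplineRef.hpp: `Point3`, `Points3SoA`, `to_soa`, `sum_x_aos` and `sum_x_soa` are hypothetical names introduced only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical array-of-structures layout: x, y and z are interleaved in
// memory, so reading only x touches every third double (constant stride 3).
struct Point3 {
    double x, y, z;
};

// Hypothetical structure-of-arrays layout: each component is contiguous.
struct Points3SoA {
    std::vector<double> x, y, z;
};

// Convert the interleaved layout into three contiguous arrays.
Points3SoA to_soa(const std::vector<Point3>& pts) {
    Points3SoA out;
    out.x.reserve(pts.size());
    out.y.reserve(pts.size());
    out.z.reserve(pts.size());
    for (const Point3& p : pts) {
        out.x.push_back(p.x);
        out.y.push_back(p.y);
        out.z.push_back(p.z);
    }
    assert(out.x.size() == pts.size());
    return out;
}

// Constant non-unit stride: consecutive loads are sizeof(Point3) bytes apart.
double sum_x_aos(const std::vector<Point3>& pts) {
    double s = 0.0;
    for (std::size_t i = 0; i < pts.size(); ++i) {
        s += pts[i].x;
    }
    return s;
}

// Unit stride: consecutive loads are sizeof(double) bytes apart.
double sum_x_soa(const Points3SoA& pts) {
    double s = 0.0;
    for (std::size_t i = 0; i < pts.x.size(); ++i) {
        s += pts.x[i];
    }
    return s;
}
```

Both functions return the same sum; only the memory layout differs. Whether a layout change of this kind is applicable to the reported loop depends on the actual source, which this report does not show.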
SoaDistanceTableAAOMPTarget.h: 440 - 37.16 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_4 | ||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1898 | 13.62 | 13.10 | 9.57 | 0 | 50 | 3292 | 13.13 | 12.61 | 9.10 | 0 | 50 | 1942 | 13.24 | 12.94 | 9.36 | 0 | 50 | 3062 | 13.02 | 12.60 | 9.13 | 0 | 50 |
Sum on 1 analyzed binary loop (exec - 1898) | Sum on 1 analyzed binary loop (exec - 3292) | Sum on 1 analyzed binary loop (exec - 1942) | Sum on 1 analyzed binary loop (exec - 3062) | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||
Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||
Data Access Issues | Data Access Issues | Data Access Issues | Data Access Issues | ||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||
Presence of indirect access | 0 | Presence of indirect access | 1 | Presence of indirect access | 0 | Presence of indirect access | 1 | ||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||
Presence of indirect access | 0 | Presence of indirect access | 1 | Presence of indirect access | 0 | Presence of indirect access | 1 |
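In the table above, "Presence of indirect access" is counted once for gcc_default and gcc_4 and zero times for orig_default and armclang_3. As a general illustration of the term (not a reconstruction of the loop in SoaDistanceTableAAOMPTarget.h), the sketch below contrasts a load whose address goes through an index array with a direct contiguous load. The function names `sum_indirect` and `sum_direct` are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Indirect access: the address of each load depends on the value idx[i],
// so the compiler needs a gather (or scalar loads) to read a[idx[i]].
double sum_indirect(const std::vector<double>& a,
                    const std::vector<std::size_t>& idx) {
    double s = 0.0;
    for (std::size_t i = 0; i < idx.size(); ++i) {
        assert(idx[i] < a.size());
        s += a[idx[i]];
    }
    return s;
}

// Direct access: the same elements, addressed as a contiguous range
// [first, first + count), so consecutive loads are adjacent in memory.
double sum_direct(const std::vector<double>& a,
                  std::size_t first, std::size_t count) {
    assert(first + count <= a.size());
    double s = 0.0;
    for (std::size_t i = 0; i < count; ++i) {
        s += a[first + i];
    }
    return s;
}
```

When the index array happens to describe a contiguous range, as in a call with indices 1, 2, 3, the two forms compute the same result, but only the second exposes that contiguity to the compiler.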
MultiBsplineRef.hpp: 242 - 36.90 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_4 | ||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
833 | 12.64 | 12.14 | 8.86 | 97.92 | 98.96 | 1154 | 13.73 | 13.38 | 9.66 | 0 | 50 | 877 | 12.41 | 12.05 | 8.72 | 97.92 | 98.96 | 1002 | 13.63 | 13.33 | 9.66 | 0 | 50 |
Sum on 1 analyzed binary loop (exec - 833) | Sum on 1 analyzed binary loop (exec - 1154) | Sum on 1 analyzed binary loop (exec - 877) | Sum on 1 analyzed binary loop (exec - 1002) | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Data Access Issues | Data Access Issues | Data Access Issues | Data Access Issues | ||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | ||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access |
BsplineFunctor.h: 236 - 4.38 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_4 | ||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
268 | 0.07 | 0.04 | 0.03 | 0 | 45.83 | 391 | 3.22 | 2.98 | 2.15 | 0 | 40.38 | 312 | 0.07 | 0.04 | 0.03 | 0 | 45.83 | 261 | 0.08 | 0.04 | 0.03 | 0 | 40.38 |
279 | 0.10 | 0.04 | 0.03 | 0 | 40.38 | 348 | 3.11 | 2.91 | 2.11 | 0 | 40.38 | ||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (exec - 391) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (exec - 348) | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||||
Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||||||
Control Flow Issues | Control Flow Issues | ||||||||||||||||||||||
Presence of more than 4 paths | 1 | Presence of more than 4 paths | 1 | ||||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||||
Presence of more than 4 paths | 1 | Presence of more than 4 paths | 1 |
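"Presence of more than 4 paths" is listed above both as a control flow issue and as a vectorization roadblock for the two runs that have an analyzed loop. The sketch below is a generic illustration of that situation, assuming a loop body with a five-way if/else chain, and one possible way to replace the chain with a table lookup. It is not the code at BsplineFunctor.h:236: `sum_branchy`, `sum_table` and the weights are hypothetical.

```cpp
#include <algorithm>
#include <array>
#include <cassert>
#include <cstddef>
#include <vector>

// Five execution paths through the loop body (four tested ranges plus the
// fall-through case), which is the kind of shape the report flags.
double sum_branchy(const std::vector<double>& r) {
    double s = 0.0;
    for (double v : r) {
        assert(v >= 0.0);
        if (v < 1.0) {
            s += 4.0;
        } else if (v < 2.0) {
            s += 3.0;
        } else if (v < 3.0) {
            s += 2.0;
        } else if (v < 4.0) {
            s += 1.0;
        }
        // v >= 4.0 contributes nothing
    }
    return s;
}

// Same result with a single path: clamp v to 4, truncate it to a bin index
// and read the contribution from a small table.
double sum_table(const std::vector<double>& r) {
    static const std::array<double, 5> weight = {4.0, 3.0, 2.0, 1.0, 0.0};
    double s = 0.0;
    for (double v : r) {
        assert(v >= 0.0);
        const std::size_t bin = static_cast<std::size_t>(std::min(v, 4.0));
        s += weight[bin];
    }
    return s;
}
```

The table version trades branches for an indexed load, which is itself a pattern the report can flag, so this is a trade-off to measure rather than a guaranteed improvement.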
inner_product.hpp: 155 - 3.57 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_4 | ||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
934 | 0.76 | 0.56 | 0.41 | 90.91 | 95.45 | 1339 | 0.09 | 0.05 | 0.04 | 100 | 100 | 964 | 0.14 | 0.09 | 0.07 | 90 | 95 | 1175 | 0.62 | 0.51 | 0.37 | 16.67 | 58.33 |
925 | 0.69 | 0.52 | 0.38 | 90 | 95 | 1354 | 0.77 | 0.54 | 0.39 | 100 | 100 | 969 | 0.67 | 0.52 | 0.38 | 90 | 95 | 1180 | 0.80 | 0.57 | 0.42 | 16.67 | 58.33 |
920 | 0.14 | 0.10 | 0.07 | 90 | 95 | 1348 | 0.66 | 0.48 | 0.35 | 100 | 100 | 972 | 0.12 | 0.08 | 0.06 | 90 | 95 | 1194 | 0.14 | 0.09 | 0.06 | 16.67 | 58.33 |
928 | 0.14 | 0.08 | 0.06 | 90 | 95 | 1371 | 0.13 | 0.09 | 0.06 | 100 | 100 | 978 | 0.70 | 0.56 | 0.41 | 90.91 | 95.45 | 1167 | 0.14 | 0.08 | 0.06 | 16.67 | 58.33 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (exec - 1354) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Data Access Issues | |||||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||
Vectorization Roadblocks | |||||||||||||||||||||||
Presence of constant non-unit stride data access | 1 |
TwoBodyJastrowRef.h: 155 - 1.67 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_4 | ||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
294 | 0.28 | 0.20 | 0.14 | 100 | 100 | 315 | 0.73 | 0.52 | 0.37 | 100 | 100 | 338 | 0.28 | 0.19 | 0.14 | 100 | 100 | 293 | 0.91 | 0.74 | 0.54 | 0 | 50 |
293 | 0.26 | 0.17 | 0.12 | 100 | 100 | | | | | | | 336 | 0.23 | 0.16 | 0.12 | 100 | 100 | | | | | | |
292 | 0.23 | 0.16 | 0.12 | 100 | 100 | | | | | | | 337 | 0.24 | 0.16 | 0.12 | 100 | 100 | | | | | | |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (exec - 293) | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Data Access Issues | |||||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||
Vectorization Roadblocks | |||||||||||||||||||||||
Presence of constant non-unit stride data access | 1 |
BsplineFunctor.h: 291 - 1.59 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_4 | ||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
264 | 0.75 | 0.60 | 0.44 | 0 | 41.67 | 688 | 0.06 | 0.03 | 0.02 | 0 | 38.64 | 308 | 0.73 | 0.59 | 0.42 | 0 | 41.67 | 297 | 0.39 | 0.29 | 0.21 | 0 | 39.63 |
677 | 0.25 | 0.19 | 0.13 | 0 | 38.04 | 607 | 0.06 | 0.03 | 0.02 | 0 | 38.64 | ||||||||||||
320 | 0.39 | 0.28 | 0.20 | 0 | 39.63 | 597 | 0.26 | 0.19 | 0.14 | 0 | 38.04 | ||||||||||||
Sum on 1 analyzed binary loop (exec - 264) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (exec - 308) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||||
Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||||||
Control Flow Issues | Control Flow Issues | ||||||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||||
Presence of more than 4 paths | 1 | Presence of more than 4 paths | 1 |
inner_product.hpp: 82 - 1.43 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_4 | ||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
816 | 0.18 | 0.12 | 0.09 | 100 | 100 | 1370 | 0.05 | 0.03 | 0.02 | 100 | 100 | 977 | 0.43 | 0.31 | 0.23 | 100 | 100 | 1179 | 0.43 | 0.31 | 0.23 | 0 | 50 |
919 | 0.07 | 0.03 | 0.02 | 100 | 100 | 1346 | 0.23 | 0.16 | 0.12 | 100 | 100 | 971 | 0.10 | 0.04 | 0.03 | 100 | 100 | 1195 | 0.07 | 0.03 | 0.02 | 0 | 50 |
927 | 0.08 | 0.04 | 0.03 | 100 | 100 | 1340 | 0.07 | 0.04 | 0.03 | 100 | 100 | 963 | 0.06 | 0.03 | 0.02 | 100 | 100 | 1173 | 0.40 | 0.32 | 0.23 | 0 | 50 |
933 | 0.43 | 0.32 | 0.23 | 100 | 100 | | | | | | | 860 | 0.17 | 0.12 | 0.09 | 100 | 100 | 1168 | 0.13 | 0.07 | 0.05 | 0 | 50 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
MultiBsplineRef.hpp: 276 - 1.37 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_4 | ||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
836 | 0.52 | 0.38 | 0.27 | 96.43 | 98.21 | 1150 | 0.68 | 0.56 | 0.41 | 0 | 50 | 880 | 0.47 | 0.38 | 0.27 | 96.43 | 98.21 | 1001 | 0.73 | 0.58 | 0.42 | 0 | 50 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (exec - 1150) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (exec - 1001) | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||||
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||||||
Data Access Issues | Data Access Issues | ||||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 |
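The "Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA" entry above concerns multiplies and adds that are not fused. As a minimal sketch of what fusing means at source level, the example below writes a dot product once with a separate multiply and add and once with `std::fma` from `<cmath>`. It is illustrative only and not derived from MultiBsplineRef.hpp:276: `dot_separate` and `dot_fma` are hypothetical names.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Multiply and add written as two operations. Depending on target and
// floating-point contraction settings, the compiler may or may not fuse them.
double dot_separate(const std::vector<double>& a, const std::vector<double>& b) {
    assert(a.size() == b.size());
    double acc = 0.0;
    for (std::size_t i = 0; i < a.size(); ++i) {
        const double prod = a[i] * b[i];
        acc = acc + prod;
    }
    return acc;
}

// Multiply-add requested explicitly: std::fma(x, y, z) computes x * y + z
// with a single rounding.
double dot_fma(const std::vector<double>& a, const std::vector<double>& b) {
    assert(a.size() == b.size());
    double acc = 0.0;
    for (std::size_t i = 0; i < a.size(); ++i) {
        acc = std::fma(a[i], b[i], acc);
    }
    return acc;
}
```

For inputs such as {1, 2, 3} and {4, 5, 6}, where every intermediate value is exactly representable, both versions return 32. In general the fused form can differ in the last bits because it rounds once instead of twice, and whether `std::fma` maps to a hardware instruction depends on the target and compiler flags.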
einspline_spo_ref.hpp: 223 - 1.32 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_4 | ||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
814 | 0.72 | 0.58 | 0.42 | 9.09 | 52.27 | 1158 | 0.43 | 0.33 | 0.24 | 11.11 | 55.56 | 858 | 0.70 | 0.58 | 0.42 | 9.09 | 52.27 | 1006 | 0.43 | 0.34 | 0.24 | 11.11 | 55.56 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
TwoBodyJastrowRef.h: 324 - 0.94 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_4 | ||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
309 | 0.42 | 0.33 | 0.24 | 96.55 | 98.28 | 682 | 0.47 | 0.33 | 0.24 | 0 | 50 | 353 | 0.38 | 0.31 | 0.23 | 96.55 | 98.28 | 602 | 0.42 | 0.32 | 0.23 | 0 | 50 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
inner_product.hpp: 211 - 0.26 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_4 | ||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
904 | 0.10 | 0.09 | 0.07 | 0 | 50 | 1379 | 0.09 | 0.09 | 0.06 | 14.29 | 57.14 | 948 | 0.10 | 0.09 | 0.07 | 0 | 50 | 1200 | 0.09 | 0.08 | 0.06 | 14.29 | 57.14 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
TwoBodyJastrowRef.h: 381 - 0.18 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_4 | ||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
342 | 0.05 | 0.02 | 0.01 | 100 | 100 | 722 | 0.13 | 0.06 | 0.04 | 100 | 100 | 386 | 0.06 | 0.02 | 0.01 | 100 | 100 | 630 | 0.13 | 0.07 | 0.05 | 0 | 50 |
343 | 0.04 | 0.02 | 0.01 | 100 | 100 | | | | | | | 387 | 0.04 | 0.02 | 0.01 | 100 | 100 | | | | | | |
341 | 0.05 | 0.02 | 0.02 | 100 | 100 | | | | | | | 385 | 0.05 | 0.02 | 0.02 | 100 | 100 | | | | | | |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
BsplineFunctor.h: 246 - 0.15 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_4 | ||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
319 | 0.07 | 0.05 | 0.03 | 52.38 | 78.57 | 392 | 0.08 | 0.05 | 0.04 | 75.56 | 100 | 363 | 0.08 | 0.05 | 0.03 | 52.38 | 78.57 | 349 | 0.12 | 0.07 | 0.05 | 7.25 | 53.62 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
TwoBodyJastrowRef.h: 388 - 0.07 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_4 | ||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
340 | 0.05 | 0.03 | 0.02 | 94.12 | 97.06 | 719 | 0.04 | 0.02 | 0.01 | 100 | 100 | 384 | 0.05 | 0.02 | 0.02 | 94.12 | 97.06 | 627 | 0.05 | 0.03 | 0.02 | 0 | 50 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||