Loops
timestep.c: 74 - 5.07 %
Run orig_default | Run gcc_default | Run armclang_5 | Run gcc_1 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
115 | 0.03 | 0.01 | 0.09 | 6.67 | 51.67 | 150 | 0.15 | 0.13 | 1.56 | 100 | 100 | 115 | 0.02 | 0.01 | 0.09 | 6.67 | 51.67 | 149 | 0.18 | 0.13 | 1.63 | 100 | 100 |
116 | 0.23 | 0.20 | 1.70 | 75 | 87.5 | ||||||||||||||||||
Sum on 2 analyzed binary loops (exec - 115, exec - 116) | Sum on 1 analyzed binary loop (exec - 150) | Sum on 1 analyzed binary loop (exec - 115) | Sum on 1 analyzed binary loop (exec - 149) | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Data Access Issues | Data Access Issues | Data Access Issues | Data Access Issues | ||||||||||||||||||||
Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||
Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 |
timestep.c: 88 - 3.99 %
Run orig_default | Run gcc_default | Run armclang_5 | Run gcc_1 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
119 | 0.13 | 0.08 | 0.69 | 5 | 51.25 | 146 | 0.12 | 0.09 | 1.16 | 9.09 | 53.03 | 119 | 0.13 | 0.11 | 0.99 | 5 | 51.25 | 145 | 0.13 | 0.09 | 1.15 | 9.09 | 53.03 |
Sum on 1 analyzed binary loop (exec - 119) | Sum on 1 analyzed binary loop (exec - 146) | Sum on 1 analyzed binary loop (exec - 119) | Sum on 1 analyzed binary loop (exec - 145) | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||
Presence of expensive FP instructions | 1 | Presence of expensive FP instructions | 1 | Presence of expensive FP instructions | 1 | Presence of expensive FP instructions | 1 | ||||||||||||||||
Presence of a large number of scalar integer instructions | 0 | Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 0 | Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||
Data Access Issues | Data Access Issues | Data Access Issues | Data Access Issues | ||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 |
haloExchange.c: 621 - 3.32 %
Run orig_default | Run gcc_default | Run armclang_5 | Run gcc_1 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
68 | 0.14 | 0.07 | 0.64 | 0 | 43.75 | 84 | 0.11 | 0.08 | 0.98 | 47.37 | 71.71 | 65 | 0.14 | 0.09 | 0.85 | 0 | 43.75 | 84 | 0.09 | 0.07 | 0.85 | 47.37 | 71.71 |
Sum on 1 analyzed binary loop (exec - 68) | Sum on 1 analyzed binary loop (exec - 84) | Sum on 1 analyzed binary loop (exec - 65) | Sum on 1 analyzed binary loop (exec - 84) | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||
Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||
Data Access Issues | Data Access Issues | Data Access Issues | Data Access Issues | ||||||||||||||||||||
Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||
Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 |
haloExchange.c: 633 - 0.56 %
Run orig_default | Run gcc_default | Run armclang_5 | Run gcc_1 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
70 | 0.02 | 0.00 | 0.02 | 60 | 75 | 83 | 0.04 | 0.02 | 0.21 | 0 | 43.75 | 67 | 0.02 | 0.00 | 0.03 | 58.54 | 73.78 | 83 | 0.02 | 0.02 | 0.19 | 0 | 43.75 |
69 | 0.02 | 0.01 | 0.04 | 0 | 43.75 | 66 | 0.03 | 0.01 | 0.06 | 0 | 43.75 | ||||||||||||
Sum on 1 analyzed binary loop (exec - 69) | Sum on 1 analyzed binary loop (exec - 83) | Sum on 1 analyzed binary loop (exec - 66) | Sum on 1 analyzed binary loop (exec - 83) | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||
Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||
Data Access Issues | Data Access Issues | Data Access Issues | Data Access Issues | ||||||||||||||||||||
Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||
Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 |
timestep.c: 110 - 0.21 %
Run orig_default | Run gcc_default | Run armclang_5 | Run gcc_1 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
122 | 0.01 | 0.00 | 0.04 | 10 | 52.5 | 155 | 0.02 | 0.01 | 0.07 | 14.29 | 54.29 | 122 | 0.02 | 0.00 | 0.04 | 16.67 | 56.25 | 154 | 0.01 | 0.01 | 0.06 | 75 | 87.5 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (exec - 122) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Loop Computation Issues | |||||||||||||||||||||||
Presence of expensive FP instructions | 1 | ||||||||||||||||||||||
Data Access Issues | |||||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||
Vectorization Roadblocks | |||||||||||||||||||||||
Presence of constant non-unit stride data access | 1 |
haloExchange.c: 380 - 0.13 %
Run orig_default | Run gcc_default | Run armclang_5 | Run gcc_1 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
48 | 0.29 | 0.00 | 0.03 | 0 | 47.06 | 47 | 0.24 | 0.00 | 0.04 | 0 | 43.75 | 48 | 0.22 | 0.00 | 0.02 | 0 | 47.06 | 47 | 0.25 | 0.00 | 0.04 | 0 | 43.75 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |