Run gcc_5 | Run icx_1 |
Loop Source Regions | - /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/tpl/raja/include/RAJA/policy/loop/forall.hpp: 59-59
- /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/src/Kripke/Kernel/LPlusTimes.cpp: 57-57
- /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/tpl/raja/include/RAJA/util/Operators.hpp: 307-307
- /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/tpl/raja/include/RAJA/util/Operators.hpp: 304-304
- /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/src/Kripke/Kernel/LTimes.cpp: 62-62
- /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/tpl/raja/include/RAJA/util/View.hpp: 107-107
- /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/src/Kripke/Kernel/SweepSubdomain.cpp: 87-105
| Loop Source Regions | - /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/tpl/raja/include/RAJA/policy/loop/forall.hpp: 59-59
- /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/src/Kripke/Kernel/LPlusTimes.cpp: 57-57
- /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/src/Kripke/Kernel.h: 45-45
- /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/src/Kripke/Kernel/SweepSubdomain.cpp: 87-87
- /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/src/Kripke/Kernel/SweepSubdomain.cpp: 95-105
- /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/src/Kripke/Kernel/LTimes.cpp: 62-62
- /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/tpl/raja/include/RAJA/util/Operators.hpp: 307-307
- /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/src/Kripke/Kernel/Population.cpp: 58-58
|
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
1851 | 0.20 | 0.21 | 2.44 | 0 | 12.5 | 0 | 809 | 0.03 | 0.01 | 0.06 | 100 | 50 | 0 |
1969 | 0.06 | 0.04 | 0.46 | 0 | 12.5 | 0 | 811 | 0.19 | 0.16 | 1.43 | 100 | 50 | 0 |
1908 | 0.43 | 0.43 | 5.06 | 0 | 12.5 | 0 | 745 | 0.06 | 0.00 | 0.01 | 100 | 50 | 0 |
2183 | 0.22 | 0.16 | 1.93 | 7.27 | 13.41 | 0 | 1430 | 0.15 | 0.09 | 0.81 | 0 | 12.5 | 0 |
| 963 | 0.51 | 0.32 | 2.89 | 100 | 50 | 0 |
| 961 | 0.25 | 0.14 | 1.28 | 100 | 50 | 0 |
| 1124 | 0.05 | 0.02 | 0.22 | 100 | 50 | 0 |
| |
Sum on 4 analyzed binary loops (exec - 1851, exec - 1969, exec - 1908, exec - 2183) | Sum on 6 analyzed binary loops (exec - 809, exec - 811, exec - 1430, exec - 963, exec - 961, exec - 1124) |
Analysis | Count | Analysis | Count |
Loop Computation Issues | | Loop Computation Issues | |
Presence of expensive FP instructions | 1 | Presence of expensive FP instructions | |
Data Access Issues | | Data Access Issues | |
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 0 |
More than 10% of the vector loads instructions are unaligned | 0 | More than 10% of the vector loads instructions are unaligned | 1 |
Vectorization Roadblocks | | Vectorization Roadblocks | |
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | |
Run gcc_5 | Run icx_1 |
Loop Source Regions | | Loop Source Regions | - /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/src/Kripke/Kernel/Scattering.cpp: 91-95
- /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/tpl/raja/include/RAJA/index/IndexValue.hpp: 217-217
- /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/build/Kripke/tpl/raja/include/RAJA/util/Layout.hpp: 55-55
|
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
| 1275 | 2.24 | 1.81 | 16.24 | 0 | 12.5 | 0 |
| |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (exec - 1275) |
Analysis | Count | Analysis | Count |
| | Data Access Issues | |
| | Presence of constant non-unit stride data access | 1 |
| | Vectorization Roadblocks | |
| | Presence of constant non-unit stride data access | 1 |
Run gcc_5 | Run icx_1 |
Loop Source Regions | | Loop Source Regions | |
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
1357 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 1927 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 |
122 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 1679 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 |
2139 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 1929 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 |
121 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 1943 | 0.01 | 0.00 | 0.01 | 0 | 0 | 0 |
2339 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 1944 | 0.03 | 0.01 | 0.10 | 11.11 | 13.19 | 0 |
| 1259 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 |
| 525 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 |
| 1122 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 |
| 1372 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 |
| 756 | 0.05 | 0.00 | 0.01 | 0 | 0 | 0 |
| 751 | 0.03 | 0.00 | 0.01 | 0 | 0 | 0 |
| |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (exec - 1944) |
Analysis | Count | Analysis | Count |
| | Control Flow Issues | |
| | Presence of 2 to 4 paths | 1 |
| | Data Access Issues | |
| | Presence of constant non-unit stride data access | 1 |
| | More than 10% of the vector loads instructions are unaligned | 1 |
| | Vectorization Roadblocks | |
| | Presence of 2 to 4 paths | 1 |
| | Presence of constant non-unit stride data access | 1 |