| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_5 |
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/cg.cpp: 108-113
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/cg.cpp: 108-113
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/cg.cpp: 108-113
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/cg.cpp: 108-113
|
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
| 19 | 116.31 | 114.92 | 43.27 | 0 | 25 | 35.42 | 26 | 116.32 | 115.23 | 43.21 | 75 | 43.75 | 35.22 | 29 | 116.27 | 115.19 | 43.16 | 100 | 100 | 35.23 | 19 | 116.97 | 115.76 | 43.45 | 0 | 25 | 35.13 |
| | | |
| Sum on 1 analyzed binary loop (exec - 19) | Sum on 1 analyzed binary loop (exec - 26) | Sum on 1 analyzed binary loop (exec - 29) | Sum on 1 analyzed binary loop (exec - 19) |
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
| Data Access Issues | | Data Access Issues | | Data Access Issues | | Data Access Issues | |
| Presence of constant non-unit stride data access | | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 |
| Vectorization Roadblocks | | Vectorization Roadblocks | | Vectorization Roadblocks | | Vectorization Roadblocks | |
| Presence of constant non-unit stride data access | | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 |
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_5 |
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/cg.cpp: 86-90
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/cg.cpp: 86-90
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/cg.cpp: 86-90
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/cg.cpp: 86-90
|
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
| 16 | 89.56 | 87.14 | 32.81 | 4.35 | 26.09 | 116.43 | 21 | 88.79 | 87.84 | 32.94 | 88.46 | 47.12 | 123.17 | 25 | 88.93 | 87.22 | 32.68 | 95.45 | 100 | 116.16 | 16 | 87.86 | 86.95 | 32.63 | 0 | 25 | 116.74 |
| | | |
| Sum on 1 analyzed binary loop (exec - 16) | Sum on 1 analyzed binary loop (exec - 21) | Sum on 1 analyzed binary loop (exec - 25) | Sum on 1 analyzed binary loop (exec - 16) |
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
| Data Access Issues | | Data Access Issues | | Data Access Issues | | Data Access Issues | |
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 |
| Vectorization Roadblocks | | Vectorization Roadblocks | | Vectorization Roadblocks | | Vectorization Roadblocks | |
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 |
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_5 |
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/cg.cpp: 128-131
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/cg.cpp: 128-131
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/cg.cpp: 128-131
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/cg.cpp: 128-131
|
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
| 23 | 55.12 | 54.41 | 20.49 | 83.33 | 70.83 | 24.85 | 30 | 55.84 | 55.32 | 20.75 | 100 | 50 | 24.53 | 33 | 55.70 | 55.02 | 20.62 | 88.89 | 91.67 | 24.58 | 20 | 56.06 | 55.38 | 20.78 | 0 | 25 | 24.52 |
| | | |
| Sum on 1 analyzed binary loop (exec - 23) | Sum on 1 analyzed binary loop (exec - 30) | Sum on 1 analyzed binary loop (exec - 33) | Sum on 1 analyzed binary loop (exec - 20) |
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
| Data Access Issues | | Data Access Issues | | Data Access Issues | | Data Access Issues | |
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 |
| Vectorization Roadblocks | | Vectorization Roadblocks | | Vectorization Roadblocks | | Vectorization Roadblocks | |
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 |
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_5 |
| Loop Source Regions | | Loop Source Regions | | Loop Source Regions | | Loop Source Regions | |
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
| 0 | 0.09 | 0.02 | 0.01 | 0 | 0 | 4.01 | 14 | 0.05 | 0.02 | 0.01 | 0 | 0 | 2.45 | 0 | 0.07 | 0.02 | 0.01 | 0 | 0 | 5.56 | 3 | 0.04 | 0.01 | 0.00 | 0 | 0 | 38.42 |
| 132 | 0.03 | 0.00 | 0.00 | 0 | 0 | 411.41 | 149 | 0.04 | 0.00 | 0.00 | 0 | 0 | 355.82 | 168 | 0.03 | 0.00 | 0.00 | 0 | 0 | 479.98 | 6 | 0.03 | 0.00 | 0.00 | 0 | 0 | 0 |
| 11 | 0.04 | 0.00 | 0.00 | 0 | 0 | 50.1 | 91 | 0.04 | 0.01 | 0.01 | 0 | 0 | 0 | 155 | 0.04 | 0.00 | 0.00 | 0 | 0 | 0 | 9 | 0.04 | 0.01 | 0.00 | 0 | 0 | 3.87 |
| 79 | 0.05 | 0.01 | 0.00 | 0 | 0 | 0 | 4 | 0.03 | 0.00 | 0.00 | 0 | 0 | 53.31 | 103 | 0.05 | 0.01 | 0.01 | 0 | 0 | 0 | 14 | 0.07 | 0.02 | 0.01 | 0 | 0 | 3.25 |
| 131 | 0.03 | 0.00 | 0.00 | 0 | 0 | 0 | 144 | 0.03 | 0.00 | 0.00 | 0 | 0 | 4.44 | 7 | 0.03 | 0.00 | 0.00 | 0 | 0 | 0 | 195 | 0.04 | 0.00 | 0.00 | 0 | 0 | 0 |
| 63 | 0.08 | 0.00 | 0.00 | 0 | 0 | 122.66 | 23 | 0.05 | 0.01 | 0.00 | 0 | 0 | 0 | 83 | 0.08 | 0.00 | 0.00 | 0 | 0 | 122.66 | 101 | 0.03 | 0.00 | 0.00 | 0 | 0 | 477.12 |
| 125 | 0.04 | 0.01 | 0.00 | 0 | 0 | 69.67 | 71 | 0.08 | 0.00 | 0.00 | 0 | 0 | 121.54 | 167 | 0.03 | 0.00 | 0.00 | 0 | 0 | 8.89 | 91 | 0.04 | 0.02 | 0.01 | 0 | 0 | 69.75 |
| 76 | 0.04 | 0.00 | 0.00 | 0 | 0 | 0 | 93 | 0.05 | 0.01 | 0.00 | 0 | 0 | 0 | 21 | 0.04 | 0.02 | 0.01 | 0 | 0 | 69.81 | 47 | 0.06 | 0.00 | 0.00 | 0 | 0 | 153.27 |
| 138 | 0.04 | 0.02 | 0.01 | 0 | 0 | 67.56 | 100 | 0.05 | 0.01 | 0.00 | 0 | 0 | 0 | 90 | 0.03 | 0.00 | 0.00 | 0 | 0 | 0 |
| | 158 | 0.05 | 0.02 | 0.01 | 0 | 0 | 62.45 | 61 | 0.03 | 0.00 | 0.00 | 0 | 0 | 0 |
| | 225 | 0.04 | 0.00 | 0.00 | 0 | 0 | 0 | 96 | 0.03 | 0.00 | 0.00 | 0 | 0 | 5.33 |
| | 15 | 0.04 | 0.01 | 0.00 | 0 | 0 | 47.76 | |
| | | |
| No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. |
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_5 |
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/local_halos.cpp: 13-15
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/local_halos.cpp: 13-15
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/local_halos.cpp: 13-15
| Loop Source Regions | |
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
| 70 | 0.14 | 0.09 | 0.03 | 0 | 25 | 0 | 80 | 0.16 | 0.08 | 0.03 | 0 | 25 | 0 | 93 | 0.14 | 0.10 | 0.04 | 0 | 25 | 0 | |
| | | |
| Sum on 1 analyzed binary loop (exec - 70) | Sum on 1 analyzed binary loop (exec - 80) | Sum on 1 analyzed binary loop (exec - 93) | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. |
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
| Data Access Issues | | Data Access Issues | | Data Access Issues | | | |
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | | |
| Vectorization Roadblocks | | Vectorization Roadblocks | | Vectorization Roadblocks | | | |
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | | |
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_5 |
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/local_halos.cpp: 28-30
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/local_halos.cpp: 28-30
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/local_halos.cpp: 28-30
| Loop Source Regions | |
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
| 72 | 0.07 | 0.04 | 0.01 | 0 | 25 | 0 | 85 | 0.07 | 0.04 | 0.01 | 0 | 25 | 0 | 96 | 0.10 | 0.05 | 0.02 | 0 | 25 | 0 | |
| | | |
| Sum on 1 analyzed binary loop (exec - 72) | Sum on 1 analyzed binary loop (exec - 85) | Sum on 1 analyzed binary loop (exec - 96) | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. |
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
| Data Access Issues | | Data Access Issues | | Data Access Issues | | | |
| Presence of constant non-unit stride data access | | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | | | |
| Vectorization Roadblocks | | Vectorization Roadblocks | | Vectorization Roadblocks | | | |
| Presence of constant non-unit stride data access | | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | | | |
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_5 |
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/cg.cpp: 62-68
| Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/cg.cpp: 62-68
| Loop Source Regions | | Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/cg.cpp: 62-68
|
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
| 13 | 0.05 | 0.05 | 0.02 | 3.7 | 25.93 | 57.96 | 17 | 0.05 | 0.03 | 0.01 | 0 | 25 | 69.6 | | 13 | 0.05 | 0.03 | 0.01 | 0 | 25 | 78.84 |
| | | |
| Sum on 1 analyzed binary loop (exec - 13) | Sum on 1 analyzed binary loop (exec - 17) | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | Sum on 1 analyzed binary loop (exec - 13) |
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
| Data Access Issues | | Data Access Issues | | | | Data Access Issues | |
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | | | Presence of constant non-unit stride data access | 1 |
| Vectorization Roadblocks | | Vectorization Roadblocks | | | | Vectorization Roadblocks | |
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | | | Presence of constant non-unit stride data access | 1 |
| Run orig_default | Run gcc_default | Run armclang_1 | Run gcc_5 |
| Loop Source Regions | | Loop Source Regions | | Loop Source Regions | | Loop Source Regions | - /home/eoseret/qaas/qaas_runs/178-237-4322/intel/TeaLeaf/build/TeaLeaf/src/omp/cg.cpp: 105-105
|
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
| | | 17 | 0.11 | 0.04 | 0.02 | 0 | 25 | 0.03 |
| | | |
| No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | No Loops Overview analysis found for any assembly loop. More loops can be analyzed using option --summary-loop-count. | Sum on 1 analyzed binary loop (exec - 17) |
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
| | | | | | Loop Computation Issues | |
| | | | | | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 |