Run gcc/gcc-O3/points_100000000 | Run acfl/acfl-O3/points_100000000 |
Loop Source Regions | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 60-65
| Loop Source Regions | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 61-67
|
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
4 | 85.75 | 85.75 | 89.57 | 18.18 | 52.27 | 8.52 | 9 | 101.55 | 101.55 | 91.78 | 10.8 | 53.13 | 4 |
| |
Sum on 1 analyzed binary loop (kmeans-gcc-O3 - 4) | Sum on 1 analyzed binary loop (kmeans-acfl-O3 - 9) |
Analysis | Count | Analysis | Count |
Loop Computation Issues | | Loop Computation Issues | |
Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | |
Control Flow Issues | | Control Flow Issues | |
Presence of 2 to 4 paths | 0 | Presence of 2 to 4 paths | 1 |
Vectorization Roadblocks | | Vectorization Roadblocks | |
Presence of 2 to 4 paths | 0 | Presence of 2 to 4 paths | 1 |
Presence of more than 4 paths | 1 | Presence of more than 4 paths | 0 |
Run gcc/gcc-O3/points_100000000 | Run acfl/acfl-O3/points_100000000 |
Loop Source Regions | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 81-84
| Loop Source Regions | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 81-84
|
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
11 | 7.39 | 7.39 | 7.72 | 10 | 47.5 | 0.68 | 7 | 7.88 | 7.88 | 7.12 | 11.11 | 52.78 | 0.63 |
| |
Sum on 1 analyzed binary loop (kmeans-gcc-O3 - 11) | Sum on 1 analyzed binary loop (kmeans-acfl-O3 - 7) |
Analysis | Count | Analysis | Count |
Loop Computation Issues | | Loop Computation Issues | |
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 |
Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 |
Data Access Issues | | Data Access Issues | |
Presence of indirect access | 1 | Presence of indirect access | 1 |
Vectorization Roadblocks | | Vectorization Roadblocks | |
Presence of indirect access | 1 | Presence of indirect access | 1 |