ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | ORIG / DL1 | DL1/CQA(DL1) | ORIG (cycles per iteration) | STA (ORIG) | DL1 (cycles per iteration) | STA (DL1) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only | Instance Count | min (Iteration count) | avg (Iteration count) | max (Iteration count) | min (Cycles per Iteration) | avg (Cycles per Iteration) | max (Cycles per Iteration) | Nb FP_ADD / CPI | Nb FP_MUL / CPI | CAP(FP) | BW(FP) | SAT(FP) | CAP(L1R) | BW(L1R) | SAT(L1R) | CAP(L1W) | BW(L1W) | SAT(L1W) | CAP(L2) | BW(L2) | SAT(L2) | CAP(L3) | BW(L3) | SAT(L3) | CAP(RAM_R) | CAP(RAM_W) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Loop 5 | exec | Step10_orig.c:19-35 | Step10_orig | Single | 41.69 | 41.69 | 99.82 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 45.63 | 0.98 | 1.63 | 63.96 | 0.01 | 65.36 | 0.04 | 34.00 - 40.00 | 34.00 - 40.00 | 34.00 - 40.00 | 34.00 - 40.00 | 34.00 - 40.00 | 2190000 | 50 | 961 | 1872 | 39.79 | 41.11 | 3646.22 | 0.50 | 0.38 | 3.88 | 16 | 24.23 | 2.00 | 64 | 3.13 | 0.00 | 32 | 0.00 | NA | 32 | NA | NA | 15 | NA | NA | NA |
Bucket 6 | | Step10_orig.c:19-35 | Step10_orig | | | | 99.93 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 45.63 | 0.98 | 1.63 | 63.96 | 0.01 | 65.36 | 0.04 | 34.00 - 40.00 | 34.00 - 40.00 | 34.00 - 40.00 | 34.00 - 40.00 | 34.00 - 40.00 | | | | | | | | 0.50 | 0.38 | 3.88 | 16 | 24.23 | 2.00 | 64 | 3.13 | 0.00 | 32 | 0.00 | NA | 32 | NA | NA | 15 | NA | NA | NA |
Bucket 9 | | Step10_orig.c:19-35 | Step10_orig | | | | 0.03 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 45.63 | NA | NA | NA | NA | NA | NA | 34.00 - 40.00 | 34.00 - 40.00 | 34.00 - 40.00 | 34.00 - 40.00 | 34.00 - 40.00 | | | | | | | | NA | NA | NA | 16 | NA | NA | 64 | NA | NA | 32 | NA | NA | 32 | NA | NA | 15 | NA | NA | NA |
Bucket 7 | | Step10_orig.c:19-35 | Step10_orig | | | | 0.02 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 45.63 | NA | NA | NA | NA | NA | NA | 34.00 - 40.00 | 34.00 - 40.00 | 34.00 - 40.00 | 34.00 - 40.00 | 34.00 - 40.00 | | | | | | | | NA | NA | NA | 16 | NA | NA | 64 | NA | NA | 32 | NA | NA | 32 | NA | NA | 15 | NA | NA | NA |
Bucket 8 | | Step10_orig.c:19-35 | Step10_orig | | | | 0.01 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 45.63 | NA | NA | NA | NA | NA | NA | 34.00 - 40.00 | 34.00 - 40.00 | 34.00 - 40.00 | 34.00 - 40.00 | 34.00 - 40.00 | | | | | | | | NA | NA | NA | 16 | NA | NA | 64 | NA | NA | 32 | NA | NA | 32 | NA | NA | 15 | NA | NA | NA |
Loop 1 | exec | main.c:111-116 | main | Innermost | 0.01 | 0.01 | 0.04 | 1.00 | 1.00 | 2.00 | 1.00 | 1 | 92.86 | 46.88 | 0.79 | 4.58 | 18.32 | 0.22 | 18.32 | 0.01 | 4.00 | 4.00 | 4.00 | 2.00 | 4.00 | 730 | 49 | 960.5 | 1872 | 7 | 19.96 | 210.69 | 2.62 | 0.00 | 3.49 | 16 | 21.83 | 0.00 | 64 | 0.00 | 6.99 | 32 | 21.83 | NA | 32 | NA | NA | 15 | NA | NA | NA |
Bucket 5 | | main.c:111-116 | main | | | | 97.46 | 1.00 | 1.00 | 2.00 | 1.00 | 1 | 92.86 | 46.88 | 0.79 | 4.58 | 18.32 | 0.22 | 18.32 | 0.01 | 4.00 | 4.00 | 4.00 | 2.00 | 4.00 | | | | | | | | 2.62 | 0.00 | 3.49 | 16 | 21.83 | 0.00 | 64 | 0.00 | 6.99 | 32 | 21.83 | NA | 32 | NA | NA | 15 | NA | NA | NA |
Bucket 4 | | main.c:111-116 | main | | | | 0.7 | 1.00 | 1.00 | 2.00 | 1.00 | 1 | 92.86 | 46.88 | NA | NA | NA | NA | NA | NA | 4.00 | 4.00 | 4.00 | 2.00 | 4.00 | | | | | | | | NA | NA | NA | 16 | NA | NA | 64 | NA | NA | 32 | NA | NA | 32 | NA | NA | 15 | NA | NA | NA |