Loop Id: 463 | Module: exec | Source: par_coarsen.c:2133-2136 | Coverage: 3.01% |
---|
Loop Id: 463 | Module: exec | Source: par_coarsen.c:2133-2136 | Coverage: 3.01% |
---|
0x421308 LDR X12, [X19] [5] |
0x42130c UBFM X13, X9, #61, #60 |
0x421310 LDR X11, [X20] [1] |
0x421314 ADD X9, X9, #2 |
0x421318 CMP X8, X9 |
0x42131c LDR X12, [X12, X13] [4] |
0x421320 ADD X11, X11, X12,LSL #3 |
0x421324 LDADD X10, X11, [X11] [2] |
0x421328 LDR X12, [X19] [5] |
0x42132c LDR X11, [X20] [1] |
0x421330 ADD X12, X12, X13 |
0x421334 LDR X12, [X12, #8] [3] |
0x421338 ADD X11, X11, X12,LSL #3 |
0x42133c LDADD X10, X11, [X11] [2] |
0x421340 B.NE 421308 |
/home/hbollore/qaas/qaas-runs/169-817-3176/intel/AMG/build/AMG/AMG/parcsr_ls/par_coarsen.c: 2133 - 2136 |
-------------------------------------------------------------------------------- |
2133: for (i=0; i < S_diag_i[num_variables]; i++) |
2134: { |
2135: #pragma omp atomic |
2136: measure_array_temp[S_diag_j[i]]++; |
Coverage (%) | Name | Source Location | Module |
---|---|---|---|
○100.00 | __kmp_invoke_microtask | libomp.so |
Path / |
Metric | Value |
---|---|
CQA speedup if no scalar integer | NA |
CQA speedup if FP arith vectorized | NA |
CQA speedup if fully vectorized | NA |
CQA speedup if no inter-iteration dependency | NA |
CQA speedup if next bottleneck killed | NA |
Bottlenecks | NA |
Function | .omp_outlined..11 |
Source | par_coarsen.c:2133-2136 |
Source loop unroll info | NA |
Source loop unroll confidence level | NA |
Unroll/vectorization loop type | NA |
Unroll factor | NA |
CQA cycles | NA |
CQA cycles if no scalar integer | NA |
CQA cycles if FP arith vectorized | NA |
CQA cycles if fully vectorized | NA |
Front-end cycles | NA |
DIV/SQRT cycles | NA |
P0 cycles | NA |
P1 cycles | NA |
P2 cycles | NA |
P3 cycles | NA |
P4 cycles | NA |
P5 cycles | NA |
P6 cycles | NA |
P7 cycles | NA |
P8 cycles | NA |
P9 cycles | NA |
P10 cycles | NA |
P11 cycles | NA |
P12 cycles | NA |
P13 cycles | NA |
P14 cycles | NA |
Inter-iter dependencies cycles | NA |
FE+BE cycles (UFS) | NA |
Stall cycles (UFS) | NA |
Nb insns | NA |
Nb uops | NA |
Nb loads | NA |
Nb stores | NA |
Nb stack references | NA |
FLOP/cycle | NA |
Nb FLOP add-sub | NA |
Nb FLOP mul | NA |
Nb FLOP fma | NA |
Nb FLOP div | NA |
Nb FLOP rcp | NA |
Nb FLOP sqrt | NA |
Nb FLOP rsqrt | NA |
Bytes/cycle | NA |
Bytes prefetched | NA |
Bytes loaded | NA |
Bytes stored | NA |
Stride 0 | NA |
Stride 1 | NA |
Stride n | NA |
Stride unknown | NA |
Stride indirect | NA |
Vectorization ratio all | NA |
Vectorization ratio load | NA |
Vectorization ratio store | NA |
Vectorization ratio mul | NA |
Vectorization ratio add_sub | NA |
Vectorization ratio fma | NA |
Vectorization ratio div_sqrt | NA |
Vectorization ratio other | NA |
Vector-efficiency ratio all | NA |
Vector-efficiency ratio load | NA |
Vector-efficiency ratio store | NA |
Vector-efficiency ratio mul | NA |
Vector-efficiency ratio add_sub | NA |
Vector-efficiency ratio fma | NA |
Vector-efficiency ratio div_sqrt | NA |
Vector-efficiency ratio other | NA |
Path / |
Function | .omp_outlined..11 |
Source file and lines | par_coarsen.c:2133-2136 |
Module | exec |