Loop Id: 169 | Module: exec | Source: par_coarsen.c:2135-2136 | Coverage: 2.89% |
---|
Loop Id: 169 | Module: exec | Source: par_coarsen.c:2135-2136 | Coverage: 2.89% |
---|
0x4225b0 ORR X11, XZR, X0 |
0x4225b4 LDR X12, [X11], #8 [4] |
0x4225b8 ADD X13, X9, X12,LSL #3 |
0x4225bc LDADD X8, X1, [X13] [8] |
0x4225c0 LDR X14, [X0, #8] [10] |
0x4225c4 ADD X15, X9, X14,LSL #3 |
0x4225c8 LDADD X8, X1, [X15] [5] |
0x4225cc LDR X16, [X11, #8] [4] |
0x4225d0 ADD X17, X9, X16,LSL #3 |
0x4225d4 LDADD X8, X1, [X17] [6] |
0x4225d8 LDR X18, [X0, #24] [10] |
0x4225dc ADD X30, X9, X18,LSL #3 |
0x4225e0 LDADD X8, X1, [X30] [3] |
0x4225e4 LDR X20, [X0, #32] [10] |
0x4225e8 ADD X1, X9, X20,LSL #3 |
0x4225ec LDADD X8, X1, [X1] [7] |
0x4225f0 LDR X21, [X0, #40] [10] |
0x4225f4 ADD X19, X9, X21,LSL #3 |
0x4225f8 LDADD X8, X1, [X19] [9] |
0x4225fc LDR X4, [X0, #48] [10] |
0x422600 ADD X3, X9, X4,LSL #3 |
0x422604 LDADD X8, X1, [X3] [1] |
0x422608 LDR X2, [X0, #56] [10] |
0x42260c ADD X6, X9, X2,LSL #3 |
0x422610 LDADD X8, X1, [X6] [2] |
0x422614 ADD X0, X0, #64 |
0x422618 CMP X10, X0 |
0x42261c B.NE 4225b0 |
/home/hbollore/qaas/qaas-runs/169-817-3176/intel/AMG/build/AMG/AMG/parcsr_ls/par_coarsen.c: 2135 - 2136 |
-------------------------------------------------------------------------------- |
2135: #pragma omp atomic |
2136: measure_array_temp[S_diag_j[i]]++; |
Coverage (%) | Name | Source Location | Module |
---|---|---|---|
►96.63+ | GOMP_parallel | libomp.so | |
○ | hypre_BoomerAMGCoarsenPMIS | par_coarsen.c:2132 | exec |
○ | hypre_BoomerAMGSetup | par_amg_setup.c:612 | exec |
○ | hypre_PCGSetup | pcg.c:234 | exec |
○ | main | amg.c:398 | exec |
○ | __libc_start_main | libc-2.31.so | |
○ | _start | amg.c:599 | exec |
►3.37+ | GOMP_parallel | libomp.so | |
○ | hypre_BoomerAMGCoarsenPMIS | par_coarsen.c:2132 | exec |
○ | hypre_BoomerAMGSetup | par_amg_setup.c:623 | exec |
○ | hypre_PCGSetup | pcg.c:234 | exec |
○ | main | amg.c:398 | exec |
○ | __libc_start_main | libc-2.31.so | |
○ | _start | amg.c:599 | exec |
Path / |
Metric | Value |
---|---|
CQA speedup if no scalar integer | NA |
CQA speedup if FP arith vectorized | NA |
CQA speedup if fully vectorized | NA |
CQA speedup if no inter-iteration dependency | NA |
CQA speedup if next bottleneck killed | NA |
Bottlenecks | NA |
Function | hypre_BoomerAMGCoarsenPMIS._omp_fn.2 |
Source | par_coarsen.c:2135-2136 |
Source loop unroll info | NA |
Source loop unroll confidence level | NA |
Unroll/vectorization loop type | NA |
Unroll factor | NA |
CQA cycles | NA |
CQA cycles if no scalar integer | NA |
CQA cycles if FP arith vectorized | NA |
CQA cycles if fully vectorized | NA |
Front-end cycles | NA |
DIV/SQRT cycles | NA |
P0 cycles | NA |
P1 cycles | NA |
P2 cycles | NA |
P3 cycles | NA |
P4 cycles | NA |
P5 cycles | NA |
P6 cycles | NA |
P7 cycles | NA |
P8 cycles | NA |
P9 cycles | NA |
P10 cycles | NA |
P11 cycles | NA |
P12 cycles | NA |
P13 cycles | NA |
P14 cycles | NA |
Inter-iter dependencies cycles | NA |
FE+BE cycles (UFS) | NA |
Stall cycles (UFS) | NA |
Nb insns | NA |
Nb uops | NA |
Nb loads | NA |
Nb stores | NA |
Nb stack references | NA |
FLOP/cycle | NA |
Nb FLOP add-sub | NA |
Nb FLOP mul | NA |
Nb FLOP fma | NA |
Nb FLOP div | NA |
Nb FLOP rcp | NA |
Nb FLOP sqrt | NA |
Nb FLOP rsqrt | NA |
Bytes/cycle | NA |
Bytes prefetched | NA |
Bytes loaded | NA |
Bytes stored | NA |
Stride 0 | NA |
Stride 1 | NA |
Stride n | NA |
Stride unknown | NA |
Stride indirect | NA |
Vectorization ratio all | NA |
Vectorization ratio load | NA |
Vectorization ratio store | NA |
Vectorization ratio mul | NA |
Vectorization ratio add_sub | NA |
Vectorization ratio fma | NA |
Vectorization ratio div_sqrt | NA |
Vectorization ratio other | NA |
Vector-efficiency ratio all | NA |
Vector-efficiency ratio load | NA |
Vector-efficiency ratio store | NA |
Vector-efficiency ratio mul | NA |
Vector-efficiency ratio add_sub | NA |
Vector-efficiency ratio fma | NA |
Vector-efficiency ratio div_sqrt | NA |
Vector-efficiency ratio other | NA |
Path / |
Function | hypre_BoomerAMGCoarsenPMIS._omp_fn.2 |
Source file and lines | par_coarsen.c:2135-2136 |
Module | exec |