Function: hypre_BoomerAMGCoarsenPMIS._omp_fn.7 | Module: exec | Source: par_coarsen.c:2435-2477 [...] | Coverage: 1.36% |
---|
Function: hypre_BoomerAMGCoarsenPMIS._omp_fn.7 | Module: exec | Source: par_coarsen.c:2435-2477 [...] | Coverage: 1.36% |
---|
/home/hbollore/qaas/qaas-runs/169-817-3176/intel/AMG/build/AMG/AMG/parcsr_ls/par_coarsen.c: 2435 - 2477 |
-------------------------------------------------------------------------------- |
2435: #pragma omp parallel for private(ig, i, jS, j) HYPRE_SMP_SCHEDULE |
[...] |
2446: if(measure_array[i]<1.) CF_marker[i]= F_PT; |
[...] |
2453: if (CF_marker[i] > 0) CF_marker[i] = C_PT; |
[...] |
2465: for (jS = S_diag_i[i]; jS < S_diag_i[i+1]; jS++) |
2466: { |
2467: /* j is the column number, or the local number of the point influencing i */ |
2468: j = S_diag_j[jS]; |
2469: if (CF_marker[j] > 0) /* j is a C-point */ |
2470: CF_marker[i] = F_PT; |
2471: } |
2472: /* now the external part */ |
2473: for (jS = S_offd_i[i]; jS < S_offd_i[i+1]; jS++) |
2474: { |
2475: j = S_offd_j[jS]; |
2476: if (CF_marker_offd[j] > 0) /* j is a C-point */ |
2477: CF_marker[i] = F_PT; |
0x4215e4 STP X29, X30, [SP, #976]! |
0x4215e8 ADD X29, SP, #0 |
0x4215ec STP X19, X20, [SP, #16] |
0x4215f0 ORR X20, XZR, X0 |
0x4215f4 LDR X0, [X0, #64] |
0x4215f8 STR X21, [SP, #32] |
0x4215fc LDR X21, [X0] |
0x421600 BL 40f400 |
0x421604 SBFM X19, X0, #0, #31 |
0x421608 BL 40f150 |
0x42160c SBFM X1, X0, #0, #31 |
0x421610 SDIV X8, X21, X19 |
0x421614 MSUB X2, X8, X19, X21 |
0x421618 CMP X1, X2 |
0x42161c B.LT 42174c |
0x421620 MADD X4, X8, X1, X2 |
0x421624 ADD X5, X8, X4 |
0x421628 CMP X4, X5 |
0x42162c B.GE 42172c |
0x421630 LDP X14, X6, [X20, #48] |
0x421634 FMOV D1, #1.0000000 |
0x421638 MOVN X9, #0 |
0x42163c MOVZ X15, #1 |
0x421640 LDP X17, X12, [X20] |
0x421644 ADD X1, X6, X4,LSL #3 |
0x421648 ADD X8, X6, X5,LSL #3 |
0x42164c LDP X16, X11, [X20, #16] |
0x421650 LDP X3, X10, [X20, #32] |
0x421654 B 421674 |
(156) 0x421658 LDR X30, [X3, X18,LSL #3] |
(156) 0x42165c CMP X30, #0 |
(156) 0x421660 B.LE 421690 |
(156) 0x421664 STR X15, [X3, X18,LSL #3] |
(156) 0x421668 ADD X1, X1, #8 |
(156) 0x42166c CMP X8, X1 |
(156) 0x421670 B.EQ 42172c |
(156) 0x421674 LDR X18, [X1] |
(156) 0x421678 LDR D0, [X14, X18,LSL #3] |
(156) 0x42167c UBFM X13, X18, #61, #60 |
(156) 0x421680 ADD X7, X3, X13 |
(156) 0x421684 FCMPE D0, D1 |
(156) 0x421688 B.GE 421658 |
(155) 0x42168c STR X9, [X3, X18,LSL #3] |
(155) 0x421690 ADD X20, X13, #8 |
(155) 0x421694 LDR X0, [X17, X18,LSL #3] |
(155) 0x421698 ADD X21, X17, X20 |
(155) 0x42169c LDR X4, [X17, X20] |
(155) 0x4216a0 CMP X0, X4 |
(155) 0x4216a4 B.GE 4216cc |
(153) 0x4216a8 LDR X19, [X12, X0,LSL #3] |
(153) 0x4216ac LDR X2, [X3, X19,LSL #3] |
(153) 0x4216b0 CMP X2, #0 |
(153) 0x4216b4 B.LE 42173c |
(153) 0x4216b8 STR X9, [X7] |
(153) 0x4216bc ADD X0, X0, #1 |
(153) 0x4216c0 LDR X4, [X21] |
(153) 0x4216c4 CMP X0, X4 |
(153) 0x4216c8 B.LT 4216a8 |
(155) 0x4216cc LDR X13, [X16, X18,LSL #3] |
(155) 0x4216d0 ADD X5, X16, X20 |
(155) 0x4216d4 LDR X30, [X16, X20] |
(155) 0x4216d8 CMP X13, X30 |
(155) 0x4216dc B.GE 421668 |
(157) 0x4216e0 LDR X6, [X11, X13,LSL #3] |
(157) 0x4216e4 LDR X18, [X10, X6,LSL #3] |
(157) 0x4216e8 CMP X18, #0 |
(157) 0x4216ec B.LE 421714 |
(158) 0x4216f0 STR X9, [X7] |
(158) 0x4216f4 ADD X13, X13, #1 |
(158) 0x4216f8 LDR X30, [X5] |
(158) 0x4216fc CMP X13, X30 |
(158) 0x421700 B.GE 421668 |
(158) 0x421704 LDR X6, [X11, X13,LSL #3] |
(158) 0x421708 LDR X18, [X10, X6,LSL #3] |
(158) 0x42170c CMP X18, #0 |
(158) 0x421710 B.GT 4216f0 |
(157) 0x421714 ADD X13, X13, #1 |
(157) 0x421718 CMP X13, X30 |
(157) 0x42171c B.LT 4216e0 |
(155) 0x421720 ADD X1, X1, #8 |
(155) 0x421724 CMP X8, X1 |
(155) 0x421728 B.NE 421674 |
(154) 0x42172c LDP X19, X20, [SP, #16] |
(154) 0x421730 LDR X21, [SP, #32] |
(154) 0x421734 LDP X29, X30, [SP], #48 |
(154) 0x421738 RET |
(152) 0x42173c ADD X0, X0, #1 |
(152) 0x421740 CMP X0, X4 |
(152) 0x421744 B.LT 4216a8 |
(152) 0x421748 B 4216cc |
0x42174c ADD X8, X8, #1 |
0x421750 MOVZ X2, #0 |
0x421754 B 421620 |
0x421758 HINT #0 |
0x42175c HINT #0 |
Coverage (%) | Name | Source Location | Module |
---|---|---|---|
►92.86+ | GOMP_parallel | libomp.so | |
○ | hypre_BoomerAMGCoarsenPMIS | par_coarsen.c:2492 | exec |
○ | hypre_BoomerAMGSetup | par_amg_setup.c:612 | exec |
○ | hypre_PCGSetup | pcg.c:234 | exec |
○ | main | amg.c:398 | exec |
○ | __libc_start_main | libc-2.31.so | |
○ | _start | amg.c:599 | exec |
►4.76+ | GOMP_parallel | libomp.so | |
○ | hypre_BoomerAMGCoarsenPMIS | par_coarsen.c:2492 | exec |
○ | hypre_BoomerAMGSetup | par_amg_setup.c:623 | exec |
○ | hypre_PCGSetup | pcg.c:234 | exec |
○ | main | amg.c:398 | exec |
○ | __libc_start_main | libc-2.31.so | |
○ | _start | amg.c:599 | exec |
►2.38+ | GOMP_parallel | libomp.so | |
○ | hypre_BoomerAMGCoarsenPMIS | par_coarsen.c:2492 | exec |
○ | hypre_BoomerAMGSetup | par_amg_setup.c:623 | exec |
○ | hypre_PCGSetup | pcg.c:234 | exec |
○ | main | amg.c:398 | exec |
○ | __libc_start_main | libc-2.31.so | |
○ | _start | amg.c:599 | exec |
Path / |
Source file and lines | par_coarsen.c:2435-2477 |
Module | exec |
nb instructions | 34 |
loop length | 136 |
nb stack references | 0 |
front end | 4.00 cycles |
P0 | P1 | P2 | P3 | P4 | P5 | P6 | P7 | P8 | P9 | P10 | P11 | P12 | P13 | P14 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
uops | 3.00 | 3.00 | 4.75 | 4.75 | 4.75 | 4.75 | 0.25 | 0.25 | 0.25 | 0.25 | 3.17 | 2.83 | 3.00 | 1.50 | 1.50 |
cycles | 3.00 | 3.00 | 4.75 | 4.75 | 4.75 | 4.75 | 0.25 | 0.25 | 0.25 | 0.25 | 3.17 | 2.83 | 3.00 | 1.50 | 1.50 |
Cycles executing div or sqrt instructions | 1.00-0.50 |
Front-end | 4.00 |
Overall L1 | 4.75 |
all | 0% |
load | NA (no load vectorizable/vectorized instructions) |
store | NA (no store vectorizable/vectorized instructions) |
mul | NA (no mul vectorizable/vectorized instructions) |
add-sub | 0% |
fma | 0% |
div/sqrt | NA (no div/sqrt vectorizable/vectorized instructions) |
other | 0% |
Instruction | Nb FU | P0 | P1 | P2 | P3 | P4 | P5 | P6 | P7 | P8 | P9 | P10 | P11 | P12 | P13 | P14 | Latency | Recip. throughput |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
STP X29, X30, [SP, #976]! | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0.50 | 0.50 | 0 | 0.50 | 0.50 | 1 | 0.50 |
ADD X29, SP, #0 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
STP X19, X20, [SP, #16] | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.50 | 0.50 | 0 | 0.50 | 0.50 | 1 | 0.50 |
ORR X20, XZR, X0 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
LDR X0, [X0, #64] | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.33 | 0.33 | 0.33 | 0 | 0 | 4 | 0.33 |
STR X21, [SP, #32] | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.50 | 0.50 | 0 | 0.50 | 0.50 | 1 | 0.50 |
LDR X21, [X0] | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.33 | 0.33 | 0.33 | 0 | 0 | 4 | 0.33 |
BL 40f400 <@plt_start@+0x400> | 1 | 0.50 | 0.50 | 0.50 | 0.50 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.50 |
SBFM X19, X0, #0, #31 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
BL 40f150 <@plt_start@+0x150> | 1 | 0.50 | 0.50 | 0.50 | 0.50 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.50 |
SBFM X1, X0, #0, #31 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
SDIV X8, X21, X19 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5-20 | 1-0.50 |
MSUB X2, X8, X19, X21 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 |
CMP X1, X2 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.33 |
B.LT 42174c <hypre_BoomerAMGCoarsenPMIS._omp_fn.7+0x168> | 1 | 0.50 | 0.50 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.50 |
MADD X4, X8, X1, X2 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 |
ADD X5, X8, X4 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
CMP X4, X5 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.33 |
B.GE 42172c <hypre_BoomerAMGCoarsenPMIS._omp_fn.7+0x148> | 1 | 0.50 | 0.50 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.50 |
LDP X14, X6, [X20, #48] | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.33 | 0.33 | 0.33 | 0 | 0 | 4 | 1 |
FMOV D1, #1.0000000 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 2 | 0.25 |
MOVN X9, #0 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
MOVZ X15, #1 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
LDP X17, X12, [X20] | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.33 | 0.33 | 0.33 | 0 | 0 | 4 | 1 |
ADD X1, X6, X4,LSL #3 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
ADD X8, X6, X5,LSL #3 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
LDP X16, X11, [X20, #16] | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.33 | 0.33 | 0.33 | 0 | 0 | 4 | 1 |
LDP X3, X10, [X20, #32] | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.33 | 0.33 | 0.33 | 0 | 0 | 4 | 1 |
B 421674 <hypre_BoomerAMGCoarsenPMIS._omp_fn.7+0x90> | 1 | 0.50 | 0.50 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.50 |
ADD X8, X8, #1 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
MOVZ X2, #0 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
B 421620 <hypre_BoomerAMGCoarsenPMIS._omp_fn.7+0x3c> | 1 | 0.50 | 0.50 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.50 |
HINT #0 | ||||||||||||||||||
HINT #0 |
Source file and lines | par_coarsen.c:2435-2477 |
Module | exec |
nb instructions | 34 |
loop length | 136 |
nb stack references | 0 |
front end | 4.00 cycles |
P0 | P1 | P2 | P3 | P4 | P5 | P6 | P7 | P8 | P9 | P10 | P11 | P12 | P13 | P14 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
uops | 3.00 | 3.00 | 4.75 | 4.75 | 4.75 | 4.75 | 0.25 | 0.25 | 0.25 | 0.25 | 3.17 | 2.83 | 3.00 | 1.50 | 1.50 |
cycles | 3.00 | 3.00 | 4.75 | 4.75 | 4.75 | 4.75 | 0.25 | 0.25 | 0.25 | 0.25 | 3.17 | 2.83 | 3.00 | 1.50 | 1.50 |
Cycles executing div or sqrt instructions | 1.00-0.50 |
Front-end | 4.00 |
Overall L1 | 4.75 |
all | 0% |
load | NA (no load vectorizable/vectorized instructions) |
store | NA (no store vectorizable/vectorized instructions) |
mul | NA (no mul vectorizable/vectorized instructions) |
add-sub | 0% |
fma | 0% |
div/sqrt | NA (no div/sqrt vectorizable/vectorized instructions) |
other | 0% |
Instruction | Nb FU | P0 | P1 | P2 | P3 | P4 | P5 | P6 | P7 | P8 | P9 | P10 | P11 | P12 | P13 | P14 | Latency | Recip. throughput |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
STP X29, X30, [SP, #976]! | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0.50 | 0.50 | 0 | 0.50 | 0.50 | 1 | 0.50 |
ADD X29, SP, #0 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
STP X19, X20, [SP, #16] | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.50 | 0.50 | 0 | 0.50 | 0.50 | 1 | 0.50 |
ORR X20, XZR, X0 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
LDR X0, [X0, #64] | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.33 | 0.33 | 0.33 | 0 | 0 | 4 | 0.33 |
STR X21, [SP, #32] | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.50 | 0.50 | 0 | 0.50 | 0.50 | 1 | 0.50 |
LDR X21, [X0] | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.33 | 0.33 | 0.33 | 0 | 0 | 4 | 0.33 |
BL 40f400 <@plt_start@+0x400> | 1 | 0.50 | 0.50 | 0.50 | 0.50 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.50 |
SBFM X19, X0, #0, #31 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
BL 40f150 <@plt_start@+0x150> | 1 | 0.50 | 0.50 | 0.50 | 0.50 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.50 |
SBFM X1, X0, #0, #31 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
SDIV X8, X21, X19 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5-20 | 1-0.50 |
MSUB X2, X8, X19, X21 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 |
CMP X1, X2 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.33 |
B.LT 42174c <hypre_BoomerAMGCoarsenPMIS._omp_fn.7+0x168> | 1 | 0.50 | 0.50 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.50 |
MADD X4, X8, X1, X2 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 |
ADD X5, X8, X4 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
CMP X4, X5 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.33 |
B.GE 42172c <hypre_BoomerAMGCoarsenPMIS._omp_fn.7+0x148> | 1 | 0.50 | 0.50 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.50 |
LDP X14, X6, [X20, #48] | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.33 | 0.33 | 0.33 | 0 | 0 | 4 | 1 |
FMOV D1, #1.0000000 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 2 | 0.25 |
MOVN X9, #0 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
MOVZ X15, #1 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
LDP X17, X12, [X20] | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.33 | 0.33 | 0.33 | 0 | 0 | 4 | 1 |
ADD X1, X6, X4,LSL #3 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
ADD X8, X6, X5,LSL #3 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
LDP X16, X11, [X20, #16] | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.33 | 0.33 | 0.33 | 0 | 0 | 4 | 1 |
LDP X3, X10, [X20, #32] | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.33 | 0.33 | 0.33 | 0 | 0 | 4 | 1 |
B 421674 <hypre_BoomerAMGCoarsenPMIS._omp_fn.7+0x90> | 1 | 0.50 | 0.50 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.50 |
ADD X8, X8, #1 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
MOVZ X2, #0 | 1 | 0 | 0 | 0.25 | 0.25 | 0.25 | 0.25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.25 |
B 421620 <hypre_BoomerAMGCoarsenPMIS._omp_fn.7+0x3c> | 1 | 0.50 | 0.50 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.50 |
HINT #0 | ||||||||||||||||||
HINT #0 |
Name | Coverage (%) | Time (s) |
---|---|---|
▼hypre_BoomerAMGCoarsenPMIS._omp_fn.7– | 1.36 | 0.21 |
▼Loop 154 - par_coarsen.c:2435-2477 - exec– | 0 | 0 |
▼Loop 152 - par_coarsen.c:2446-2477 - exec– | 0.19 | 0.03 |
▼Loop 153 - par_coarsen.c:2446-2477 - exec– | 0.88 | 0.13 |
▼Loop 155 - par_coarsen.c:2446-2477 - exec– | 0.06 | 0.01 |
○Loop 156 - par_coarsen.c:2446-2453 - exec | 0.23 | 0.03 |
○Loop 157 - par_coarsen.c:2473-2476 - exec | 0 | 0 |
○Loop 158 - par_coarsen.c:2473-2477 - exec | 0 | 0 |