
exec - 2024-04-05 17:21:11 - MAQAO 2.19.4


Global Metrics

Total Time (s): 48.73
Profiled Time (s): 37.90
Time in analyzed loops (%): 73.6
Time in analyzed innermost loops (%): 58.3
Time in user code (%): 73.7
Compilation Options Score (%): 100
Array Access Efficiency (%): Not Available

Potential Speedups

Perfect Flow Complexity: 1.02
Perfect OpenMP + MPI + Pthread: 1.01
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution: 1.17
No Scalar Integer
  Potential Speedup: 1.10
  Nb Loops to get 80%: 9
FP Vectorised
  Potential Speedup: 1.00
  Nb Loops to get 80%: 1
Fully Vectorised
  Potential Speedup: 2.13
  Nb Loops to get 80%: 20
FP Arithmetic Only
  Potential Speedup: 1.24
  Nb Loops to get 80%: 21

CQA Potential Speedups Summary

Loop Based Profile

Innermost Loop Based Profile

Application Categorization

Compilation Options

Source Object | Issue
exec
IJVector_parcsr.c
par_coarsen.c
par_strength.c
par_coarse_parms.c
par_lr_interp.c
vector.c
random.c
csr_matvec.c
IJMatrix_parcsr.c
amg.c
csr_matop.c
par_csr_matop.c
ams.c
par_interp.c
par_multi_interp.c

Loop Path Count Profile

Cumulated Speedup If No Scalar Integer

Cumulated Speedup If FP Vectorized

Cumulated Speedup If Fully Vectorized

Cumulated Speedup If FP Arithmetic Only

Experiment Summary

Application: /home/hbollore/qaas-runs/171-233-5044/intel/AMG/run/binaries/gcc_4/exec
Timestamp: 2024-04-05 17:21:11
Universal Timestamp: 1712337671
Number of processes observed: 1
Number of threads observed: 64
Experiment Type: MPI; OpenMP
Machine: ip-172-31-42-13
Architecture: aarch64
Micro Architecture: ARM_NEOVERSE_V1
OS Version: Linux 6.5.0-1016-aws #16~22.04.1-Ubuntu SMP Wed Mar 13 20:57:51 UTC 2024
Architecture used during static analysis: aarch64
Micro Architecture used during static analysis: ARM_NEOVERSE_V1
Frequency Driver: NA
Frequency Governor: NA
Huge Pages: madvise
Hyperthreading: off
Number of sockets: 1
Number of cores per socket: 64
Compilation Options: exec: GNU C17 11.4.0 -mlittle-endian -mabi=lp64 -mcpu=zeus+crypto+sha3+sm4+noprofile -g -O3 -O3 -fno-tree-vectorize -fno-openmp-simd -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fasynchronous-unwind-tables -fstack-protector-strong -fstack-clash-protection

Configuration Summary

Dataset
Run Command: <executable> -n 400 400 400
MPI Command: mpirun --bind-to socket -np 1
Number Processes: 1
Number Nodes: 1
Filter: Not Used
Profile Start: Not Used
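
The Run Command and MPI Command fields can be combined into a single launch line. The following is a minimal sketch of that assembly, not the command MAQAO itself issued. It assumes that `<executable>` stands for the binary path listed in the Application field, and that the 64 observed threads were requested through `OMP_NUM_THREADS`; the report does not state how the thread count was set. The helper name `build_launch_command` is hypothetical.

```python
import shlex

# Values copied from the Configuration Summary.
mpi_command = "mpirun --bind-to socket -np 1"
run_arguments = "-n 400 400 400"

# Assumption: "<executable>" is the binary from the Application field
# of the Experiment Summary.
executable = "/home/hbollore/qaas-runs/171-233-5044/intel/AMG/run/binaries/gcc_4/exec"

# Assumption: the thread count matches "Number of threads observed";
# the report does not say which variable (if any) was used to set it.
environment = {"OMP_NUM_THREADS": "64"}


def build_launch_command(mpi, exe, args):
    """Return the argv list: MPI launcher, then the binary, then its arguments."""
    return shlex.split(mpi) + [exe] + shlex.split(args)


argv = build_launch_command(mpi_command, executable, run_arguments)
env_prefix = " ".join(f"{name}={value}" for name, value in environment.items())

# Print a shell-style line for inspection; nothing is executed here.
print(f"{env_prefix} {shlex.join(argv)}")
```

Keeping the launcher, binary, and arguments as separate pieces makes it easy to change the process count or problem size (`-n 400 400 400`) when repeating the run with different settings.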