Help is available by moving the cursor above any
symbol or by checking MAQAO website.
Metric | r0 | r1 | r2 | r3 |
---|
Total Time (s) | 42.03 | 42.83 | 42.09 | 42.37 |
Max (Thread Active Time) (s) | 18.54 | 16.02 | 21.47 | 16.08 |
Average Active Time (s) | 15.59 | 12.42 | 15.58 | 12.40 |
Activity Ratio (%) | 37.2 | 29.0 | 37.1 | 29.3 |
Average number of active threads | 35.621 | 27.834 | 35.545 | 28.102 |
Affinity Stability (%) | 95.5 | 90.8 | 95.5 | 91.7 |
Time in analyzed loops (%) | 79.2 | 98.4 | 79.0 | 98.5 |
Time in analyzed innermost loops (%) | 72.3 | 90.3 | 72.6 | 55.7 |
Time in user code (%) | 79.3 | 98.5 | 79.1 | 98.5 |
Compilation Options Score (%) | 100 | 100 | 100 | 100 |
Array Access Efficiency (%) | 60.6 | 35.1 | 65.0 | 48.9 |
|
Potential Speedups |
Perfect Flow Complexity | 1.00 | 1.02 | 1.01 | 1.02 |
Perfect OpenMP + MPI + Pthread | 1.15 | 1.01 | 1.31 | 1.01 |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.50 | 1.31 | 1.74 | 1.31 |
No Scalar Integer | Potential Speedup | 1.04 | 1.04 | 1.05 | 1.12 |
Nb Loops to get 80% | 11 | 12 | 11 | 6 |
FP Vectorised | Potential Speedup | 1.29 | 1.01 | 1.28 | 1.02 |
Nb Loops to get 80% | 4 | 5 | 3 | 6 |
Fully Vectorised | Potential Speedup | 1.37 | 1.09 | 1.30 | 1.15 |
Nb Loops to get 80% | 17 | 23 | 19 | 17 |
Only FP Arithmetic | Potential Speedup | 1.09 | 1.14 | 1.10 | 1.62 |
Nb Loops to get 80% | 25 | 23 | 23 | 13 |
Source Object | Issue |
▼exec– | |
▼par_strength.c– | |
○ | |
▼par_lr_interp.c– | |
○ | |
▼vector.c– | |
○ | |
▼par_multi_interp.c– | |
○ | |
▼csr_matvec.c– | |
○ | |
▼IJMatrix_parcsr.c– | |
○ | |
▼amg.c– | |
○ | |
▼csr_matop.c– | |
○ | |
▼par_csr_matop.c– | |
○ | |
▼random.c– | |
○ | |
▼par_coarsen.c– | |
○ | |
▼ams.c– | |
○ | |
Source Object | Issue |
▼exec– | |
▼par_strength.c– | |
○ | |
▼par_lr_interp.c– | |
○ | |
▼vector.c– | |
○ | |
▼par_coarsen.c– | |
○ | |
▼csr_matvec.c– | |
○ | |
▼IJMatrix_parcsr.c– | |
○ | |
▼amg.c– | |
○ | |
▼csr_matop.c– | |
○ | |
▼par_csr_matop.c– | |
○ | |
▼random.c– | |
○ | |
▼ams.c– | |
○ | |
▼par_multi_interp.c– | |
○ | |
Source Object | Issue |
▼exec– | |
▼par_strength.c– | |
○ | |
▼par_lr_interp.c– | |
○ | |
▼vector.c– | |
○ | |
▼random.c– | |
○ | |
▼csr_matvec.c– | |
○ | |
▼IJMatrix_parcsr.c– | |
○ | |
▼par_coarsen.c– | |
○ | |
▼csr_matop.c– | |
○ | |
▼par_csr_matop.c– | |
○ | |
▼ams.c– | |
○ | |
▼amg.c– | |
○ | |
▼par_multi_interp.c– | |
○ | |
Source Object | Issue |
▼exec– | |
▼par_strength.c– | |
○ | |
▼par_lr_interp.c– | |
○ | |
▼vector.c– | |
○ | |
▼par_coarsen.c– | |
○ | |
▼csr_matvec.c– | |
○ | |
▼IJMatrix_parcsr.c– | |
○ | |
▼amg.c– | |
○ | |
▼csr_matop.c– | |
○ | |
▼par_csr_matop.c– | |
○ | |
▼random.c– | |
○ | |
▼ams.c– | |
○ | |
▼par_multi_interp.c– | |
○ | |
| r0 | r1 | r2 | r3 |
Experiment Name | | | | |
Application | /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/run/oneview_runs/defaults/orig/exec | /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/run/base_runs/defaults/gcc/exec | /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/run/binaries/armclang_5/exec | /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/run/binaries/gcc_2/exec |
Timestamp | 2025-03-10 14:33:51 | 2025-03-10 14:37:03 | 2025-03-10 15:13:03 | 2025-03-10 15:14:05 |
Experiment Type | MPI; OpenMP; | same as r0 | same as r0 | same as r0 |
Machine | ip-172-31-47-249.ec2.internal | same as r0 | same as r0 | same as r0 |
Architecture | aarch64 | same as r0 | same as r0 | same as r0 |
Micro Architecture | ARM_NEOVERSE_V2 | same as r0 | same as r0 | same as r0 |
Model Name | | | | |
Cache Size | | | | |
Number of Cores | | | | |
Maximal Frequency | 0 GHz | same as r0 | same as r0 | same as r0 |
OS Version | Linux 6.1.109-118.189.amzn2023.aarch64 #1 SMP Tue Sep 10 08:58:40 UTC 2024 | same as r0 | same as r0 | same as r0 |
Architecture used during static analysis | aarch64 | same as r0 | same as r0 | same as r0 |
Micro Architecture used during static analysis | ARM_NEOVERSE_V2 | same as r0 | same as r0 | same as r0 |
Compilation Options |
exec: Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/utilities -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/parcsr_mv -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/parcsr_ls -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/IJ_mv -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/krylov -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/seq_mv -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG -O3 -mcpu=native -Wno-error=implicit-function-declaration -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-command-line -fopenmp=libomp -D TIMER_USE_MPI -D HYPRE_USING_OPENMP -D HYPRE_HOPSCOTCH -D HYPRE_USING_PERSISTENT_COMM -D HYPRE_BIGINT -MD -MT CMakeFiles/parcsr_ls.dir/AMG/parcsr_ls/ams.c.o -MF CMakeFiles/parcsr_ls.dir/AMG/parcsr_ls/ams.c.o.d -o CMakeFiles/parcsr_ls.dir/AMG/parcsr_ls/ams.c.o -c /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/parcsr_ls/ams.c -I /home/hbollore/soft/openmpi-5.0.6-armsuite/include | exec: GNU C17 14.2.0 -mlittle-endian -mabi=lp64 -mcpu=neoverse-v2+crc+sve2-aes+sve2-sha3+nossbs -g -O3 -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops | exec: Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/utilities -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/parcsr_mv -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/parcsr_ls -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/IJ_mv -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/krylov -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/seq_mv -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG -Wno-error=implicit-function-declaration -O2 -mcpu=neoverse-v2 -armpl -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-command-line -fopenmp=libomp -D TIMER_USE_MPI -D HYPRE_USING_OPENMP -D HYPRE_HOPSCOTCH -D HYPRE_USING_PERSISTENT_COMM -D HYPRE_BIGINT -MD -MT CMakeFiles/parcsr_ls.dir/AMG/parcsr_ls/ams.c.o -MF CMakeFiles/parcsr_ls.dir/AMG/parcsr_ls/ams.c.o.d -o CMakeFiles/parcsr_ls.dir/AMG/parcsr_ls/ams.c.o -c /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/parcsr_ls/ams.c -I /home/hbollore/soft/openmpi-5.0.6-armsuite/include | exec: GNU C17 14.2.0 -mcpu=neoverse-v2 -msve-vector-bits=128 -mlittle-endian -mabi=lp64 -g -O3 -O3 -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops |
Number of processes observed | 1 | same as r0 | same as r0 | same as r0 |
Number of threads observed | 96 | same as r0 | same as r0 | same as r0 |
Frequency Driver | NA | same as r0 | same as r0 | same as r0 |
Frequency Governor | NA | same as r0 | same as r0 | same as r0 |
Huge Pages | madvise | same as r0 | same as r0 | same as r0 |
Hyperthreading | off | same as r0 | same as r0 | same as r0 |
Number of sockets | 1 | same as r0 | same as r0 | same as r0 |
Number of cores per socket | 96 | same as r0 | same as r0 | same as r0 |
MAQAO version | 2.21.2 | same as r0 | same as r0 | same as r0 |
MAQAO build | 4d8b9c1e436de89978b8a89b2318f52a1555c3be::20250303-172211 | same as r0 | same as r0 | same as r0 |
Comments | | same as r0 | same as r0 | same as r0 |