Help is available by moving the cursor above any
symbol or by checking MAQAO website.
| Metric | r0 | r1 | r2 | r3 |
|---|
| Total Time (s) | 42.03 | 42.83 | 42.09 | 42.37 |
| Max (Thread Active Time) (s) | 18.54 | 16.02 | 21.47 | 16.08 |
| Average Active Time (s) | 15.59 | 12.42 | 15.58 | 12.40 |
| Activity Ratio (%) | 37.2 | 29.0 | 37.1 | 29.3 |
| Average number of active threads | 35.621 | 27.834 | 35.545 | 28.102 |
| Affinity Stability (%) | 95.5 | 90.8 | 95.5 | 91.7 |
| Time in analyzed loops (%) | 79.2 | 98.4 | 79.0 | 98.5 |
| Time in analyzed innermost loops (%) | 72.3 | 90.3 | 72.6 | 55.7 |
| Time in user code (%) | 79.3 | 98.5 | 79.1 | 98.5 |
| Compilation Options Score (%) | 100 | 100 | 100 | 100 |
| Array Access Efficiency (%) | 60.6 | 35.1 | 65.0 | 48.9 |
|
| Potential Speedups |
| Perfect Flow Complexity | 1.00 | 1.02 | 1.01 | 1.02 |
| Perfect OpenMP + MPI + Pthread | 1.15 | 1.01 | 1.31 | 1.01 |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.50 | 1.31 | 1.74 | 1.31 |
| No Scalar Integer | Potential Speedup | 1.04 | 1.04 | 1.05 | 1.12 |
| Nb Loops to get 80% | 11 | 12 | 11 | 6 |
| FP Vectorised | Potential Speedup | 1.29 | 1.01 | 1.28 | 1.02 |
| Nb Loops to get 80% | 4 | 5 | 3 | 6 |
| Fully Vectorised | Potential Speedup | 1.37 | 1.09 | 1.30 | 1.15 |
| Nb Loops to get 80% | 17 | 23 | 19 | 17 |
| Only FP Arithmetic | Potential Speedup | 1.09 | 1.14 | 1.10 | 1.62 |
| Nb Loops to get 80% | 25 | 23 | 23 | 13 |
| Source Object | Issue |
| ▼exec– | |
| ▼par_strength.c– | |
| ○ | |
| ▼par_lr_interp.c– | |
| ○ | |
| ▼vector.c– | |
| ○ | |
| ▼par_multi_interp.c– | |
| ○ | |
| ▼csr_matvec.c– | |
| ○ | |
| ▼IJMatrix_parcsr.c– | |
| ○ | |
| ▼amg.c– | |
| ○ | |
| ▼csr_matop.c– | |
| ○ | |
| ▼par_csr_matop.c– | |
| ○ | |
| ▼random.c– | |
| ○ | |
| ▼par_coarsen.c– | |
| ○ | |
| ▼ams.c– | |
| ○ | |
| Source Object | Issue |
| ▼exec– | |
| ▼par_strength.c– | |
| ○ | |
| ▼par_lr_interp.c– | |
| ○ | |
| ▼vector.c– | |
| ○ | |
| ▼par_coarsen.c– | |
| ○ | |
| ▼csr_matvec.c– | |
| ○ | |
| ▼IJMatrix_parcsr.c– | |
| ○ | |
| ▼amg.c– | |
| ○ | |
| ▼csr_matop.c– | |
| ○ | |
| ▼par_csr_matop.c– | |
| ○ | |
| ▼random.c– | |
| ○ | |
| ▼ams.c– | |
| ○ | |
| ▼par_multi_interp.c– | |
| ○ | |
| Source Object | Issue |
| ▼exec– | |
| ▼par_strength.c– | |
| ○ | |
| ▼par_lr_interp.c– | |
| ○ | |
| ▼vector.c– | |
| ○ | |
| ▼random.c– | |
| ○ | |
| ▼csr_matvec.c– | |
| ○ | |
| ▼IJMatrix_parcsr.c– | |
| ○ | |
| ▼par_coarsen.c– | |
| ○ | |
| ▼csr_matop.c– | |
| ○ | |
| ▼par_csr_matop.c– | |
| ○ | |
| ▼ams.c– | |
| ○ | |
| ▼amg.c– | |
| ○ | |
| ▼par_multi_interp.c– | |
| ○ | |
| Source Object | Issue |
| ▼exec– | |
| ▼par_strength.c– | |
| ○ | |
| ▼par_lr_interp.c– | |
| ○ | |
| ▼vector.c– | |
| ○ | |
| ▼par_coarsen.c– | |
| ○ | |
| ▼csr_matvec.c– | |
| ○ | |
| ▼IJMatrix_parcsr.c– | |
| ○ | |
| ▼amg.c– | |
| ○ | |
| ▼csr_matop.c– | |
| ○ | |
| ▼par_csr_matop.c– | |
| ○ | |
| ▼random.c– | |
| ○ | |
| ▼ams.c– | |
| ○ | |
| ▼par_multi_interp.c– | |
| ○ | |
| r0 | r1 | r2 | r3 |
| Experiment Name | | | | |
| Application | /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/run/oneview_runs/defaults/orig/exec | /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/run/base_runs/defaults/gcc/exec | /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/run/binaries/armclang_5/exec | /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/run/binaries/gcc_2/exec |
| Timestamp | 2025-03-10 14:33:51 | 2025-03-10 14:37:03 | 2025-03-10 15:13:03 | 2025-03-10 15:14:05 |
| Experiment Type | MPI; OpenMP; | same as r0 | same as r0 | same as r0 |
| Machine | ip-172-31-47-249.ec2.internal | same as r0 | same as r0 | same as r0 |
| Architecture | aarch64 | same as r0 | same as r0 | same as r0 |
| Micro Architecture | ARM_NEOVERSE_V2 | same as r0 | same as r0 | same as r0 |
| Model Name | | | | |
| Cache Size | | | | |
| Number of Cores | | | | |
| Maximal Frequency | 0 GHz | same as r0 | same as r0 | same as r0 |
| OS Version | Linux 6.1.109-118.189.amzn2023.aarch64 #1 SMP Tue Sep 10 08:58:40 UTC 2024 | same as r0 | same as r0 | same as r0 |
| Architecture used during static analysis | aarch64 | same as r0 | same as r0 | same as r0 |
| Micro Architecture used during static analysis | ARM_NEOVERSE_V2 | same as r0 | same as r0 | same as r0 |
| Compilation Options |
exec: Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/utilities -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/parcsr_mv -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/parcsr_ls -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/IJ_mv -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/krylov -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/seq_mv -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG -O3 -mcpu=native -Wno-error=implicit-function-declaration -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-command-line -fopenmp=libomp -D TIMER_USE_MPI -D HYPRE_USING_OPENMP -D HYPRE_HOPSCOTCH -D HYPRE_USING_PERSISTENT_COMM -D HYPRE_BIGINT -MD -MT CMakeFiles/parcsr_ls.dir/AMG/parcsr_ls/ams.c.o -MF CMakeFiles/parcsr_ls.dir/AMG/parcsr_ls/ams.c.o.d -o CMakeFiles/parcsr_ls.dir/AMG/parcsr_ls/ams.c.o -c /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/parcsr_ls/ams.c -I /home/hbollore/soft/openmpi-5.0.6-armsuite/include | exec: GNU C17 14.2.0 -mlittle-endian -mabi=lp64 -mcpu=neoverse-v2+crc+sve2-aes+sve2-sha3+nossbs -g -O3 -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops | exec: Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/utilities -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/parcsr_mv -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/parcsr_ls -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/IJ_mv -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/krylov -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/seq_mv -I /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG -Wno-error=implicit-function-declaration -O2 -mcpu=neoverse-v2 -armpl -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-command-line -fopenmp=libomp -D TIMER_USE_MPI -D HYPRE_USING_OPENMP -D HYPRE_HOPSCOTCH -D HYPRE_USING_PERSISTENT_COMM -D HYPRE_BIGINT -MD -MT CMakeFiles/parcsr_ls.dir/AMG/parcsr_ls/ams.c.o -MF CMakeFiles/parcsr_ls.dir/AMG/parcsr_ls/ams.c.o.d -o CMakeFiles/parcsr_ls.dir/AMG/parcsr_ls/ams.c.o -c /home/hbollore/qaas/qaas-runs/174-161-6712/intel/AMG/build/AMG/AMG/parcsr_ls/ams.c -I /home/hbollore/soft/openmpi-5.0.6-armsuite/include | exec: GNU C17 14.2.0 -mcpu=neoverse-v2 -msve-vector-bits=128 -mlittle-endian -mabi=lp64 -g -O3 -O3 -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops |
| Number of processes observed | 1 | same as r0 | same as r0 | same as r0 |
| Number of threads observed | 96 | same as r0 | same as r0 | same as r0 |
| Frequency Driver | NA | same as r0 | same as r0 | same as r0 |
| Frequency Governor | NA | same as r0 | same as r0 | same as r0 |
| Huge Pages | madvise | same as r0 | same as r0 | same as r0 |
| Hyperthreading | off | same as r0 | same as r0 | same as r0 |
| Number of sockets | 1 | same as r0 | same as r0 | same as r0 |
| Number of cores per socket | 96 | same as r0 | same as r0 | same as r0 |
| MAQAO version | 2.21.2 | same as r0 | same as r0 | same as r0 |
| MAQAO build | 4d8b9c1e436de89978b8a89b2318f52a1555c3be::20250303-172211 | same as r0 | same as r0 | same as r0 |
| Comments | | same as r0 | same as r0 | same as r0 |