Help is available by moving the cursor above any
symbol or by checking MAQAO website.
| Metric | r0 | r1 | r2 | |
|---|---|---|---|---|
| Total Time (s) | 16.77 | 38.94 | 19.11 | |
| Profiled Time (s) | 15.42 | 36.88 | 18.30 | |
| Time in analyzed loops (%) | 99.2 | 99.5 | 99.2 | |
| Time in analyzed innermost loops (%) | 74.8 | 25.7 | 63.0 | |
| Time in user code (%) | 99.9 | 99.7 | 99.3 | |
| Compilation Options Score (%) | 100 | 100 | 100 | |
| Array Access Efficiency (%) | Not Available | 76.7 | 71.9 | |
| Potential Speedups | ||||
| Iterations Count | Not Available | 1.02 | 1.19 | |
| Perfect Flow Complexity | 1.10 | 1.07 | 1.09 | |
| Perfect OpenMP + MPI + Pthread | 1.00 | 1.00 | 1.00 | |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | 1.00 | 1.00 | |
| No Scalar Integer | Potential Speedup | 1.12 | 1.13 | 1.32 |
| Nb Loops to get 80% | 12 | 10 | 17 | |
| FP Vectorised | Potential Speedup | 1.01 | 1.05 | 1.31 |
| Nb Loops to get 80% | 2 | 3 | 5 | |
| Fully Vectorised | Potential Speedup | 1.56 | 1.60 | 4.34 |
| Nb Loops to get 80% | 24 | 22 | 33 | |
| Only FP Arithmetic | Potential Speedup | 1.41 | 3.19 | 1.52 |
| Nb Loops to get 80% | 27 | 22 | 25 | |
| Data In L1 Cache | Potential Speedup | Not Available | 1.00 | Not Available |
| Nb Loops to get 80% | Not Available | 1 | Not Available | |
| Source Object | Issue |
|---|---|
| ▼exec | |
| ▼IJVector_parcsr.c | |
| ○ | |
| ▼par_strength.c | |
| ○ | |
| ▼amg.c | |
| ○ | |
| ▼par_lr_interp.c | |
| ○ | |
| ▼csr_matrix.c | |
| ○ | |
| ▼random.c | |
| ○ | |
| ▼csr_matvec.c | |
| ○ | |
| ▼IJMatrix_parcsr.c | |
| ○ | |
| ▼par_coarsen.c | |
| ○ | |
| ▼csr_matop.c | |
| ○ | |
| ▼par_csr_matop.c | |
| ○ | |
| ▼vector.c | |
| ○ | |
| ▼ams.c | |
| ○ | |
| ▼par_multi_interp.c | |
| ○ |
| r0 | r1 | r2 | |
|---|---|---|---|
| Application | /home/hbollore/qaas/qaas-runs/169-817-3176/intel/AMG/run/binaries/gcc_2/exec | /home/kcamus/qaas_runs/169-443-9681/intel/AMG/run/binaries/gcc_2/exec | /home/kcamus/qaas_runs/169-771-5789/intel/AMG/run/binaries/gcc_12/exec |
| Timestamp | 2023-10-24 19:07:03 | 2023-09-11 18:50:08 | 2023-10-19 12:45:31 |
| Experiment Type | MPI; | same as r0 | same as r0 |
| Machine | ip-172-31-47-199 | skylake | ip-172-31-68-94 |
| Architecture | aarch64 | x86_64 | same as r1 |
| Micro Architecture | ARM_NEOVERSE_V1 | SKYLAKE | ZEN_V4 |
| Model Name | Intel(R) Xeon(R) Platinum 8170 CPU @ 2.10GHz | AMD EPYC 9R14 96-Core Processor | |
| Cache Size | 36608 KB | 1024 KB | |
| Number of Cores | 26 | 96 | |
| Maximal Frequency | 0 GHz | 2.1 GHz | 3.701953 GHz |
| OS Version | Linux 5.15.0-1048-aws #53~20.04.1-Ubuntu SMP Wed Oct 4 16:51:38 UTC 2023 | Linux 6.4.1-arch2-1 #1 SMP PREEMPT_DYNAMIC Tue, 04 Jul 2023 08:39:40 +0000 | Linux 6.2.0-1013-aws #13~22.04.1-Ubuntu SMP Fri Sep 8 17:29:56 UTC 2023 |
| Architecture used during static analysis | aarch64 | x86_64 | same as r1 |
| Micro Architecture used during static analysis | ARM_NEOVERSE_V1 | SKYLAKE | ZEN_V4 |
| Compilation Options | exec: GNU C17 11.1.0 -mlittle-endian -mabi=lp64 -mcpu=zeus+crypto+sha3+sm4+nodotprod+noprofile -g -g -Ofast -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fasynchronous-unwind-tables -fstack-protector-strong -fstack-clash-protection | exec: GNU C89 13.1.1 20230429 -march=skylake-avx512 -mprefer-vector-width=512 -g -O3 -std=gnu90 -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops | libparcsr_mv.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans exec: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fno-pie -fcf-protection=none -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans libHYPRE_utilities.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans libseq_mv.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans libIJ_mv.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans libparcsr_ls.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans |
| Number of processes observed | 1 | same as r0 | same as r0 |
| Number of threads observed | 1 | same as r0 | same as r0 |
| Frequency Driver | NA | intel_cpufreq | acpi-cpufreq |
| Frequency Governor | NA | schedutil | performance |
| Huge Pages | madvise | always | same as r0 |
| Hyperthreading | off | same as r0 | same as r0 |
| Number of sockets | 1 | 2 | same as r1 |
| Number of cores per socket | 64 | 26 | 96 |
| MAQAO version | 2.17.9 | 2.17.8 | 2.18.0 |
| MAQAO build | 690431094d99a32cb85b834b2d457fa7bff1d94a::20230918-111356 | 70175eac56e139877d863e6478260132bd85e954::20230901-143618 | 44fc1f08bd133baf72fdfe51b209105f7e5da0e1::20231013-163433 |
| Comments | - | - | - |