| Metric | r0 | r1 | r2 |
|---|---|---|---|
| Total Time (s) | 19.88 | 19.98 | 19.11 |
| Profiled Time (s) | 18.99 | 19.09 | 18.30 |
| Time in analyzed loops (%) | 99.4 | 99.4 | 99.2 |
| Time in analyzed innermost loops (%) | 80.0 | 80.4 | 63.0 |
| Time in user code (%) | 99.5 | 99.5 | 99.3 |
| Compilation Options Score (%) | 0 | 0 | 100 |
| Array Access Efficiency (%) | 67.5 | 67.3 | 71.9 |
| Potential Speedups | | | |
| Iterations Count | Not Available | Not Available | 1.19 |
| Perfect Flow Complexity | 1.09 | 1.09 | 1.09 |
| Perfect OpenMP + MPI + Pthread | 1.00 | 1.00 | 1.00 |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | 1.00 | 1.00 |
| No Scalar Integer: Potential Speedup | 1.14 | 1.14 | 1.32 |
| No Scalar Integer: Nb Loops to get 80% | 10 | 10 | 17 |
| FP Vectorised: Potential Speedup | 1.50 | 1.50 | 1.31 |
| FP Vectorised: Nb Loops to get 80% | 4 | 5 | 5 |
| Fully Vectorised: Potential Speedup | 4.94 | 4.94 | 4.34 |
| Fully Vectorised: Nb Loops to get 80% | 41 | 41 | 33 |
| Only FP Arithmetic: Potential Speedup | 1.34 | 1.35 | 1.52 |
| Only FP Arithmetic: Nb Loops to get 80% | 29 | 28 | 25 |
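
The timing rows can be compared with a few lines of arithmetic. The sketch below copies the "Total Time (s)" and "Fully Vectorised" potential speedup values from the table; the helper names are hypothetical, and treating the potential speedup as a factor on the whole run time is an assumption about how the metric should be read, not something this report states.

```python
# Total wall-clock times in seconds, copied from the "Total Time (s)" row.
total_time = {"r0": 19.88, "r1": 19.98, "r2": 19.11}
# "Fully Vectorised" potential speedups, copied from the table.
fully_vectorised = {"r0": 4.94, "r1": 4.94, "r2": 4.34}


def speedup_over(baseline: str, run: str) -> float:
    """Return how many times faster `run` is than `baseline` (>1 means faster)."""
    return total_time[baseline] / total_time[run]


def projected_time(run: str) -> float:
    """Projected time in seconds, ASSUMING the speedup applies to the whole run."""
    return total_time[run] / fully_vectorised[run]


for run in ("r1", "r2"):
    print(f"{run} vs r0: {speedup_over('r0', run):.3f}x")
for run in total_time:
    print(f"{run} projected if fully vectorised: {projected_time(run):.2f} s")
```

Read this way, r2 is about 4% faster than r0, while r1 is about 0.5% slower, so the three builds are close in measured time even though their potential speedups differ.
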
| Source Object | Issue |
|---|---|
| exec | |
| exec: IJVector_parcsr.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| exec: par_coarsen.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| exec: par_strength.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| exec: random.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| exec: par_lr_interp.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| exec: vector.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| exec: ams.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| exec: csr_matvec.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| exec: IJMatrix_parcsr.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| exec: amg.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| exec: csr_matop.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| exec: par_coarse_parms.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| exec: par_csr_matop.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| exec: hypre_qsort.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| exec: par_multi_interp.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
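
The recommendation above can be checked mechanically against the option strings recorded in a binary. The following is a minimal sketch, not MAQAO's own check: `missing_recommended_flags` is a hypothetical helper, and the two sample strings are the `exec` entries from the "Compilation Options" row of this report for r0 and r2. It tests only for `-g`, an `-O2`/`-O3` level, and a `-march=` option, since `-grecord-gcc-switches` is the switch that makes the other options visible in the first place.

```python
def missing_recommended_flags(producer: str) -> list[str]:
    """List the recommended flags that do not appear in a recorded producer string."""
    tokens = producer.split()
    missing = []
    if "-g" not in tokens:
        missing.append("-g")
    if not any(tok in ("-O2", "-O3") for tok in tokens):
        missing.append("-O2/-O3")
    if not any(tok.startswith("-march=") for tok in tokens):
        missing.append("-march=(target)")
    return missing


# `exec` entries copied from the "Compilation Options" row (r0 and r2).
r0_exec = "AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10)"
r2_exec = (
    "GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 "
    "-fno-openacc -fno-pie -fcf-protection=none -ffast-math "
    "-fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans"
)

print("r0 missing:", missing_recommended_flags(r0_exec))
print("r2 missing:", missing_recommended_flags(r2_exec))
```

For r0 no options were recorded at all, so every recommended flag is reported as missing; for r2 none is. That pattern is consistent with the Compilation Options Score of 0 for r0 and 100 for r2, although the report does not say how the score itself is computed.
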
| Property | r0 | r1 | r2 |
|---|---|---|---|
| Application | /home/kcamus/qaas_runs/169-771-5789/intel/AMG/run/oneview_runs/orig/exec | /home/kcamus/qaas_runs/169-771-5789/intel/AMG/run/binaries/clang_14/exec | /home/kcamus/qaas_runs/169-771-5789/intel/AMG/run/binaries/gcc_12/exec |
| Timestamp | 2023-10-19 11:48:15 | 2023-10-19 12:44:48 | 2023-10-19 12:45:31 |
| Experiment Type | MPI | same as r0 | same as r0 |
| Machine | ip-172-31-68-94 | same as r0 | same as r0 |
| Architecture | x86_64 | same as r0 | same as r0 |
| Micro Architecture | ZEN_V4 | same as r0 | same as r0 |
| Model Name | AMD EPYC 9R14 96-Core Processor | same as r0 | same as r0 |
| Cache Size | 1024 KB | same as r0 | same as r0 |
| Number of Cores | 96 | same as r0 | same as r0 |
| Maximal Frequency | 3.701953 GHz | same as r0 | same as r0 |
| OS Version | Linux 6.2.0-1013-aws #13~22.04.1-Ubuntu SMP Fri Sep 8 17:29:56 UTC 2023 | same as r0 | same as r0 |
| Architecture used during static analysis | x86_64 | same as r0 | same as r0 |
| Micro Architecture used during static analysis | ZEN_V4 | same as r0 | same as r0 |
| Compilation Options | exec: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) | libparcsr_mv.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10)<br>exec: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10)<br>libHYPRE_utilities.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10)<br>libseq_mv.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10)<br>libIJ_mv.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10)<br>libparcsr_ls.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) | libparcsr_mv.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans<br>exec: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fno-pie -fcf-protection=none -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans<br>libHYPRE_utilities.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans<br>libseq_mv.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans<br>libIJ_mv.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans<br>libparcsr_ls.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans |
| Number of processes observed | 1 | same as r0 | same as r0 |
| Number of threads observed | 1 | same as r0 | same as r0 |
| Frequency Driver | acpi-cpufreq | same as r0 | same as r0 |
| Frequency Governor | performance | same as r0 | same as r0 |
| Huge Pages | madvise | same as r0 | same as r0 |
| Hyperthreading | off | same as r0 | same as r0 |
| Number of sockets | 2 | same as r0 | same as r0 |
| Number of cores per socket | 96 | same as r0 | same as r0 |
| MAQAO version | 2.18.0 | same as r0 | same as r0 |
| MAQAO build | 44fc1f08bd133baf72fdfe51b209105f7e5da0e1::20231013-163433 | same as r0 | same as r0 |
| Comments | - | - | - |
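
When the configuration table is processed programmatically, the "same as r0" cells have to be expanded before runs can be compared. A minimal sketch follows, assuming each row has already been split into its three run cells; `resolve_row` is a hypothetical helper, not part of MAQAO.

```python
def resolve_row(r0: str, r1: str, r2: str,
                marker: str = "same as r0") -> tuple[str, str, str]:
    """Replace cells equal to `marker` with the r0 value of the same row."""
    return (r0, r0 if r1 == marker else r1, r0 if r2 == marker else r2)


# Two rows copied from the table above.
micro_arch = resolve_row("ZEN_V4", "same as r0", "same as r0")
timestamp = resolve_row("2023-10-19 11:48:15",
                        "2023-10-19 12:44:48",
                        "2023-10-19 12:45:31")

print(micro_arch)
print(timestamp)
```

Rows that already carry explicit values for every run, such as the timestamps, pass through unchanged.
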