Help is available by moving the cursor above any
symbol or by checking MAQAO website.
| Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | |
|---|---|---|---|---|---|---|---|---|---|
| Total Time (s) | 48.45 | 51.36 | 51.06 | 45.39 | 90.09 | 83.07 | 80.18 | 86.01 | |
| Profiled Time (s) | 37.17 | 36.28 | 36.23 | 34.44 | 78.05 | 73.91 | 68.35 | 77.68 | |
| GFLOPS | 155.585 | 116.368 | 185.951 | 151.705 | 88.754 | 77.462 | 119.928 | 80.015 | |
| Time in analyzed loops (%) | 89.6 | 89.7 | 90.3 | 89.6 | 95.6 | 96.4 | 95.6 | 95.7 | |
| Time in analyzed innermost loops (%) | 89.5 | 89.7 | 90.1 | 89.4 | 95.5 | 96.4 | 95.4 | 95.5 | |
| Time in user code (%) | 89.8 | 89.9 | 90.5 | 89.8 | 95.7 | 96.5 | 95.8 | 95.8 | |
| Compilation Options Score (%) | 0 | 100 | 100 | 0 | 0 | 100 | 100 | 0 | |
| Array Access Efficiency (%) | 82.9 | 84.3 | 81.5 | 83.8 | 86.7 | 89.5 | 83.5 | 87.8 | |
| Potential Speedups | |||||||||
| Iterations Count | 1.00 | Not Available | 1.00 | 1.00 | 1.00 | Not Available | 1.00 | 1.00 | |
| Perfect Flow Complexity | 1.00 | 1.10 | 1.00 | 1.00 | 1.00 | 1.09 | 1.00 | 1.03 | |
| Perfect OpenMP + MPI + Pthread | 1.08 | 1.05 | 1.09 | 1.08 | 1.04 | 1.02 | 1.04 | 1.04 | |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.15 | 1.11 | 1.16 | 1.14 | 1.07 | 1.04 | 1.07 | 1.06 | |
| No Scalar Integer | Potential Speedup | 1.01 | 1.02 | 1.00 | 1.01 | 1.01 | 1.02 | 1.00 | 1.11 |
| Nb Loops to get 80% | 3 | 2 | 3 | 3 | 3 | 3 | 3 | 9 | |
| FP Vectorised | Potential Speedup | 1.12 | 1.21 | 1.00 | 1.00 | 1.12 | 1.11 | 1.00 | 1.05 |
| Nb Loops to get 80% | 10 | 8 | 1 | 2 | 11 | 4 | 1 | 2 | |
| Fully Vectorised | Potential Speedup | 1.28 | 1.44 | 1.00 | 1.00 | 1.31 | 1.12 | 1.00 | 1.06 |
| Nb Loops to get 80% | 14 | 15 | 5 | 5 | 15 | 5 | 6 | 2 | |
| Only FP Arithmetic | Potential Speedup | 1.08 | 1.11 | 1.02 | 1.05 | 1.07 | 1.15 | 1.02 | 1.17 |
| Nb Loops to get 80% | 5 | 6 | 4 | 6 | 6 | 8 | 4 | 10 | |
| Data In L1 Cache | Potential Speedup | Not Available | Not Available | 1.04 | 1.05 | Not Available | Not Available | 1.05 | 1.05 |
| Nb Loops to get 80% | Not Available | Not Available | 1 | 1 | Not Available | Not Available | 1 | 1 | |
| Source Object | Issue |
|---|---|
| ▼exec | |
| ▼update_halo_kernel.f90 | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
| ▼calc_dt_kernel.f90 | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
| ▼revert_kernel.f90 | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
| ▼ideal_gas_kernel.f90 | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
| ▼flux_calc_kernel.f90 | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
| ▼viscosity_kernel.f90 | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
| ▼field_summary_kernel.f90 | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
| ▼advec_mom_kernel.f90 | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
| ▼accelerate_kernel.f90 | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
| ▼advec_cell_kernel.f90 | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
| ▼generate_chunk_kernel.f90 | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
| ▼reset_field_kernel.f90 | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
| ▼PdV_kernel.f90 | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
| ▼initialise_chunk_kernel.f90 | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
| r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | |
|---|---|---|---|---|---|---|---|---|
| Application | /scratch_na/users/xoserete/qaas_runs/170-487-7187/intel/CloverLeafFC/run/oneview_runs/defaults/orig/exec | /scratch_na/users/xoserete/qaas_runs/170-487-7187/intel/CloverLeafFC/run/binaries/gcc_4/exec | /scratch_na/users/xoserete/qaas_runs/170-487-7187/intel/CloverLeafFC/run/binaries/icc_14/exec | /scratch_na/users/xoserete/qaas_runs/170-487-7187/intel/CloverLeafFC/run/binaries/icx_2/exec | /scratch_na/users/xoserete/qaas_runs/170-499-3310/intel/CloverLeafFC/run/oneview_runs/defaults/orig/exec | /scratch_na/users/xoserete/qaas_runs/170-499-3310/intel/CloverLeafFC/run/binaries/gcc_10/exec | /scratch_na/users/xoserete/qaas_runs/170-499-3310/intel/CloverLeafFC/run/binaries/icc_10/exec | /scratch_na/users/xoserete/qaas_runs/170-499-3310/intel/CloverLeafFC/run/binaries/icx_14/exec |
| Timestamp | 2024-01-10 10:05:34 | 2024-01-10 12:39:28 | 2024-01-10 12:35:12 | 2024-01-10 12:31:22 | 2024-01-11 18:24:39 | 2024-01-11 22:34:37 | 2024-01-11 22:29:03 | 2024-01-11 22:23:03 |
| Experiment Type | MPI; OpenMP; | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Machine | o406 | same as r0 | same as r0 | same as r0 | o401 | same as r4 | same as r4 | same as r4 |
| Architecture | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Micro Architecture | SAPPHIRE_RAPIDS | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Model Name | Intel (R) Xeon (R) CPU Max 9480 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Cache Size | 115200 KB | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of Cores | 56 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Maximal Frequency | 1.901 GHz | same as r0 | same as r0 | same as r0 | 1.9 GHz | same as r4 | same as r4 | same as r4 |
| OS Version | Linux 4.18.0-372.26.1.el8_6.x86_64 #1 SMP Sat Aug 27 02:44:20 EDT 2022 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Architecture used during static analysis | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Micro Architecture used during static analysis | SAPPHIRE_RAPIDS | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Compilation Options | exec: Intel(R) Fortran 23.0-1769 | exec: GNU Fortran2008 12.1.0 -march=haswell -g -O3 -O3 -fallow-argument-mismatch -fno-tree-vectorize -fno-openmp-simd -ffast-math -fno-omit-frame-pointer -fopenmp -funroll-loops -fintrinsic-modules-path /software/compilers/gcc/gcc-12.1.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/12.1.0/finclude -fpre-include=/usr/include/finclude/math-vector-fortran.h | exec: Intel(R) Fortran Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.10.0 Build 20230609_000000 -I/scratch_na/users/xoserete/qaas_runs/170-487-7187/intel/CloverLeafFC/build/CloverLeafFC/CloverLeaf_ref/kernels -I/opt/intel/oneapi/mpi/2021.8.0//include -I/opt/intel/oneapi/mpi/2021.8.0/include -O2 -xSAPPHIRERAPIDS -qopt-zmm-usage=high -flto -g -fno-omit-frame-pointer -no-pie -qopenmp -c -o CMakeFiles/clover_leaf.dir/CloverLeaf_ref/kernels/advec_mom_kernel.f90.o | same as r0 | same as r0 | exec: GNU GIMPLE 12.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O3 -O3 -O3 -O3 -fno-openacc -fno-pie -fcf-protection=none -ffast-math -fno-omit-frame-pointer -fopenmp -funroll-loops -fltrans | exec: Intel(R) Fortran Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.10.0 Build 20230609_000000 -I/scratch_na/users/xoserete/qaas_runs/170-499-3310/intel/CloverLeafFC/build/CloverLeafFC/CloverLeaf_ref/kernels -I/opt/intel/oneapi/mpi/2021.8.0//include -I/opt/intel/oneapi/mpi/2021.8.0/include -O3 -O3 -xSAPPHIRERAPIDS -qopt-zmm-usage=high -flto -g -fno-omit-frame-pointer -no-pie -qopenmp -c -o CMakeFiles/clover_leaf.dir/CloverLeaf_ref/kernels/advec_mom_kernel.f90.o | same as r0 |
| Number of processes observed | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of threads observed | 113 | 112 | same as r1 | same as r1 | same as r0 | same as r0 | same as r0 | same as r0 |
| Frequency Driver | acpi-cpufreq | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Frequency Governor | userspace | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Huge Pages | never | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Hyperthreading | on | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of sockets | 2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of cores per socket | 56 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| MAQAO version | 2.18.0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| MAQAO build | Build information not available | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Comments | - | - | - | - | - | - | - | - |