Metric r0 r1 r2 r3 r4 r5
Total Time (s) 39.73 41.87 39.07 38.07 38.27 37.44
Max (Thread Active Time) (s) 38.99 41.08 38.33 37.37 37.38 36.73
Average Active Time (s) 38.96 41.03 38.27 37.30 37.34 36.67
Activity Ratio (%) 99.1 98.9 99.0 99.0 98.6 99.0
Average number of active threads 251.035 250.914 250.806 250.853 249.773 250.729
Affinity Stability (%) 99.8 99.8 99.8 99.8 99.8 99.8
Time in analyzed loops (%) 97.2 96.0 96.3 98.1 97.4 97.7
Time in analyzed innermost loops (%) 97.2 95.8 96.2 98.1 97.2 97.5
Time in user code (%) 97.2 96.0 96.6 98.1 97.5 97.7
Compilation Options Score (%) 66.7 0 0 100 100 100
Array Access Efficiency (%) 20.5 55.9 95.4 13.6 16.8 91.6
Potential Speedups
Perfect Flow Complexity 6.51 1.23 1.24 1.08 7.37 1.05
Perfect OpenMP + MPI + Pthread 1.01 1.01 1.01 1.01 1.01 1.01
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution 1.03 1.04 1.02 1.02 1.03 1.02
No Scalar Integer Potential Speedup 2.68 1.52 1.04 1.79 3.54 1.01
  Nb Loops to get 80% 25 19 4 22 27 5
FP Vectorised Potential Speedup 1.24 1.09 1.47 1.06 1.18 1.05
  Nb Loops to get 80% 12 10 12 10 11 2
Fully Vectorised Potential Speedup 6.62 5.18 4.12 5.11 7.59 1.06
  Nb Loops to get 80% 28 28 26 28 28 2
Only FP Arithmetic Potential Speedup 3.65 6.00 1.11 6.93 3.79 1.25
  Nb Loops to get 80% 26 28 8 29 27 9
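As a reading aid for the Total Time row, the relative speedup of each run against r0 can be computed directly from the reported times. The sketch below is not part of the MAQAO output: the names `total_time_s` and `speedup_vs` are illustrative, and the values are copied from the table above.

```python
# Total Time (s) per run, copied from the metrics table.
total_time_s = {
    "r0": 39.73, "r1": 41.87, "r2": 39.07,
    "r3": 38.07, "r4": 38.27, "r5": 37.44,
}


def speedup_vs(baseline, times):
    """Return baseline_time / run_time for every run (values > 1 mean faster)."""
    base = times[baseline]
    return {run: base / t for run, t in times.items()}


speedups = speedup_vs("r0", total_time_s)
fastest = min(total_time_s, key=total_time_s.get)

for run, ratio in speedups.items():
    print(f"{run}: {total_time_s[run]:.2f} s, speedup vs r0 = {ratio:.3f}")
print("fastest run:", fastest)
```

With these numbers r5 is the fastest run, at roughly 1.06x over r0, while r1 is slower than r0 (ratio below 1). These measured ratios are distinct from the Potential Speedups rows, which are estimates produced by the tool.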
Source Object Issue
exec
  calc_dt.cpp: -march=(target) is missing.
  advec_cell.cpp: -march=(target) is missing.
  accelerate.cpp: -march=(target) is missing.
  reset_field.cpp: -march=(target) is missing.
  revert.cpp: -march=(target) is missing.
  viscosity.cpp: -march=(target) is missing.
  initialise_chunk.cpp: -march=(target) is missing.
  ideal_gas.cpp: -march=(target) is missing.
  PdV.cpp: -march=(target) is missing.
  build_field.cpp: -march=(target) is missing.
  generate_chunk.cpp: -march=(target) is missing.
  field_summary.cpp: -march=(target) is missing.
  advec_mom.cpp: -march=(target) is missing.
  flux_calc.cpp: -march=(target) is missing.
Source Object Issue
exec
  -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
  -O2, -O3 or -Ofast is missing.
  -march=(target) is missing.
Source Object Issue
exec
  -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
  -O2, -O3 or -Ofast is missing.
  -march=(target) is missing.
Source Object Issue
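exec
  calc_dt.cpp: (no issue listed)
  advec_cell.cpp: (no issue listed)
  accelerate.cpp: (no issue listed)
  reset_field.cpp: (no issue listed)
  revert.cpp: (no issue listed)
  viscosity.cpp: (no issue listed)
  initialise_chunk.cpp: (no issue listed)
  ideal_gas.cpp: (no issue listed)
  PdV.cpp: (no issue listed)
  build_field.cpp: (no issue listed)
  generate_chunk.cpp: (no issue listed)
  field_summary.cpp: (no issue listed)
  advec_mom.cpp: (no issue listed)
  flux_calc.cpp: (no issue listed)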
Source Object Issue
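exec
  calc_dt.cpp: (no issue listed)
  advec_cell.cpp: (no issue listed)
  accelerate.cpp: (no issue listed)
  pack_kernel.cpp: (no issue listed)
  reset_field.cpp: (no issue listed)
  revert.cpp: (no issue listed)
  viscosity.cpp: (no issue listed)
  initialise_chunk.cpp: (no issue listed)
  flux_calc.cpp: (no issue listed)
  PdV.cpp: (no issue listed)
  build_field.cpp: (no issue listed)
  ideal_gas.cpp: (no issue listed)
  field_summary.cpp: (no issue listed)
  advec_mom.cpp: (no issue listed)
  generate_chunk.cpp: (no issue listed)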
Source Object Issue
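exec
  calc_dt.cpp: (no issue listed)
  advec_cell.cpp: (no issue listed)
  accelerate.cpp: (no issue listed)
  reset_field.cpp: (no issue listed)
  revert.cpp: (no issue listed)
  viscosity.cpp: (no issue listed)
  initialise_chunk.cpp: (no issue listed)
  ideal_gas.cpp: (no issue listed)
  PdV.cpp: (no issue listed)
  flux_calc.cpp: (no issue listed)
  field_summary.cpp: (no issue listed)
  advec_mom.cpp: (no issue listed)
  generate_chunk.cpp: (no issue listed)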
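The issue lists above report three kinds of missing options: -g, an optimization level (-O2, -O3 or -Ofast), and -march=(target). A minimal sketch of such a check on a recorded command line is given below. It is an illustration only and not MAQAO's implementation: the function name `missing_flags` is hypothetical, and the matching is deliberately narrow (an exact `-g` token, the three level names, and a `-march=` prefix), so it will not recognise other target-selection options.

```python
import shlex


def missing_flags(command_line):
    """Return report-style messages for recommended flags absent from a command line."""
    tokens = shlex.split(command_line)
    messages = []
    # Debug information: only the plain -g token is recognised in this sketch.
    if "-g" not in tokens:
        messages.append("-g is missing.")
    # Optimization level: any one of the three levels named in the report.
    if not any(t in ("-O2", "-O3", "-Ofast") for t in tokens):
        messages.append("-O2, -O3 or -Ofast is missing.")
    # Target architecture: any -march=<value> token.
    if not any(t.startswith("-march=") for t in tokens):
        messages.append("-march=(target) is missing.")
    return messages


# Abbreviated option lists, taken from flags that appear in this report.
no_march = "-g -fno-omit-frame-pointer -std=c++17 -O3 -fopenmp=libomp"
with_march = "-march=znver5 -g -O3 -O3 -std=c++17 -ffast-math -fopenmp"

print(missing_flags(no_march))
print(missing_flags(with_march))
```

The first option list lacks only the target architecture, so a single message is returned; the second carries all three kinds of flag and yields an empty list.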
r0 r1 r2 r3 r4 r5
Experiment Name
Application
  r0: /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/run/oneview_runs/defaults/orig/exec
  r1: /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/run/base_runs/defaults/icx/exec
  r2: /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/run/base_runs/defaults/gcc/exec
  r3: /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/run/binaries/aocc_8/exec
  r4: /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/run/binaries/icx_10/exec
  r5: /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/run/binaries/gcc_5/exec
Timestamp 2025-02-12 16:36:36 2025-02-12 16:39:57 2025-02-12 16:43:08 2025-02-12 17:45:04 2025-02-12 17:46:08 2025-02-12 17:47:16
Experiment Type MPI; OpenMP; same as r0 same as r0 same as r0 same as r0 same as r0
Machine gmz10.benchmarkcenter.megware.com same as r0 same as r0 same as r0 same as r0 same as r0
Architecture x86_64 same as r0 same as r0 same as r0 same as r0 same as r0
Micro Architecture ZEN_V5 same as r0 same as r0 same as r0 same as r0 same as r0
Model Name AMD EPYC 9745 128-Core Processor same as r0 same as r0 same as r0 same as r0 same as r0
Cache Size 1024 KB same as r0 same as r0 same as r0 same as r0 same as r0
Number of Cores 128 same as r0 same as r0 same as r0 same as r0 same as r0
Maximal Frequency 3.707812 GHz same as r0 same as r0 same as r0 same as r0 same as r0
OS Version Linux 5.14.0-503.19.1.el9_5.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Jan 7 17:08:27 EST 2025 same as r0 same as r0 same as r0 same as r0 same as r0
Architecture used during static analysis x86_64 same as r0 same as r0 same as r0 same as r0 same as r0
Micro Architecture used during static analysis ZEN_V5 same as r0 same as r0 same as r0 same as r0 same as r0
Compilation Options
  r0: exec : AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /cluster/comp/aocc/5.0.0/bin/clang-17 --driver-mode=g++ -D USE_OMP -I /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/CloverLeaf2.0-CXX/omp -I /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/build/generated -I /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/CloverLeaf2.0-CXX/driver -I /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/CloverLeaf2.0-CXX/src/omp -g -fno-omit-frame-pointer -fcf-protection=none -nopie -grecord-command-line -D NDEBUG -std=c++17 -Wall -Wno-unused-parameter -Wno-unused-function -Wno-unused-variable -O3 -fopenmp=libomp -MD -MT CMakeFiles/cloverleaf.dir/src/omp/advec_mom.cpp.o -MF CMakeFiles/cloverleaf.dir/src/omp/advec_mom.cpp.o.d -o CMakeFiles/cloverleaf.dir/src/omp/advec_mom.cpp.o -c /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/CloverLeaf2.0-CXX/src/omp/advec_mom.cpp -I /cluster/intel/oneapi/2024.0.0/mpi/2021.11/include
  r1: exec :
  r2: exec : N/A
  r3: exec : AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /cluster/comp/aocc/5.0.0/bin/clang-17 --driver-mode=g++ -D USE_OMP -I /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/CloverLeaf2.0-CXX/omp -I /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/aocc_8/generated -I /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/CloverLeaf2.0-CXX/driver -I /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/CloverLeaf2.0-CXX/src/omp -O2 -march=znver5 -mprefer-vector-width=256 -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -nopie -grecord-command-line -D NDEBUG -std=c++17 -Wall -Wno-unused-parameter -Wno-unused-function -Wno-unused-variable -fopenmp=libomp -MD -MT CMakeFiles/cloverleaf.dir/src/omp/advec_mom.cpp.o -MF CMakeFiles/cloverleaf.dir/src/omp/advec_mom.cpp.o.d -o CMakeFiles/cloverleaf.dir/src/omp/advec_mom.cpp.o -c /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/CloverLeaf2.0-CXX/src/omp/advec_mom.cpp -I /cluster/intel/oneapi/2024.0.0/mpi/2021.11/include
  r4: exec : clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) /cluster/intel/oneapi/2024.0.0/compiler/2024.0/bin/compiler/clang --driver-mode=g++ --intel -I /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/CloverLeaf2.0-CXX/omp -I /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/icx_10/generated -I /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/CloverLeaf2.0-CXX/driver -I /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/CloverLeaf2.0-CXX/src/omp -I /cluster/intel/oneapi/2024.0.0/mpi/2021.11/include -D USE_OMP -O2 -axCORE-AVX512 -fno-vectorize -fno-slp-vectorize -fno-iopenmp-simd -g -fno-omit-frame-pointer -fcf-protection=none -nopie -grecord-command-line -D NDEBUG -std=c++17 -Wall -Wno-unused-parameter -Wno-unused-function -Wno-unused-variable -fiopenmp -MD -MT CMakeFiles/cloverleaf.dir/src/omp/advec_mom.cpp.o -MF CMakeFiles/cloverleaf.dir/src/omp/advec_mom.cpp.o.d -o CMakeFiles/cloverleaf.dir/src/omp/advec_mom.cpp.o -c /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/CloverLeaf2.0-CXX/src/omp/advec_mom.cpp -fveclib=SVML -fheinous-gnu-extensions --driver-mode=g++ --intel -I /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/CloverLeaf2.0-CXX/omp -I /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/icx_10/generated -I /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/CloverLeaf2.0-CXX/driver -I /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/CloverLeaf2.0-CXX/src/omp -I /cluster/intel/oneapi/2024.0.0/mpi/2021.11/include -D USE_OMP -O2 -axCORE-AVX512 -fno-vectorize -fno-slp-vectorize -fno-iopenmp-simd -g -fno-omit-frame-pointer -fcf-protection=none -nopie -grecord-command-line -D NDEBUG -std=c++17 -Wall -Wno-unused-parameter -Wno-unused-function -Wno-unused-variable -fiopenmp -MD -MT CMakeFiles/cloverleaf.dir/src/omp/advec_mom.cpp.o -MF CMakeFiles/cloverleaf.dir/src/omp/advec_mom.cpp.o.d -o CMakeFiles/cloverleaf.dir/src/omp/advec_mom.cpp.o -c /home/eoseret/qaas_runs_ZEN5/173-937-4090/intel/CloverLeaf2.0-CXX/build/CloverLeaf2.0-CXX/src/omp/advec_mom.cpp -fveclib=SVML -fheinous-gnu-extensions
  r5: exec : GNU C++17 14.2.0 -march=znver5 -g -O3 -O3 -std=c++17 -fno-tree-vectorize -fno-openmp-simd -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp
Number of processes observed 8 same as r0 same as r0 same as r0 same as r0 same as r0
Number of threads observed 256 same as r0 same as r0 same as r0 same as r0 same as r0
Frequency Driver acpi-cpufreq same as r0 same as r0 same as r0 same as r0 same as r0
Frequency Governor performance same as r0 same as r0 same as r0 same as r0 same as r0
Huge Pages always same as r0 same as r0 same as r0 same as r0 same as r0
Hyperthreading on same as r0 same as r0 same as r0 same as r0 same as r0
Number of sockets 2 same as r0 same as r0 same as r0 same as r0 same as r0
Number of cores per socket 128 same as r0 same as r0 same as r0 same as r0 same as r0
MAQAO version 2.21.1 same as r0 same as r0 same as r0 same as r0 same as r0
MAQAO build 8271f65b618decdd516f3bd4a943e5566ffabed6::20250211-191351 same as r0 same as r0 same as r0 same as r0 same as r0
Comments same as r0 same as r0 same as r0 same as r0 same as r0