Theme: MAQAO_theme darkgrey cyan
Help is available by moving the cursor above any symbol or by checking MAQAO website .
r0: orig
r1: aocc_10
r2: gcc_13
r3: icx_1
Metric r0 r1 r2 r3 Total Time (s) 23.40 21.86 21.52 29.57
Profiled Time (s) 22.05 20.50 19.22 28.25
GFLOPS 305.672 327.105 349.527 283.207
Time in analyzed loops (%) 82.2 84.1 87.7 85.3
Time in analyzed innermost loops (%) 44.7 44.6 28.0 18.1
Time in user code (%) 82.2 84.2 90.3 85.4
Compilation Options Score (%) 100 100 100 100
Array Access Efficiency (%) 77.6 76.8 100 94.4
Potential Speedups
Perfect Flow Complexity 1.00 1.00 1.00 1.00
Perfect OpenMP + MPI + Pthread 1.08 1.09 1.04 1.07
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution 1.27 1.21 1.21 1.21
No Scalar Integer Potential Speedup 1.16 1.17 1.48 1.43 Nb Loops to get 80% 1 1 1 1 FP Vectorised Potential Speedup 1.46 1.50 1.42 1.68 Nb Loops to get 80% 2 2 1 1 Fully Vectorised Potential Speedup 2.31 2.48 3.57 2.90 Nb Loops to get 80% 3 3 3 3 Only FP Arithmetic Potential Speedup 1.41 1.42 2.18 1.60 Nb Loops to get 80% 2 2 3 1
Source Object Issue
▼ exec–
▼ Collapse.hpp–
○
Source Object Issue
▼ libkripke.so–
▼ Collapse.hpp–
○
▼ [vdso]–
▼ –
○ -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
Source Object Issue
▼ libkripke.so–
▼ Collapse.hpp–
○
▼ SweepSubdomain.cpp–
○
▼ [vdso]–
▼ –
○ -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
Source Object Issue
▼ exec–
▼ Collapse.hpp–
○
r0 r1 r2 r3
Experiment Name
Application /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/run/oneview_runs/defaults/orig/exec /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/run/binaries/aocc_10/exec /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/run/binaries/gcc_13/exec /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/run/binaries/icx_1/exec
Timestamp 2024-02-21 10:09:13 2024-02-21 11:21:40 2024-02-21 11:23:08 2024-02-21 11:22:20
Experiment Type MPI; OpenMP; same as r0 same as r0 same as r0
Machine ins01.benchmarkcenter.megware.com same as r0 same as r0 same as r0
Architecture x86_64 same as r0 same as r0 same as r0
Micro Architecture ZEN_V4 same as r0 same as r0 same as r0
Model Name AMD EPYC 9654 96-Core Processor same as r0 same as r0 same as r0
Cache Size 1024 KB same as r0 same as r0 same as r0
Number of Cores 96 same as r0 same as r0 same as r0
Maximal Frequency 3.707812 GHz same as r0 same as r0 same as r0
OS Version Linux 5.14.0-362.13.1.el9_3.x86_64 #1 SMP PREEMPT_DYNAMIC Thu Dec 21 07:12:43 EST 2023 same as r0 same as r0 same as r0
Architecture used during static analysis x86_64 same as r0 same as r0 same as r0
Micro Architecture used during static analysis ZEN_V4 same as r0 same as r0 same as r0
Compilation Options
exec : AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/src -I include -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/tpl/raja/include -I tpl/raja/include -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/tpl/raja/tpl/cub -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/tpl/raja/tpl/rocPRIM/rocprim/include -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/tpl/raja/tpl/camp/include -O3 -march=native -g -grecord-command-line -fno-omit-frame-pointer -fcf-protection=none -nopie -Wall -Wextra -O3 -D NDEBUG -fPIC -fopenmp=libomp -std=c++14 -MD -MT CMakeFiles/kripke.dir/src/Kripke/Kernel/Scattering.cpp.o -MF CMakeFiles/kripke.dir/src/Kripke/Kernel/Scattering.cpp.o.d -o CMakeFiles/kripke.dir/src/Kripke/Kernel/Scattering.cpp.o -c /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/src/Kripke/Kernel/Scattering.cpp -I /cluster/intel/oneapi/2024.0.0/mpi/2021.11/include libkripke.so : AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D kripke_EXPORTS -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/src -I include -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/tpl/raja/include -I tpl/raja/include -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/tpl/raja/tpl/cub -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/tpl/raja/tpl/rocPRIM/rocprim/include -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/tpl/raja/tpl/camp/include -O3 -O3 -march=znver4 -mprefer-vector-width=512 -flto=full -g -grecord-command-line -fno-omit-frame-pointer -fcf-protection=none -nopie -Wall -Wextra -O3 -D NDEBUG -fPIC -fopenmp=libomp -std=c++14 -MD -MT CMakeFiles/kripke.dir/src/Kripke/Kernel/LPlusTimes.cpp.o -MF CMakeFiles/kripke.dir/src/Kripke/Kernel/LPlusTimes.cpp.o.d -o CMakeFiles/kripke.dir/src/Kripke/Kernel/LPlusTimes.cpp.o -c /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/src/Kripke/Kernel/LPlusTimes.cpp -I /cluster/intel/oneapi/2024.0.0/mpi/2021.11/include + [vdso]: N/A libkripke.so : GNU GIMPLE 13.2.0 -march=znver4 -g -g -O2 -O2 -fno-openacc -fcf-protection=none -fPIC -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -fltrans + [vdso]: N/A exec : clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) /cluster/intel/oneapi/2024.0.0/compiler/2024.0/bin/compiler/clang --driver-mode=g++ --intel -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/src -I include -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/tpl/raja/include -I tpl/raja/include -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/tpl/raja/tpl/cub -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/tpl/raja/tpl/rocPRIM/rocprim/include -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/tpl/raja/tpl/camp/include -I /cluster/intel/oneapi/2024.0.0/mpi/2021.11/include -O3 -O3 -axCORE-AVX512 -g -grecord-command-line -fno-omit-frame-pointer -fcf-protection=none -nopie -Wall -Wextra -O3 -D NDEBUG -fPIC -fiopenmp -std=c++14 -MD -MT CMakeFiles/kripke.dir/src/Kripke/Kernel/LTimes.cpp.o -MF CMakeFiles/kripke.dir/src/Kripke/Kernel/LTimes.cpp.o.d -o CMakeFiles/kripke.dir/src/Kripke/Kernel/LTimes.cpp.o -c /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/src/Kripke/Kernel/LTimes.cpp -fveclib=SVML -fheinous-gnu-extensions --driver-mode=g++ --intel -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/src -I include -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/tpl/raja/include -I tpl/raja/include -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/tpl/raja/tpl/cub -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/tpl/raja/tpl/rocPRIM/rocprim/include -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/tpl/raja/tpl/camp/include -I /cluster/intel/oneapi/2024.0.0/mpi/2021.11/include -O3 -O3 -axCORE-AVX512 -g -grecord-command-line -fno-omit-frame-pointer -fcf-protection=none -nopie -Wall -Wextra -O3 -D NDEBUG -fPIC -fiopenmp -std=c++14 -MD -MT CMakeFiles/kripke.dir/src/Kripke/Kernel/LTimes.cpp.o -MF CMakeFiles/kripke.dir/src/Kripke/Kernel/LTimes.cpp.o.d -o CMakeFiles/kripke.dir/src/Kripke/Kernel/LTimes.cpp.o -c /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/build/Kripke/src/Kripke/Kernel/LTimes.cpp -fveclib=SVML -fheinous-gnu-extensions
Number of processes observed 2 same as r0 same as r0 same as r0
Number of threads observed 192 same as r0 same as r0 same as r0
Frequency Driver acpi-cpufreq same as r0 same as r0 same as r0
Frequency Governor performance same as r0 same as r0 same as r0
Huge Pages always same as r0 same as r0 same as r0
Hyperthreading on same as r0 same as r0 same as r0
Number of sockets 2 same as r0 same as r0 same as r0
Number of cores per socket 96 same as r0 same as r0 same as r0
MAQAO version 2.19.1 same as r0 same as r0 same as r0
MAQAO build e26c8ffcefb997f114892e36591c060f98f53e6a::20240206-190005 same as r0 same as r0 same as r0
Comments Execution on the Megware (https://www.megware.com/en/) benchmarking cluster same as r0 same as r0 same as r0