options

exec - 2024-04-27 06:42:58 - MAQAO 2.20.0

Help is available by moving the cursor above any symbol or by checking MAQAO website.

Global Metrics

Total Time (s)132.03
Profiled Time (s)119.16
GFLOPS590.664
Time in analyzed loops (%)66.7
Time in analyzed innermost loops (%)66.5
Time in user code (%)67.0
Compilation Options Score (%)99.9
Array Access Efficiency (%)92.1
Potential Speedups
Perfect Flow Complexity1.02
Perfect OpenMP + MPI + Pthread1.00
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution1.06
No Scalar IntegerPotential Speedup1.02
Nb Loops to get 80%1
FP VectorisedPotential Speedup1.39
Nb Loops to get 80%3
Fully VectorisedPotential Speedup1.88
Nb Loops to get 80%5
FP Arithmetic OnlyPotential Speedup1.10
Nb Loops to get 80%3

CQA Potential Speedups Summary

Loop Based Profile

Innermost Loop Based Profile

Application Categorization

Compilation Options

Source ObjectIssue
[vdso]
-g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
libqmcparticle.so
ParticleSet.cpp
libqmcwfs.so
OneBodyJastrow.h
TinyVectorTensorOps.h
OhmmsVector.h
TwoBodyJastrowRef.h
DiracDeterminantRef.cpp
TinyVector.h
einspline_spo_ref.hpp
MultiBsplineRef.hpp
WaveFunction.cpp
OneBodyJastrowRef.h
DiracMatrix.h
BsplineFunctor.h
SPOSet.h
exec
-g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
libqmcutil.so
NewTimer.cpp
libqmcparticle_omptarget.so
ParticleBConds3DSoa.h
SoaDistanceTableABOMPTarget.h
SoaDistanceTableAAOMPTarget.h

Loop Path Count Profile

Cumulated Speedup If No Scalar Integer

Cumulated Speedup If FP Vectorized

Cumulated Speedup If Fully Vectorized

Cumulated Speedup If FP Arithmetic Only

Experiment Summary

Application/scratch_na/users/xoserete/qaas_runs/171-417-8059/intel/miniqmc/run/binaries/gcc_9/exec
Timestamp2024-04-27 06:42:58 Universal Timestamp1714192978
Number of processes observed2 Number of threads observed114
Experiment TypeMPI; OpenMP;
Machineo405
Model NameIntel (R) Xeon (R) CPU Max 9480
Architecturex86_64 Micro ArchitectureSAPPHIRE_RAPIDS
Cache Size115200 KB Number of Cores56
OS VersionLinux 4.18.0-477.27.1.el8_8.x86_64 #1 SMP Thu Aug 31 10:29:22 EDT 2023
Architecture used during static analysisx86_64 Micro Architecture used during static analysisSAPPHIRE_RAPIDS
Frequency Driveracpi-cpufreq Frequency Governorperformance
Huge Pagesnever Hyperthreadingon
Number of sockets2 Number of cores per socket56
Compilation Options+ [vdso]: N/A
exec: N/A
libqmcparticle.so: GNU C++17 13.1.0 -march=sapphirerapids -g -O3 -O3 -O3 -std=c++17 -flto -funroll-loops -fno-omit-frame-pointer -fcf-protection=none -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -fPIC
libqmcparticle_omptarget.so: GNU GIMPLE 13.1.0 -march=sapphirerapids -g -g -O3 -O3 -O3 -O3 -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -funroll-loops -fno-omit-frame-pointer -fcf-protection=none -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -fltrans
libqmcutil.so: GNU GIMPLE 13.1.0 -march=sapphirerapids -g -g -O3 -O3 -O3 -O3 -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -funroll-loops -fno-omit-frame-pointer -fcf-protection=none -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -fltrans
libqmcwfs.so: GNU GIMPLE 13.1.0 -march=sapphirerapids -g -g -O3 -O3 -O3 -O3 -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -funroll-loops -fno-omit-frame-pointer -fcf-protection=none -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -fltrans

Configuration Summary

Dataset
Run Command<executable> -g "4 2 2" -b
MPI Commandmpirun -np 2
Number Processes1
Number Nodes1
FilterNot Used
Profile StartNot Used
×