exec - 2024-04-27 04:32:36 - MAQAO 2.20.0

Global Metrics

Total Time (s): 100.88
Profiled Time (s): 86.33
GFLOPS: 581.079
Time in analyzed loops (%): 71.0
Time in analyzed innermost loops (%): 70.8
Time in user code (%): 71.4
Compilation Options Score (%): 99.9
Array Access Efficiency (%): 92.9
Potential Speedups
Perfect Flow Complexity: 1.02
Perfect OpenMP + MPI + Pthread: 1.00
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution: 1.05
No Scalar Integer
  Potential Speedup: 1.02
  Nb Loops to get 80%: 1
FP Vectorised
  Potential Speedup: 1.34
  Nb Loops to get 80%: 3
Fully Vectorised
  Potential Speedup: 1.98
  Nb Loops to get 80%: 5
FP Arithmetic Only
  Potential Speedup: 1.11
  Nb Loops to get 80%: 3
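
A potential speedup s can be read as a projected run time T/s. The following is a minimal sketch, assuming the speedups apply to the profiled time (86.33 s); the report does not state which base time MAQAO uses, and the function name `projected_time` is hypothetical.

```python
def projected_time(base_time_s: float, speedup: float) -> float:
    """Return the run time expected if the given speedup were fully achieved."""
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    return base_time_s / speedup


# "Profiled Time (s)" from Global Metrics (assumed to be the base time).
PROFILED_TIME_S = 86.33

# Potential speedups copied from the report.
speedups = {
    "No Scalar Integer": 1.02,
    "FP Vectorised": 1.34,
    "Fully Vectorised": 1.98,
    "FP Arithmetic Only": 1.11,
}

for name, s in speedups.items():
    t = projected_time(PROFILED_TIME_S, s)
    print(f"{name}: {t:.2f} s projected, {PROFILED_TIME_S - t:.2f} s saved")
```

These are upper bounds on the gain, since each speedup assumes the corresponding transformation is achieved perfectly.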

CQA Potential Speedups Summary

Loop Based Profile

Innermost Loop Based Profile

Application Categorization

Compilation Options

Source Object: Issue
[vdso]: -g is missing for some functions (possibly ones added by the compiler); it is needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
exec: -g is missing for some functions (possibly ones added by the compiler); it is needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
libqmcparticle.so
  ParticleSet.cpp
libqmcwfs.so
  OneBodyJastrow.h
  TinyVectorTensorOps.h
  OhmmsVector.h
  TwoBodyJastrowRef.h
  DiracDeterminantRef.cpp
  MultiBsplineRef.hpp
  einspline_spo_ref.hpp
  inner_product.hpp
  WaveFunction.cpp
  DiracMatrix.h
  OneBodyJastrowRef.h
  DelayedUpdate.h
  SPOSet.h
  BsplineFunctor.h
libqmcutil.so
  NewTimer.cpp
libqmcparticle_omptarget.so
  ParticleBConds3DSoa.h
  SoaDistanceTableAAOMPTarget.h
  SoaDistanceTableABOMPTarget.h

Loop Path Count Profile

Cumulated Speedup If No Scalar Integer

Cumulated Speedup If FP Vectorized

Cumulated Speedup If Fully Vectorized

Cumulated Speedup If FP Arithmetic Only

Experiment Summary

Application: /scratch_na/users/xoserete/qaas_runs/171-417-3180/intel/miniqmc/run/binaries/gcc_11/exec
Timestamp: 2024-04-27 04:32:36
Universal Timestamp: 1714185156
Number of processes observed: 2
Number of threads observed: 114
Experiment Type: MPI; OpenMP
Machine: o404
Model Name: Intel (R) Xeon (R) CPU Max 9480
Architecture: x86_64
Micro Architecture: SAPPHIRE_RAPIDS
Cache Size: 115200 KB
Number of Cores: 56
OS Version: Linux 4.18.0-477.27.1.el8_8.x86_64 #1 SMP Thu Aug 31 10:29:22 EDT 2023
Architecture used during static analysis: x86_64
Micro Architecture used during static analysis: SAPPHIRE_RAPIDS
Frequency Driver: acpi-cpufreq
Frequency Governor: performance
Huge Pages: never
Hyperthreading: on
Number of sockets: 2
Number of cores per socket: 56
Compilation Options:
[vdso]: N/A
exec: N/A
libqmcparticle.so: GNU C++17 13.1.0 -march=haswell -g -O3 -O3 -O3 -std=c++17 -flto -funroll-loops -fno-omit-frame-pointer -fcf-protection=none -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -fPIC
libqmcparticle_omptarget.so: GNU GIMPLE 13.1.0 -march=haswell -g -g -O3 -O3 -O3 -O3 -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -funroll-loops -fno-omit-frame-pointer -fcf-protection=none -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -fltrans
libqmcutil.so: GNU GIMPLE 13.1.0 -march=haswell -g -g -O3 -O3 -O3 -O3 -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -funroll-loops -fno-omit-frame-pointer -fcf-protection=none -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -fltrans
libqmcwfs.so: GNU GIMPLE 13.1.0 -march=haswell -g -g -O3 -O3 -O3 -O3 -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -funroll-loops -fno-omit-frame-pointer -fcf-protection=none -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -fltrans
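
The option strings above repeat several flags (for example -O3) and all carry -march=haswell, while the machine is reported as SAPPHIRE_RAPIDS. The following is a minimal sketch for normalising such a string before comparing builds; the helper names `unique_flags` and `march_value` are hypothetical, and it assumes that dropping exact duplicates does not change the build, which holds for identical repeats but not for conflicting options.

```python
def unique_flags(option_string: str) -> list:
    """Split a compiler option string and drop exact repeats, keeping first-seen order."""
    return list(dict.fromkeys(option_string.split()))


def march_value(flags):
    """Return the value of the first -march=... flag, or None if there is none."""
    for flag in flags:
        if flag.startswith("-march="):
            return flag.split("=", 1)[1]
    return None


# Flags reported for libqmcparticle.so (compiler name and version omitted).
options = (
    "-march=haswell -g -O3 -O3 -O3 -std=c++17 -flto -funroll-loops "
    "-fno-omit-frame-pointer -fcf-protection=none -fopenmp "
    "-finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -fPIC"
)

flags = unique_flags(options)
print(" ".join(flags))
print("target architecture flag:", march_value(flags))
```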

Configuration Summary

Dataset:
Run Command: <executable> -g "4 2 2" -b
MPI Command: mpirun -np 2
Number Processes: 1
Number Nodes: 1
Filter: Not Used
Profile Start: Not Used