Help is available by moving the cursor above any
symbol or by checking MAQAO website.
- There is no filter information to display
Total Time (s) | 150.54 | |
Max (Thread Active Time) (s) | 136.98 | |
Average Active Time (s) | 136.10 | |
Activity Ratio (%) | 90.5 | |
Average number of active threads | 86.790 | |
Affinity Stability (%) | 98.9 | |
Time in analyzed loops (%) | 55.5 | |
Time in analyzed innermost loops (%) | 53.1 | |
Time in user code (%) | 55.7 | |
Compilation Options Score (%) | 100 | |
Array Access Efficiency (%) | 73.4 | |
|
Potential Speedups |
Perfect Flow Complexity | 1.00 | |
Perfect OpenMP + MPI + Pthread | 1.00 | |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.02 | |
No Scalar Integer | Potential Speedup | 1.08 | |
Nb Loops to get 80% | 5 | |
FP Vectorised | Potential Speedup | 1.00 | |
Nb Loops to get 80% | 4 | |
Fully Vectorised | Potential Speedup | 1.08 | |
Nb Loops to get 80% | 3 | |
FP Arithmetic Only | Potential Speedup | 1.24 | |
Nb Loops to get 80% | 7 | |
Source Object | Issue |
▼exec– | |
○WaveFunction.cpp | |
○TwoBodyJastrowRef.h | |
○NewTimer.cpp | |
○SoaDistanceTableAAOMPTarget.h | |
○stl_vector.h | |
○DiracMatrix.h | |
○ParticleSet.cpp | |
○NonLocalPP.hpp | |
○ParticleBConds3DSoa.h | |
○MultiBsplineRef.hpp | |
○DiracDeterminantRef.cpp | |
○einspline_spo_ref.hpp | |
○SoaDistanceTableABOMPTarget.h | |
○OneBodyJastrowRef.h | |
○BsplineFunctor.h | |
○miniqmc.cpp | |
○ParticleSet.h | |
Application | /home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/run/oneview_runs/defaults/orig/exec |
Timestamp | 2025-03-07 14:36:08 |
Universal Timestamp | 1741358168 |
Number of processes observed | 1 |
Number of threads observed | 96 |
Experiment Type | MPI; OpenMP; |
Machine | ip-172-31-47-249.ec2.internal |
Architecture | aarch64 |
Micro Architecture | ARM_NEOVERSE_V2 |
OS Version | Linux 6.1.109-118.189.amzn2023.aarch64 #1 SMP Tue Sep 10 08:58:40 UTC 2024 |
Architecture used during static analysis | aarch64 |
Micro Architecture used during static analysis | ARM_NEOVERSE_V2 |
Frequency Driver | NA |
Frequency Governor | NA |
Huge Pages | madvise |
Hyperthreading | off |
Number of sockets | 1 |
Number of cores per socket | 96 |
Compilation Options | exec: Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D restrict=__restrict__ -I /home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/build/miniqmc/src -I /home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/build/build/src -I /home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/build/miniqmc/src/Particle -I /home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/build/miniqmc/src/Utilities -I /home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/build/miniqmc/src/Platforms -I /home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/build/miniqmc/src/Platforms/Host -O3 -mcpu=native -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-command-line -fopenmp -fstrict-aliasing -Wvla -Wall -Wno-unused-variable -Wno-overloaded-virtual -Wno-unused-private-field -Wno-unused-local-typedef -Wno-unknown-pragmas -Wmisleading-indentation -ffast-math -O3 -D NDEBUG -std=c++17 -MD -MT src/QMCWaveFunctions/CMakeFiles/qmcwfs.dir/SPOSet_builder.cpp.o -MF src/QMCWaveFunctions/CMakeFiles/qmcwfs.dir/SPOSet_builder.cpp.o.d -o src/QMCWaveFunctions/CMakeFiles/qmcwfs.dir/SPOSet_builder.cpp.o -c /home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/SPOSet_builder.cpp -I /home/hbollore/soft/openmpi-5.0.6-armsuite/include | | |
Dataset | |
Run Command | <executable> -g "4 2 2" -b |
MPI Command | mpirun -n <number_processes> --bind-to core --map-by package:PE=96 --rank-by fill --report-bindings |
Number Processes | 1 |
Number Nodes | 1 |
Filter | Not Used |
Profile Start | Not Used |