- There is no filter information to display
| Total Time (s) | 6.80E3 |
| Max (Thread Active Time) (s) | 2.44E3 |
| Average Active Time (s) | 1.70E3 |
| Activity Ratio (%) | 25.0 |
| Average number of active threads | 24.252 |
| Affinity Stability (%) | 98.9 |
| Time in analyzed loops (%) | 63.8 |
| Time in analyzed innermost loops (%) | 56.8 |
| Time in user code (%) | 68.5 |
| Compilation Options Score (%) | 100 |
| Array Access Efficiency (%) | 75.9 |
| Potential Speedups |
| Perfect Flow Complexity | 1.01 |
| Perfect OpenMP + MPI + Pthread | 1.66 |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.96 |
| No Scalar Integer: Potential Speedup | 1.13 |
| No Scalar Integer: Nb Loops to get 80% | 21 |
| FP Vectorised: Potential Speedup | 1.03 |
| FP Vectorised: Nb Loops to get 80% | 8 |
| Fully Vectorised: Potential Speedup | 1.14 |
| Fully Vectorised: Nb Loops to get 80% | 26 |
| FP Arithmetic Only: Potential Speedup | 1.41 |
| FP Arithmetic Only: Nb Loops to get 80% | 36 |
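The activity figures in the global metrics appear to be related to one another. The sketch below is a consistency check, not MAQAO's documented formula: it assumes that the activity ratio is the average active time divided by the total time, and that the average number of active threads is that ratio multiplied by the number of threads observed (97, taken from the experiment summary). The variable names are illustrative only.

```python
# Values copied from this report (the table values are rounded for display).
total_time_s = 6.80e3           # Total Time (s)
average_active_time_s = 1.70e3  # Average Active Time (s)
threads_observed = 97           # Number of threads observed

# Assumed definition: activity ratio = average active time / total time.
activity_ratio = average_active_time_s / total_time_s

# Assumed definition: average active threads = activity ratio * thread count.
avg_active_threads = activity_ratio * threads_observed

print(f"Activity ratio: {activity_ratio * 100:.1f} %")
print(f"Average number of active threads: {avg_active_threads:.2f}")
```

Under these assumptions the result is 25.0 % and about 24.25 threads, close to the reported 25.0 and 24.252; the small difference is compatible with the rounding of the displayed times.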
| Source Object | Issue |
| ▼engine_linuxa64_ompi | |
| ○m2cplr.F | |
| ○forint.F | |
| ○inter_count_node_curv.F | |
| ○accele.F | |
| ○spmd_i7tool.F | |
| ○r4evec3.F | |
| ○i7buce_crit.F | |
| ○trace_back.F | |
| ○cfint3.F | |
| ○inter_minmax_node.F | |
| ○cortdir3.F | |
| ○r4def3.F | |
| ○i7dst3.F | |
| ○depla.F | |
| ○ccoef3.F | |
| ○sortie_main.F | |
| ○dtnoda.F | |
| ○scoor3.F | |
| ○spmd_cell_size_exchange.F | |
| ○myqsort_int.F | |
| ○spmd_exch2_a_pon.F | |
| ○shvis3.F | |
| ○intfop2.F | |
| ○i7mainf.F | |
| ○rbilan.F | |
| ○rbyonf.F | |
| ○rgbodv.F | |
| ○sigeps02c.F | |
| ○i7ass3.F | |
| ○i7main_crit_tri.F | |
| ○spmd_i7xvcom2.F | |
| ○chvis3.F | |
| ○rforc3.F | |
| ○redef3.F | |
| ○asspar4.F | |
| ○forintc.F | |
| ○cstra3.F | |
| ○inter_cell_color.F | |
| ○vitesse.F | |
| ○bcs10.F | |
| ○rgbcor.F | |
| ○r4cum3p.F | |
| ○i7trivox.F | |
| ○rgbodfp.F | |
| ○ecrit.F | |
| ○cupdt3.F | |
| ○cderi3.F | |
| ○mulawglc.F | |
| ○r2len3.F | |
| ○i7cdcor3.F | |
| ○intcrit.F | |
| ○cnvec3.F | |
| ○deplafakeige.F | |
| ○sforc3.F | |
| ○ccoor3.F | |
| ○rgwall.F | |
| ○i7main_opt_tri.F | |
| ○inter_voxel_creation.F | |
| ○inttri.F | |
| ○i7optcd.F | |
| ○cdefo3.F | |
| ○cdlen3.F | |
| ○parit.F | |
| ○i7pen3.F | |
| ○i7cor3.F | |
| ○mulawc.F | |
| ○layini.F | |
| ○cbilan.F | |
| ○mmain.F90 | |
| ○sigeps01g.F | |
| ○resol.F | |
| ○spmd_i7fcom_pon.F | |
| ○ccurv3.F | |
| ○scumu3p.F | |
| ○hist2.F | |
| ○cmain3.F | |
| ○timer.F | |
| ○r2coor3.F | |
| ○cforc3.F | |
| ○i7for3.F | |
| Experiment Name | |
| Application | /home/hbollore/pop3/openradioss/OpenRadioss/exec/engine_linuxa64_ompi |
| Timestamp | 2025-01-31 16:39:31 |
| Universal Timestamp | 1738341571 |
| Number of processes observed | 48 |
| Number of threads observed | 97 |
| Experiment Type | MPI; OpenMP; |
| Machine | ip-172-31-47-249.ec2.internal |
| Architecture | aarch64 |
| Micro Architecture | ARM_NEOVERSE_V2 |
| OS Version | Linux 6.1.109-118.189.amzn2023.aarch64 #1 SMP Tue Sep 10 08:58:40 UTC 2024 |
| Architecture used during static analysis | aarch64 |
| Micro Architecture used during static analysis | ARM_NEOVERSE_V2 |
| Frequency Driver | NA |
| Frequency Governor | NA |
| Huge Pages | madvise |
| Hyperthreading | off |
| Number of sockets | 1 |
| Number of cores per socket | 96 |
| Compilation Options | engine_linuxa64_ompi: Arm F90 F90 Flang - 1.5 2017-05-01 flang -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/../common_source/includes -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/../common_source/modules -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/share/includes -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/share/r8 -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/share/spe_inc -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/cbuild_engine_linuxa64_ompi/CMakeFiles/includes_engine_linuxa64_ompi -g -mcpu=native -fno-omit-frame-pointer -module CMakeFiles/modules_engine_linuxa64_ompi -D WITHOUT_LINALG -mcpu=native -D COMP_ARMFLANG=1 -D ARCH_CPU=ARM -fopenmp -D MYREAL8 -ffixed-line-length-none -D MPI -I /home/hbollore/soft/openmpi-5.0.6-armsuite/include/ -D CPP_mach=CPP_p4linux964 -D CPP_rel=70 -O3 -nofma -ffp-contract=off -fno-unsafe-math-optimizations -fno-fast-math -fveclib=none -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/../extlib/h3d/includes -c -o CMakeFiles/engine_linuxa64_ompi.dir/source/materials/mat_share/mulawc.F.o | | |
| Comments | |
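The Universal Timestamp row looks like a Unix epoch value in seconds. As a quick check, assuming that interpretation, it can be converted to a calendar date with the Python standard library:

```python
from datetime import datetime, timezone

# Universal Timestamp copied from the experiment summary above.
universal_timestamp = 1738341571

# Interpret it as seconds since 1970-01-01 00:00:00 UTC (an assumption).
utc_time = datetime.fromtimestamp(universal_timestamp, tz=timezone.utc)

print(utc_time.strftime("%Y-%m-%d %H:%M:%S"))
```

The converted value is 2025-01-31 16:39:31, identical to the Timestamp row, which suggests that the machine clock was set to UTC when the experiment ran.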
| Dataset | |
| Run Command | <executable> -i /home/hbollore/pop3/openradioss/dataset/N1M/NEON1M11_0001.rad |
| MPI Command | mpirun -n <number_processes> --bind-to core --map-by node:PE=<OMP_NUM_THREADS> --report-bindings |
| Number Processes | 48 |
| Number Nodes | 1 |
| Number Processes per Nodes | 48 |
| Filter | Not Used |
| Profile Start | Not Used |
| Maximal Path Number | 4 |
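The MPI Command and Run Command rows contain placeholders. The sketch below shows one way they might be expanded into a complete launch line. It assumes that the run command is simply appended to the MPI command, which this report does not state. The process count and the executable path come from the report; the value of `OMP_NUM_THREADS` is not recorded here, so the 2 used below is purely hypothetical.

```python
import shlex

# Templates copied from the MPI Command and Run Command rows.
mpi_template = ("mpirun -n <number_processes> --bind-to core "
                "--map-by node:PE=<OMP_NUM_THREADS> --report-bindings")
run_template = "<executable> -i /home/hbollore/pop3/openradioss/dataset/N1M/NEON1M11_0001.rad"

number_processes = 48  # "Number Processes" row
omp_num_threads = 2    # hypothetical value, not given in this report
executable = "/home/hbollore/pop3/openradioss/OpenRadioss/exec/engine_linuxa64_ompi"

# Substitute the placeholders, then join the two parts (assumed ordering).
mpi_part = (mpi_template
            .replace("<number_processes>", str(number_processes))
            .replace("<OMP_NUM_THREADS>", str(omp_num_threads)))
run_part = run_template.replace("<executable>", executable)
command = f"{mpi_part} {run_part}"

# Split into an argument list, e.g. for subprocess.run(argv).
argv = shlex.split(command)
print(command)
```

Keeping the templates as plain strings makes it easy to compare the expanded command against the rows of the report before launching anything.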