Help is available by hovering the cursor over any
symbol or on the MAQAO website.
- There is no filter information to display
| Total Time (s) | 2.53 E3 | |
| Max (Thread Active Time) (s) | 1.97 E3 | |
| Average Active Time (s) | 1.71 E3 | |
| Activity Ratio (%) | 67.7 | |
| Average number of active threads | 66.962 | |
| Affinity Stability (%) | 96.9 | |
| Time in analyzed loops (%) | 79.7 | |
| Time in analyzed innermost loops (%) | 74.8 | |
| Time in user code (%) | 82.9 | |
| Compilation Options Score (%) | 100 | |
| Array Access Efficiency (%) | 69.3 | |
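
The timing rows above can be cross-checked against each other with a little arithmetic. The sketch below is only a rough sanity check: it assumes that the activity ratio is the average active time divided by the total time, and that the average number of active threads is that ratio multiplied by the 99 threads observed in this run. These formulas are inferred from the reported values, not taken from the tool's documentation, and all variable names are illustrative.

```python
# Values copied from the global metrics table and the experiment summary.
total_time_s = 2.53e3
max_active_time_s = 1.97e3
avg_active_time_s = 1.71e3
threads_observed = 99
reported_activity_ratio_pct = 67.7
reported_avg_active_threads = 66.962

# Assumption: activity ratio = average active time / total time.
activity_ratio_pct = 100.0 * avg_active_time_s / total_time_s

# Assumption: average active threads = activity ratio * observed threads.
avg_active_threads = reported_activity_ratio_pct / 100.0 * threads_observed

# Illustrative imbalance indicator: busiest thread versus the average thread.
imbalance_factor = max_active_time_s / avg_active_time_s

print(f"activity ratio (recomputed): {activity_ratio_pct:.1f} %")
print(f"average active threads (recomputed): {avg_active_threads:.2f}")
print(f"max / average active time: {imbalance_factor:.2f}")
```

Because the inputs are rounded to three significant figures, the recomputed values are expected to agree with the reported ones only approximately.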
| Potential Speedups |
| Perfect Flow Complexity | 1.03 | |
| Perfect OpenMP + MPI + Pthread | 1.12 | |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.24 | |
| No Scalar Integer | Potential Speedup | 1.18 | |
| No Scalar Integer | Nb Loops to get 80% | 40 | |
| FP Vectorised | Potential Speedup | 1.05 | |
| FP Vectorised | Nb Loops to get 80% | 15 | |
| Fully Vectorised | Potential Speedup | 1.21 | |
| Fully Vectorised | Nb Loops to get 80% | 41 | |
| FP Arithmetic Only | Potential Speedup | 1.45 | |
| FP Arithmetic Only | Nb Loops to get 80% | 41 | |
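
To make the speedup factors easier to compare, they can be converted into projected run times. The sketch below assumes that each factor applies to the total time of 2.53 E3 s reported in the global metrics and that the scenarios are independent, so the factors are not multiplied together. Both points are assumptions made for illustration, and the dictionary and variable names are hypothetical.

```python
# Total time from the global metrics table, in seconds.
total_time_s = 2.53e3

# Potential speedup factors copied from the table above.
potential_speedups = {
    "Perfect Flow Complexity": 1.03,
    "Perfect OpenMP + MPI + Pthread": 1.12,
    "Perfect OpenMP + MPI + Pthread + Perfect Load Distribution": 1.24,
    "No Scalar Integer": 1.18,
    "FP Vectorised": 1.05,
    "Fully Vectorised": 1.21,
    "FP Arithmetic Only": 1.45,
}

# Assumption: projected time = total time / speedup factor.
projected_time_s = {
    name: total_time_s / factor for name, factor in potential_speedups.items()
}

for name, projected in projected_time_s.items():
    saved = total_time_s - projected
    print(f"{name}: {projected:.0f} s projected, {saved:.0f} s saved")
```

Under these assumptions, a factor of 1.24 corresponds to a little under one fifth of the run time being recoverable, since 1 - 1/1.24 is roughly 0.19.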
| Source Object | Issue |
| ▼engine_linuxa64_ompi | |
| ○m2cplr.F | |
| ○forint.F | |
| ○layini.F | |
| ○accele.F | |
| ○cmain3.F | |
| ○r4evec3.F | |
| ○i7buce_crit.F | |
| ○mstiforthv.F | |
| ○upd_failwave_sh4n.F | |
| ○roto.F | |
| ○sigeps28.F | |
| ○r4def3.F | |
| ○sortho3.F | |
| ○set_failwave_nod4.F | |
| ○s4sav3.F | |
| ○srrota3.F | |
| ○mqviscb.F | |
| ○sortie_main.F | |
| ○dtnoda.F | |
| ○szordef3.F | |
| ○czdef.F | |
| ○slen.F | |
| ○czcorc.F | |
| ○get_volume_area.F90 | |
| ○sfint3.F | |
| ○spmd_cell_size_exchange.F | |
| ○mmodul.F | |
| ○fail_hashin_c.F | |
| ○gfhour_or.F | |
| ○srota3.F | |
| ○intfop2.F | |
| ○i7mainf.F | |
| ○cndt3.F | |
| ○czproj.F | |
| ○update_failwave.F | |
| ○szderi3.F | |
| ○sorthdir3.F | |
| ○mat25_crasurv_c.F90 | |
| ○fail_biquad_s.F | |
| ○vinter.F | |
| ○sigeps02c.F | |
| ○rgwall.F | |
| ○s4forc3.F | |
| ○s4deri3.F | |
| ○spmd_i7xvcom2.F | |
| ○smallb3.F | |
| ○rforc3.F | |
| ○czcorp5.F | |
| ○storth3.F | |
| ○srho3.F | |
| ○m25crak.F | |
| ○i2vit3.F | |
| ○i7dst3.F | |
| ○sdlen3.F | |
| ○uroto.F | |
| ○srcoor3.F | |
| ○szhour3_or.F | |
| ○i7cor3.F | |
| ○sroto3.F | |
| ○mmain.F90 | |
| ○redef3.F | |
| ○asspar4.F | |
| ○m2law.F | |
| ○czfintce.F | |
| ○schkjabt3.F | |
| ○cupdtn3.F | |
| ○sroto3v.F | |
| ○czstra3.F | |
| ○forintc.F | |
| ○srmall3.F | |
| ○mrotens.F | |
| ○fail_biquad_c.F | |
| ○i7pen3.F | |
| ○cdkcoor3.F | |
| ○upd_failwave_sh3n.F | |
| ○intti1.F | |
| ○mreploc.F | |
| ○cbatran3v.F | |
| ○initbuf.F | |
| ○i7for3.F | |
| ○i7trivox.F | |
| ○cncoef3.F | |
| ○c3updt3.F | |
| ○gravit.F | |
| ○s4defo3.F | |
| ○sstra3.F | |
| ○mstrain_rate.F | |
| ○czfintn.F | |
| ○deplafakeige.F | |
| ○mulawc.F | |
| ○mmod_norm.F | |
| ○hist2.F | |
| ○i7cdcor3.F | |
| ○parit.F | |
| ○szforc3.F | |
| ○gettransv.F | |
| ○i2for3p.F | |
| ○czforc3.F | |
| ○s4fint3.F | |
| ○depla.F | |
| ○sigeps25c.F | |
| ○bcs10.F | |
| ○i7optcd.F | |
| ○s4cumu3p.F | |
| ○c3forc3.F | |
| ○srepiso3.F | |
| ○ecrit.F | |
| ○s4mall3.F | |
| ○fail_setoff_c.F | |
| ○cbilan.F | |
| ○resol.F | |
| ○sreploc3.F | |
| ○s8sav3.F | |
| ○s4coor3.F | |
| ○cmatc3.F | |
| ○spmd_exch2_a_pon.F | |
| ○scumu3p.F | |
| ○c3coor3.F | |
| ○sdefo3.F | |
| ○szsvm_or.F | |
| ○mulaw.F90 | |
| ○volpresp.F | |
| ○vitesse.F | |
| Experiment Name | |
| Application | /home/hbollore/pop3/openradioss/OpenRadioss/exec/engine_linuxa64_ompi |
| Timestamp | 2025-01-31 19:46:37 |
| Universal Timestamp | 1738352797 |
| Number of processes observed | 48 |
| Number of threads observed | 99 |
| Experiment Type | MPI; OpenMP; |
| Machine | ip-172-31-47-249.ec2.internal |
| Architecture | aarch64 |
| Micro Architecture | ARM_NEOVERSE_V2 |
| OS Version | Linux 6.1.109-118.189.amzn2023.aarch64 #1 SMP Tue Sep 10 08:58:40 UTC 2024 |
| Architecture used during static analysis | aarch64 |
| Micro Architecture used during static analysis | ARM_NEOVERSE_V2 |
| Frequency Driver | NA |
| Frequency Governor | NA |
| Huge Pages | madvise |
| Hyperthreading | off |
| Number of sockets | 1 |
| Number of cores per socket | 96 |
| Compilation Options | engine_linuxa64_ompi: Arm F90 F90 Flang - 1.5 2017-05-01 flang -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/../common_source/includes -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/../common_source/modules -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/share/includes -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/share/r8 -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/share/spe_inc -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/cbuild_engine_linuxa64_ompi/CMakeFiles/includes_engine_linuxa64_ompi -g -mcpu=native -fno-omit-frame-pointer -module CMakeFiles/modules_engine_linuxa64_ompi -D WITHOUT_LINALG -mcpu=native -D COMP_ARMFLANG=1 -D ARCH_CPU=ARM -fopenmp -D MYREAL8 -ffixed-line-length-none -D MPI -I /home/hbollore/soft/openmpi-5.0.6-armsuite/include/ -D CPP_mach=CPP_p4linux964 -D CPP_rel=70 -O3 -nofma -ffp-contract=off -fno-unsafe-math-optimizations -fno-fast-math -fveclib=none -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/../extlib/h3d/includes -c -o CMakeFiles/engine_linuxa64_ompi.dir/source/materials/mat/mat025/mat25_crasurv_c.F90.o | | |
| Comments | | | |
| Dataset | |
| Run Command | <executable> -i /home/hbollore/pop3/openradioss/dataset/SKYCAB/ML2_FEM_FULL_MODEL_MB_04_11_2022_0001.rad |
| MPI Command | mpirun -n <number_processes> --bind-to core --map-by node:PE=<OMP_NUM_THREADS> --report-bindings |
| Number Processes | 48 |
| Number Nodes | 1 |
| Number Processes per Nodes | 48 |
| Filter | Not Used |
| Profile Start | Not Used |
| Maximal Path Number | 4 |