Help is available by moving the cursor above any
symbol or by checking MAQAO website.
| Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 | |
|---|---|---|---|---|---|---|---|---|
| Total Time (s) | 2.03 E3 | 1.04 E3 | 537.54 | 281.95 | 152.59 | 90.88 | 51.46 | |
| Profiled Time (s) | 2.03 E3 | 1.03 E3 | 531.72 | 277.66 | 148.80 | 89.32 | 50.05 | |
| Time in analyzed loops (%) | 94.3 | 93.4 | 91.6 | 90.3 | 85.5 | 72.9 | 67.7 | |
| Time in analyzed innermost loops (%) | 87.5 | 86.5 | 84.7 | 83.1 | 78.6 | 67.0 | 62.1 | |
| Time in user code (%) | 0 | 0.56 | 2.09 | 3.11 | 7.04 | 20.7 | 24.6 | |
| Compilation Options Score (%) | 75.0 | 75.0 | 75.0 | 75.0 | 75.0 | 75.0 | 75.0 | |
| Perfect Flow Complexity | 1.01 | 1.01 | 1.01 | 1.01 | 1.01 | 1.01 | 1.01 | |
| Array Access Efficiency (%) | Not Available | Not Available | Not Available | Not Available | Not Available | Not Available | Not Available | |
| Perfect OpenMP + MPI + Pthread | 1.00 | 1.00 | 1.00 | 1.00 | 1.01 | 1.01 | 1.01 | |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | 1.00 | 1.00 | 1.00 | 1.01 | 1.01 | 1.02 | |
| No Scalar Integer | Potential Speedup | 1.05 | 1.05 | 1.05 | 1.05 | 1.05 | 1.04 | 1.04 |
| Nb Loops to get 80% | 9 | 11 | 11 | 12 | 14 | 14 | 14 | |
| FP Vectorised | Potential Speedup | 1.42 | 1.41 | 1.40 | 1.39 | 1.36 | 1.29 | 1.26 |
| Nb Loops to get 80% | 5 | 5 | 5 | 5 | 5 | 4 | 4 | |
| Fully Vectorised | Potential Speedup | 1.91 | 1.92 | 1.89 | 1.87 | 1.79 | 1.60 | 1.54 |
| Nb Loops to get 80% | 11 | 12 | 12 | 13 | 13 | 12 | 13 | |
| Only FP Arithmetic | Potential Speedup | 1.43 | 1.44 | 1.43 | 1.42 | 1.39 | 1.32 | 1.29 |
| Nb Loops to get 80% | 13 | 15 | 15 | 15 | 16 | 15 | 16 | |
| Scalability - Gap | 1.00 | 1.02 | 1.06 | 1.11 | 1.20 | 1.43 | 1.62 | |
| Source Object | Issue |
|---|---|
| ▼libgromacs_mpi.so.7.0.0 | |
| ▼lincs.cpp | |
| ○ | -march=(target) is missing. |
| ▼pbc.cpp | |
| ○ | -march=(target) is missing. |
| ▼domdec.cpp | |
| ○ | -march=(target) is missing. |
| ▼pme_redistribute.cpp | |
| ○ | -march=(target) is missing. |
| ▼fft5d.cpp | |
| ○ | -march=(target) is missing. |
| ▼calc_verletbuf.cpp | |
| ○ | -march=(target) is missing. |
| ▼stl_vector.h | |
| ○ | -march=(target) is missing. |
| ▼threaded_force_buffer.cpp | |
| ○ | -march=(target) is missing. |
| ▼bonded.cpp | |
| ○ | -march=(target) is missing. |
| ▼pme_pp.cpp | |
| ○ | -march=(target) is missing. |
| ▼localtopology.cpp | |
| ○ | -march=(target) is missing. |
| ▼pme_grid.cpp | |
| ○ | -march=(target) is missing. |
| ▼pme_solve.cpp | |
| ○ | -march=(target) is missing. |
| ▼kernel_outer.h | |
| ○ | -march=(target) is missing. |
| ▼arm_neon.h | |
| ○ | -march=(target) is missing. |
| ▼manage_threading.cpp | |
| ○ | -march=(target) is missing. |
| ▼kernel_prune.cpp | |
| ○ | -march=(target) is missing. |
| ▼atomdata.cpp | |
| ○ | -march=(target) is missing. |
| ▼pairlist.cpp | |
| ○ | -march=(target) is missing. |
| ▼settle.cpp | |
| ○ | -march=(target) is missing. |
| ▼pairs.cpp | |
| ○ | -march=(target) is missing. |
| ▼vec.h | |
| ○ | -march=(target) is missing. |
| ▼sim_util.cpp | |
| ○ | -march=(target) is missing. |
| ▼grid.cpp | |
| ○ | -march=(target) is missing. |
| ▼md_support.cpp | |
| ○ | -march=(target) is missing. |
| ▼pme_spread.cpp | |
| ○ | -march=(target) is missing. |
| ▼redistribute.cpp | |
| ○ | -march=(target) is missing. |
| ▼update.cpp | |
| ○ | -march=(target) is missing. |
| ▼mdatoms.cpp | |
| ○ | -march=(target) is missing. |
| ▼domdec_constraints.cpp | |
| ○ | -march=(target) is missing. |
| r0 | r1 | r2 | r3 | r4 | r5 | r6 | |
|---|---|---|---|---|---|---|---|
| Application | /home/eoseret/GROMACS/build/gcc_1/bin/gmx_mpi | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Timestamp | 2023-02-21 16:28:39 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Experiment Type | MPI; | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Machine | ip-172-31-8-114 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Architecture | arm64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Micro Architecture | ARM_NEOVERSE_V1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Model Name | |||||||
| Cache Size | |||||||
| Number of Cores | |||||||
| Maximal Frequency | 0 GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| OS Version | Linux 5.15.0-1030-aws #34~20.04.1-Ubuntu SMP Tue Jan 24 15:16:39 UTC 2023 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Architecture used during static analysis | arm64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Micro Architecture used during static analysis | ARM_NEOVERSE_V1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Compilation Options | libgromacs_mpi.so.7.0.0: GNU C++17 11.1.0 -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=c++17 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fexcess-precision=fast -funroll-all-loops -fopenmp -fasynchronous-unwind-tables -fstack-protector-strong -fstack-clash-protection | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of processes observed | 1 | 2 | 4 | 8 | 16 | 32 | 64 |
| Number of threads observed | 1 | 2 | 4 | 8 | 16 | 32 | 64 |
| MAQAO version | 2.16.3 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| MAQAO build | Build information not available | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Comments | GNU 11.1 (SIMD=Neon), AWS G3 (Neoverse V1), scalability | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |