Help is available by moving the cursor above any
symbol or by checking MAQAO website.
- There is no filter information to display
| Total Time (s) | 279.91 | |
| Max (Thread Active Time) (s) | 278.54 | |
| Average Active Time (s) | 278.54 | |
| Activity Ratio (%) | 99.5 | |
| Average number of active threads | 0.995 | |
| Affinity Stability (%) | 99.6 | |
| GFLOPS | 8.782 | |
| Time in analyzed loops (%) | 98.5 | |
| Time in analyzed innermost loops (%) | 98.5 | |
| Time in user code (%) | 99.9 | |
| Compilation Options Score (%) | 75.0 | |
| Array Access Efficiency (%) | 65.7 | |
|
| Potential Speedups |
| Perfect Flow Complexity | 1.00 | |
| Perfect OpenMP + MPI + Pthread | 1.00 | |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | |
| No Scalar Integer | Potential Speedup | 1.00 | |
| Nb Loops to get 80% | 1 | |
| FP Vectorised | Potential Speedup | 1.00 | |
| Nb Loops to get 80% | 1 | |
| Fully Vectorised | Potential Speedup | 1.00 | |
| Nb Loops to get 80% | 2 | |
| FP Arithmetic Only | Potential Speedup | 1.60 | |
| Nb Loops to get 80% | 3 | |
| Source Object | Issue |
| ▼bench– | |
| ○vrank-geq1.c | -funroll-loops is missing. |
| ○t2fv_4.c | -funroll-loops is missing. |
| ○dftw-direct.c | -funroll-loops is missing. |
| ○direct.c | -funroll-loops is missing. |
| ○execute.c | -funroll-loops is missing. |
| ○solve.c | -funroll-loops is missing. |
| ○ct.c | -funroll-loops is missing. |
| ○t2fv_16.c | -funroll-loops is missing. |
| ○n2fv_8.c | -funroll-loops is missing. |
| Experiment Name | FFTW GCC-G3-NO_SVE |
| Application | /home/fmusial/FFTW_Benchmarks/fftw-3.3.10-gcc-G3-no_sve/tests/bench |
| Timestamp | 2025-04-23 09:14:57 |
Universal Timestamp | 1745399697 |
| Number of processes observed | 1 |
Number of threads observed | 1 |
| Experiment Type | Sequential |
| Machine | ip-172-31-47-249.ec2.internal |
| Architecture | aarch64 |
Micro Architecture | ARM_NEOVERSE_V2 |
| OS Version | Linux 6.1.109-118.189.amzn2023.aarch64 #1 SMP Tue Sep 10 08:58:40 UTC 2024 |
| Architecture used during static analysis | aarch64 |
Micro Architecture used during static analysis | ARM_NEOVERSE_V2 |
| Frequency Driver | NA |
Frequency Governor | NA |
| Huge Pages | madvise |
Hyperthreading | off |
| Number of sockets | 1 |
Number of cores per socket | 96 |
| Compilation Options | bench: GNU C17 14.2.0 -march=armv9-a -mtune=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O3 -fno-omit-frame-pointer | | |
| Dataset | |
| Run Command | <executable> -v2 -opatient -owisdom -r 300000 -t 1 -s ocf8192 |
| Number Processes | 1 |
| Number Nodes | 1 |
| Filter | Not Used |
| Profile Start | Not Used |