Help is available by moving the cursor above any
symbol or by checking MAQAO website.
- There is no filter information to display
| Total Time (s) | 18.98 | |
| Max (Thread Active Time) (s) | 18.93 | |
| Average Active Time (s) | 18.93 | |
| Activity Ratio (%) | 99.7 | |
| Average number of active threads | 0.997 | |
| Affinity Stability (%) | 99.9 | |
| GFLOPS | 3.232 | |
| Time in analyzed loops (%) | 90.1 | |
| Time in analyzed innermost loops (%) | 90.1 | |
| Time in user code (%) | 98.9 | |
| Compilation Options Score (%) | 75.0 | |
| Array Access Efficiency (%) | 53.3 | |
|
| Potential Speedups |
| Perfect Flow Complexity | 1.00 | |
| Perfect OpenMP + MPI + Pthread | 1.00 | |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | |
| No Scalar Integer | Potential Speedup | 1.00 | |
| Nb Loops to get 80% | 1 | |
| FP Vectorised | Potential Speedup | 1.11 | |
| Nb Loops to get 80% | 1 | |
| Fully Vectorised | Potential Speedup | 1.13 | |
| Nb Loops to get 80% | 2 | |
| FP Arithmetic Only | Potential Speedup | 1.63 | |
| Nb Loops to get 80% | 3 | |
| Source Object | Issue |
| ▼bench– | |
| ○vrank-geq1.c | -funroll-loops is missing. |
| ○t1fv_4.c | -funroll-loops is missing. |
| ○dftw-direct.c | -funroll-loops is missing. |
| ○fftw-bench.c | -funroll-loops is missing. |
| ○execute.c | -funroll-loops is missing. |
| ○solve.c | -funroll-loops is missing. |
| ○ct.c | -funroll-loops is missing. |
| ○t1fv_16.c | -funroll-loops is missing. |
| ○direct.c | -funroll-loops is missing. |
| ○n2fv_4.c | -funroll-loops is missing. |
| Experiment Name | FFTW GCC-512TVB-G3-SVE |
| Application | /home/fmusial/FFTW_Benchmarks/fftw-3.3.10-gcc-512tvb-G3-sve/tests/bench |
| Timestamp | 2025-04-04 13:50:06 |
Universal Timestamp | 1743774606 |
| Number of processes observed | 1 |
Number of threads observed | 1 |
| Experiment Type | Sequential |
| Machine | ip-172-31-18-66 |
| Architecture | aarch64 |
Micro Architecture | ARM_NEOVERSE_V1 |
| OS Version | Linux 5.15.0-1081-aws #88~20.04.1-Ubuntu SMP Fri Mar 28 14:48:25 UTC 2025 |
| Architecture used during static analysis | aarch64 |
Micro Architecture used during static analysis | ARM_NEOVERSE_V1 |
| Frequency Driver | NA |
Frequency Governor | NA |
| Huge Pages | madvise |
Hyperthreading | off |
| Number of sockets | 1 |
Number of cores per socket | 64 |
| Compilation Options | bench: GNU C17 14.2.0 -march=armv8.4-a+sve -mtune=neoverse-512tvb -msve-vector-bits=256 -mlittle-endian -mabi=lp64 -g -O3 -fno-omit-frame-pointer | | |
| Dataset | |
| Run Command | <executable> -v2 -opatient -owisdom -r 1200000 -t 1 -s ocf256 |
| Number Processes | 1 |
| Number Nodes | 1 |
| Filter | Not Used |
| Profile Start | Not Used |