| Total Time (s) | 19.26 | |
| Max (Thread Active Time) (s) | 18.98 | |
| Average Active Time (s) | 18.98 | |
| Activity Ratio (%) | 98.5 | |
| Average number of active threads | 1.000 | |
| Affinity Stability (%) | 100.0 | |
| GFLOPS | 8.691 | |
| Time in analyzed loops (%) | 98.2 | |
| Time in analyzed innermost loops (%) | 98.2 | |
| Time in user code (%) | 99.8 | |
| Compilation Options Score (%) | 75.0 | |
| Array Access Efficiency (%) | 67.4 | |
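As a quick sanity check on the figures above, the Activity Ratio looks consistent with the maximum thread active time divided by the total time. That formula is an assumption inferred from the reported numbers, not a definition taken from the report, and the variable names are illustrative only.

```python
# Values copied from the metrics table above.
total_time_s = 19.26
max_thread_active_time_s = 18.98

# Assumed definition (not confirmed by the report): active time / total time.
activity_ratio_pct = 100.0 * max_thread_active_time_s / total_time_s

print(f"Activity ratio: {activity_ratio_pct:.1f} %")  # prints "Activity ratio: 98.5 %"
```

With a single thread observed, the maximum and average active times coincide, so either value gives the same ratio.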
| Potential Speedups |
| Perfect Flow Complexity | 1.00 | |
| Perfect OpenMP + MPI + Pthread | 1.00 | |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | |
| No Scalar Integer | Potential Speedup | 1.09 | |
| No Scalar Integer | Nb Loops to get 80% | 1 | |
| FP Vectorised | Potential Speedup | 2.02 | |
| FP Vectorised | Nb Loops to get 80% | 2 | |
| Fully Vectorised | Potential Speedup | 3.81 | |
| Fully Vectorised | Nb Loops to get 80% | 2 | |
| FP Arithmetic Only | Potential Speedup | 1.10 | |
| FP Arithmetic Only | Nb Loops to get 80% | 1 | |
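To make the potential speedups easier to compare, the sketch below converts each one into a projected run time. It assumes each speedup applies to the whole 19.26 s total time and uses the usual definition of speedup (old time divided by new time). The names `potential_speedups` and `projected_time` are hypothetical helpers, not part of MAQAO.

```python
total_time_s = 19.26  # Total Time (s) from the report

# Potential speedups as reported in the table above.
potential_speedups = {
    "No Scalar Integer": 1.09,
    "FP Vectorised": 2.02,
    "Fully Vectorised": 3.81,
    "FP Arithmetic Only": 1.10,
}

def projected_time(total_s, speedup):
    """Run time if the speedup were fully realised (assumes whole-run scope)."""
    return total_s / speedup

projected = {
    name: projected_time(total_time_s, s)
    for name, s in potential_speedups.items()
}

# List scenarios from the largest projected gain to the smallest.
for name, t in sorted(projected.items(), key=lambda kv: kv[1]):
    print(f"{name}: {t:.2f} s")
```

Under these assumptions, full vectorisation is the only scenario that would bring the run close to 5 s. The scalar-integer and FP-arithmetic scenarios would each save under 2 s.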
| Source Object | Issue |
| ▼ bench | |
| ○fftw-bench.c | -funroll-loops is missing. |
| ○direct.c | -funroll-loops is missing. |
| ○ct.c | -funroll-loops is missing. |
| ○t2fv_4.c | -funroll-loops is missing. |
| ○dftw-direct.c | -funroll-loops is missing. |
| ○n2fv_64.c | -funroll-loops is missing. |
| ○execute.c | -funroll-loops is missing. |
| ○solve.c | -funroll-loops is missing. |
| Experiment Name | FFTW GCC-SSE-128 |
| Application | /home/fmusial/FFTW_Benchmarks/fftw-3.3.10-gcc-sse-128/tests/bench |
| Timestamp | 2025-05-12 14:51:01 |
| Universal Timestamp | 1747054261 |
| Number of processes observed | 1 |
| Number of threads observed | 1 |
| Experiment Type | Sequential |
| Machine | otterfall |
| Model Name | Intel(R) Xeon(R) Silver 4210R CPU @ 2.40GHz |
| Architecture | x86_64 |
| Micro Architecture | SKYLAKE |
| Cache Size | 14080 KB |
| Number of Cores | 10 |
| OS Version | Linux 6.12.1-arch1-1 #1 SMP PREEMPT_DYNAMIC Fri, 22 Nov 2024 16:04:27 +0000 |
| Architecture used during static analysis | x86_64 |
| Micro Architecture used during static analysis | SKYLAKE |
| Frequency Driver | intel_pstate |
| Frequency Governor | performance |
| Huge Pages | always |
| Hyperthreading | off |
| Number of sockets | 1 |
| Number of cores per socket | 10 |
| Compilation Options | bench: GNU C17 14.2.1 20250207 --param=l1-cache-size=32 --param=l1-cache-line-size=64 --param=l2-cache-size=14080 -mtune=cascadelake -msse2 -march=cascadelake -g -O3 -fno-omit-frame-pointer |
| Dataset | |
| Run Command | <executable> -v2 -opatient -owisdom -r 1200000 -t 1 -s ocf256 |
| Number Processes | 1 |
| Number Nodes | 1 |
| Filter | Not Used |
| Profile Start | Not Used |