- There is no filter information to display
| Total Time (s) | 728.99 | |
| Max (Thread Active Time) (s) | 721.50 | |
| Average Active Time (s) | 721.50 | |
| Activity Ratio (%) | 99.0 | |
| Average number of active threads | 1.000 | |
| Affinity Stability (%) | 99.3 | |
| GFLOPS | 7.025 | |
| Time in analyzed loops (%) | 99.7 | |
| Time in analyzed innermost loops (%) | 99.7 | |
| Time in user code (%) | 100.0 | |
| Compilation Options Score (%) | 75.0 | |
| Array Access Efficiency (%) | 60.7 | |
| Potential Speedups |
| Perfect Flow Complexity | 1.00 | |
| Perfect OpenMP + MPI + Pthread | 1.00 | |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | |
| No Scalar Integer | Potential Speedup | 1.10 |
| | Nb Loops to get 80% | 2 |
| FP Vectorised | Potential Speedup | 1.98 |
| | Nb Loops to get 80% | 3 |
| Fully Vectorised | Potential Speedup | 4.13 |
| | Nb Loops to get 80% | 3 |
| FP Arithmetic Only | Potential Speedup | 1.15 |
| | Nb Loops to get 80% | 2 |
| Source Object | Issue |
| ▼bench | |
| ○vrank-geq1.c | -funroll-loops is missing. |
| ○t2fv_8.c | -funroll-loops is missing. |
| ○ct.c | -funroll-loops is missing. |
| ○direct.c | -funroll-loops is missing. |
| ○dftw-direct.c | -funroll-loops is missing. |
| ○t2fv_16.c | -funroll-loops is missing. |
| ○execute.c | -funroll-loops is missing. |
| ○n1fv_128.c | -funroll-loops is missing. |
| Experiment Name | FFTW GCC-SSE-128 |
| Application | /home/fmusial/FFTW_Benchmarks/fftw-3.3.10-gcc-sse-128/tests/bench |
| Timestamp | 2025-05-12 15:08:21 |
| Universal Timestamp | 1747055301 |
| Number of processes observed | 1 |
| Number of threads observed | 1 |
| Experiment Type | Sequential |
| Machine | otterfall |
| Model Name | Intel(R) Xeon(R) Silver 4210R CPU @ 2.40GHz |
| Architecture | x86_64 |
| Micro Architecture | SKYLAKE |
| Cache Size | 14080 KB |
| Number of Cores | 10 |
| OS Version | Linux 6.12.1-arch1-1 #1 SMP PREEMPT_DYNAMIC Fri, 22 Nov 2024 16:04:27 +0000 |
| Architecture used during static analysis | x86_64 |
| Micro Architecture used during static analysis | SKYLAKE |
| Frequency Driver | intel_pstate |
| Frequency Governor | performance |
| Huge Pages | always |
| Hyperthreading | off |
| Number of sockets | 1 |
| Number of cores per socket | 10 |
| Compilation Options | bench: GNU C17 14.2.1 20250207 --param=l1-cache-size=32 --param=l1-cache-line-size=64 --param=l2-cache-size=14080 -mtune=cascadelake -msse2 -march=cascadelake -g -O3 -fno-omit-frame-pointer |
| Dataset | |
| Run Command | <executable> -v2 -opatient -owisdom -r 300000 -t 1 -s ocf16384 |
| Number Processes | 1 |
| Number Nodes | 1 |
| Filter | Not Used |
| Profile Start | Not Used |