| Total Time (s) | 22.26 | |
| Profiled Time (s) | 20.87 | |
| GFLOPS | 4.619 | |
| Time in analyzed loops (%) | 99.8 | |
| Time in analyzed innermost loops (%) | 99.7 | |
| Time in user code (%) | 99.9 | |
| Compilation Options Score (%) | 100 | |
| Array Access Efficiency (%) | 82.3 | |
| Potential Speedups |
| Perfect Flow Complexity | 1.18 | |
| Perfect OpenMP + MPI + Pthread | 1.00 | |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | |
| No Scalar Integer | Potential Speedup | 1.02 |
| No Scalar Integer | Nb Loops to get 80% | 4 |
| FP Vectorised | Potential Speedup | 1.36 |
| FP Vectorised | Nb Loops to get 80% | 4 |
| Fully Vectorised | Potential Speedup | 1.51 |
| Fully Vectorised | Nb Loops to get 80% | 5 |
| FP Arithmetic Only | Potential Speedup | 1.18 |
| FP Arithmetic Only | Nb Loops to get 80% | 8 |
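To make the speedup factors above easier to read, the following minimal sketch converts each one into a projected run time. It assumes that every factor applies to the whole-application Total Time (22.26 s), which the report does not state explicitly. The names `projected_time` and `potential_speedups` are illustrative and are not part of MAQAO.

```python
# Values copied from the report; the time base is an assumption.
total_time_s = 22.26

potential_speedups = {
    "Perfect Flow Complexity": 1.18,
    "No Scalar Integer": 1.02,
    "FP Vectorised": 1.36,
    "Fully Vectorised": 1.51,
    "FP Arithmetic Only": 1.18,
}


def projected_time(time_s: float, speedup: float) -> float:
    """Return the run time expected if the given speedup were fully achieved."""
    return time_s / speedup


# Projected time per scenario, assuming the factor scales the total time.
projected = {
    name: projected_time(total_time_s, factor)
    for name, factor in potential_speedups.items()
}

for name, seconds in projected.items():
    print(f"{name}: {seconds:.2f} s")
```

Under that assumption, full vectorisation is the scenario with the largest projected gain, and the two "Perfect OpenMP + MPI + Pthread" rows (factor 1.00) would leave the run time unchanged.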
| Source Object | Issue |
| exec | |
| ○accelerate_kernel.f90-pp.f90 | |
| ○ideal_gas_kernel.f90-pp.f90 | |
| ○initialise_chunk_kernel.f90-pp.f90 | |
| ○viscosity_kernel.f90-pp.f90 | |
| ○advec_mom_kernel.f90-pp.f90 | |
| ○calc_dt_kernel.f90-pp.f90 | |
| ○build_field.f90-pp.f90 | |
| ○field_summary_kernel.f90-pp.f90 | |
| ○generate_chunk_kernel.f90-pp.f90 | |
| ○flux_calc_kernel.f90-pp.f90 | |
| ○PdV_kernel.f90-pp.f90 | |
| ○update_halo_kernel.f90-pp.f90 | |
| ○revert_kernel.f90-pp.f90 | |
| ○advec_cell_kernel.f90-pp.f90 | |
| ○reset_field_kernel.f90-pp.f90 | |
| Application | /home/kcamus/qaas_runs/170-300-1173/intel/CloverLeafFC/run/oneview_runs/defaults/orig/exec | | |
| Timestamp | 2023-12-19 15:57:35 |
| Universal Timestamp | 1703001455 |
| Number of processes observed | 1 |
| Number of threads observed | 1 |
| Experiment Type | MPI; OpenMP; | | |
| Machine | ip-172-31-68-94 | | |
| Model Name | AMD EPYC 9R14 96-Core Processor | | |
| Architecture | x86_64 |
| Micro Architecture | ZEN_V4 |
| Cache Size | 1024 KB |
| Number of Cores | 96 |
| OS Version | Linux 6.2.0-1017-aws #17~22.04.1-Ubuntu SMP Fri Nov 17 21:07:13 UTC 2023 | | |
| Architecture used during static analysis | x86_64 |
| Micro Architecture used during static analysis | ZEN_V4 |
| Frequency Driver | acpi-cpufreq |
| Frequency Governor | performance |
| Huge Pages | madvise |
| Hyperthreading | off |
| Number of sockets | 2 |
| Number of cores per socket | 96 |
| Compilation Options | exec: F90 Flang - 1.5 2017-05-01 '+flang -I/home/kcamus/qaas_runs/170-300-1173/intel/CloverLeafFC/build/CloverLeafFC/CloverLeaf_ref/kernels -O3 -march=native -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -fopenmp -c -o -I/home/kcamus/openmpi/openmpi-5.0.0/_install/include -I/home/kcamus/openmpi/openmpi-5.0.0/_install/lib' | | |
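The Timestamp and Universal Timestamp rows above can be cross-checked against each other. The sketch below assumes that the Universal Timestamp is a Unix epoch value in seconds and interprets it in UTC. The report does not say which time zone the human-readable Timestamp uses, so a match only suggests that the machine clock was set to UTC.

```python
from datetime import datetime, timezone

# Value copied from the "Universal Timestamp" row of the report.
universal_timestamp = 1703001455

# Interpret the value as seconds since the Unix epoch, in UTC (assumption).
as_utc = datetime.fromtimestamp(universal_timestamp, tz=timezone.utc)

# Format it the same way as the "Timestamp" row for direct comparison.
formatted = as_utc.strftime("%Y-%m-%d %H:%M:%S")
print(formatted)
```

The printed string can be compared directly with the Timestamp row to confirm that both fields describe the same instant.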
| Dataset | |
| Run Command | <executable> |
| MPI Command | mpirun --bind-to none -np 1 |
| Number Processes | 1 |
| Number Nodes | 1 |
| Filter | {type = number ; value = 1 ; } |
| Profile Start | {unit = none ; value = 0 ; } |