OV - exec - Global

exec - 2023-12-19 15:21:31 - MAQAO 2.18.1

Help is available by moving the cursor above any symbol or by checking MAQAO website.

Global Metrics

Total Time (s)		69.15
Profiled Time (s)		67.33
GFLOPS		6.272
Time in analyzed loops (%)		99.9
Time in analyzed innermost loops (%)		99.9
Time in user code (%)		100.0
Compilation Options Score (%)		0
Array Access Efficiency (%)		14.8

Potential Speedups
Perfect Flow Complexity		1.07
Perfect OpenMP + MPI + Pthread		1.00
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution		1.00
No Scalar Integer	Potential Speedup	1.69
No Scalar Integer	Nb Loops to get 80%	22
FP Vectorised	Potential Speedup	1.08
FP Vectorised	Nb Loops to get 80%	4
Fully Vectorised	Potential Speedup	3.07
Fully Vectorised	Nb Loops to get 80%	26
FP Arithmetic Only	Potential Speedup	5.36
FP Arithmetic Only	Nb Loops to get 80%	27

CQA Potential Speedups Summary

Loop Based Profile⏎

Innermost Loop Based Profile⏎

Application Categorization⏎

Compilation Options⏎

Source Object	Issue
▼exec–
○calc_dt.cpp	-g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
○advec_cell.cpp	-g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
○accelerate.cpp	-g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
○initialise_chunk.cpp	-g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
○reset_field.cpp	-g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
○revert.cpp	-g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
○viscosity.cpp	-g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
○flux_calc.cpp	-g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
○PdV.cpp	-g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
○update_halo.cpp	-g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
○build_field.cpp	-g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
○generate_chunk.cpp	-g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
○field_summary.cpp	-g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
○advec_mom.cpp	-g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
○ideal_gas.cpp	-g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)

Loop Path Count Profile⏎

Cumulated Speedup If No Scalar Integer⏎

Cumulated Speedup If FP Vectorized⏎

Cumulated Speedup If Fully Vectorized⏎

Cumulated Speedup If FP Arithmetic Only⏎

Experiment Summary

Application	/home/kcamus/qaas_runs/170-299-3881/intel/CloverLeafCXX/run/binaries/clang_8/exec
Timestamp	2023-12-19 15:21:31	Universal Timestamp	1702999291
Number of processes observed	1	Number of threads observed	1
Experiment Type	MPI; OpenMP;
Machine	ip-172-31-68-94
Model Name	AMD EPYC 9R14 96-Core Processor
Architecture	x86_64	Micro Architecture	ZEN_V4
Cache Size	1024 KB	Number of Cores	96
OS Version	Linux 6.2.0-1017-aws #17~22.04.1-Ubuntu SMP Fri Nov 17 21:07:13 UTC 2023
Architecture used during static analysis	x86_64	Micro Architecture used during static analysis	ZEN_V4
Frequency Driver	acpi-cpufreq	Frequency Governor	performance
Huge Pages	madvise	Hyperthreading	off
Number of sockets	2	Number of cores per socket	96
Compilation Options	exec: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10)

Configuration Summary

Dataset
Run Command	<executable>
MPI Command	mpirun --bind-to none -np 1
Number Processes	1
Number Nodes	1
Filter	{type = number ; value = 1 ; }
Profile Start	{unit = none ; value = 0 ; }

Report Configuration

exec - 2023-12-19 15:21:31 - MAQAO 2.18.1

Global Metrics

CQA Potential Speedups Summary

Loop Based Profile⏎

Innermost Loop Based Profile⏎

Application Categorization⏎

Compilation Options⏎

Loop Path Count Profile⏎

Cumulated Speedup If No Scalar Integer⏎

Cumulated Speedup If FP Vectorized⏎

Cumulated Speedup If Fully Vectorized⏎

Cumulated Speedup If FP Arithmetic Only⏎

Experiment Summary

Configuration Summary