OV - - Global

kripke_aocc_v2 - 2025-07-09 12:01:41 - MAQAO 2025.1.1

Help is available by moving the cursor above any symbol or by checking MAQAO website.

▶Filter Information

There is no filter information to display

Global Metrics

Total Time (s)		275.91
Max (Thread Active Time) (s)		273.90
Average Active Time (s)		273.66
Activity Ratio (%)		99.2
Average number of active threads		7.935
Affinity Stability (%)		83.9
GFLOPS		26.264
Time in analyzed loops (%)		98.8
Time in analyzed innermost loops (%)		27.5
Time in user code (%)		98.9
Compilation Options Score (%)		66.7
Array Access Efficiency (%)		59.1

Potential Speedups
Perfect Flow Complexity		1.00
Perfect OpenMP/MPI/Pthread/TBB		1.00
Perfect OpenMP/MPI/Pthread/TBB + Perfect Load Distribution		1.01
No Scalar Integer	Potential Speedup	1.48
No Scalar Integer	Nb Loops to get 80%	1
FP Vectorised	Potential Speedup	2.14
FP Vectorised	Nb Loops to get 80%	2
Fully Vectorised	Potential Speedup	8.47
Fully Vectorised	Nb Loops to get 80%	4
FP Arithmetic Only	Potential Speedup	1.84
FP Arithmetic Only	Nb Loops to get 80%	1
OpenMP perfectly balanced	Potential Speedup	1.00
OpenMP perfectly balanced	Nb Loops to get 80%	1

CQA Potential Speedups Summary

Average Active Threads Count⏎

Loop Based Profile⏎

Innermost Loop Based Profile⏎

Application Categorization⏎

Compilation Options⏎

Source Object	Issue
▼kripke_aocc_v2–
○Collapse.hpp	-march=(target) is missing.
○SweepSolver.cpp	-march=(target) is missing.
○ParallelComm.cpp	-march=(target) is missing.
○SweepComm.cpp	-march=(target) is missing.

Loop Path Count Profile⏎

Cumulated Speedup If No Scalar Integer⏎

Cumulated Speedup If FP Vectorized⏎

Cumulated Speedup If Fully Vectorized⏎

Cumulated Speedup If FP Arithmetic Only⏎

Cumulated Speedup If OpenMP perfetecly balanced⏎

Experiment Summary

Experiment Name
Application	/beegfs/hackathon/users/eoseret/kripke_aocc_v2
Timestamp	2025-07-09 12:01:41	Universal Timestamp	1752055301
Number of processes observed	8	Number of threads observed	8
Experiment Type	MPI;
Machine	gmz17.benchmarkcenter.megware.com
Model Name	AMD EPYC 9655 96-Core Processor
Architecture	x86_64	Micro Architecture	ZEN_V5
Cache Size	1024 KB	Number of Cores	96
OS Version	Linux 5.14.0-503.19.1.el9_5.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Jan 7 17:08:27 EST 2025
Architecture used during static analysis	x86_64	Micro Architecture used during static analysis	ZEN_V5
Frequency Driver	acpi-cpufreq	Frequency Governor	performance
Huge Pages	always	Hyperthreading	on
Number of sockets	2	Number of cores per socket	96
Compilation Options	kripke_aocc_v2: AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -I /beegfs/hackathon/users/eoseret/Kripke/src -I /beegfs/hackathon/users/eoseret/Kripke/build/include -I /beegfs/hackathon/users/eoseret/Kripke/tpl/raja/include -I /beegfs/hackathon/users/eoseret/Kripke/build/tpl/raja/include -I /beegfs/hackathon/users/eoseret/Kripke/tpl/raja/tpl/camp/include -I /beegfs/hackathon/users/eoseret/Kripke/build/tpl/raja/tpl/camp/include -isystem /cluster/intel/oneapi/2024.0.0/mpi/2021.11/include -g -grecord-command-line -fno-omit-frame-pointer -O3 -D NDEBUG -std=c++14 -fPIC -fopenmp=libomp -MD -MT CMakeFiles/kripke.dir/src/Kripke/Kernel/Scattering.cpp.o -MF CMakeFiles/kripke.dir/src/Kripke/Kernel/Scattering.cpp.o.d -o CMakeFiles/kripke.dir/src/Kripke/Kernel/Scattering.cpp.o -c /beegfs/hackathon/users/eoseret/Kripke/src/Kripke/Kernel/Scattering.cpp AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -I /beegfs/hackathon/users/eoseret/Kripke/src -I /beegfs/hackathon/users/eoseret/Kripke/build/include -I /beegfs/hackathon/users/eoseret/Kripke/tpl/raja/include -I /beegfs/hackathon/users/eoseret/Kripke/build/tpl/raja/include -I /beegfs/hackathon/users/eoseret/Kripke/tpl/raja/tpl/camp/include -I /beegfs/hackathon/users/eoseret/Kripke/build/tpl/raja/tpl/camp/include -isystem /cluster/intel/oneapi/2024.0.0/mpi/2021.11/include -g -grecord-command-line -fno-omit-frame-pointer -O3 -D NDEBUG -std=c++14 -fPIC -fopenmp=libomp -MD -MT CMakeFiles/kripke.dir/src/Kripke/Timing.cpp.o -MF CMakeFiles/kripke.dir/src/Kripke/Timing.cpp.o.d -o CMakeFiles/kripke.dir/src/Kripke/Timing.cpp.o -c /beegfs/hackathon/users/eoseret/Kripke/src/Kripke/Timing.cpp
Comments

Configuration Summary

Dataset
Run Command	<executable> --groups 1024 --zones 24,16,16 --procs 2,2,2
MPI Command	mpirun -n <number_processes>
Number Processes	8
Number Nodes	1
Number Processes per Node	8
Filter	Not Used
Profile Start	Not Used
Maximal Path Number	4

Report Configuration

kripke_aocc_v2 - 2025-07-09 12:01:41 - MAQAO 2025.1.1

▶Filter Information

Global Metrics

CQA Potential Speedups Summary

Average Active Threads Count⏎

Loop Based Profile⏎

Innermost Loop Based Profile⏎

Application Categorization⏎

Compilation Options⏎

Loop Path Count Profile⏎

Cumulated Speedup If No Scalar Integer⏎

Cumulated Speedup If FP Vectorized⏎

Cumulated Speedup If Fully Vectorized⏎

Cumulated Speedup If FP Arithmetic Only⏎

Cumulated Speedup If OpenMP perfetecly balanced⏎

Experiment Summary

Configuration Summary