
Help is available by hovering the cursor over any symbol or by consulting the MAQAO website.

Global Metrics

Metric                                                              r0      r1      r2      r3      r4      r5      r6      r7      r8
Total Time (s)                                                  593.56  321.79  210.16  137.15  122.86   52.84   37.02    9.41   10.69
Profiled Time (s)                                               593.56  321.79  210.16  137.15  122.86   52.71   36.89    9.30   10.49
Time in analyzed loops (%)                                        99.7    99.5    99.5    98.8    96.7    84.9    73.0    55.7    36.5
Time in analyzed innermost loops (%)                              34.1    35.6    40.0    43.5    46.3    35.0    29.5    13.0    10.8
Time in user code (%)                                             99.7    99.5    99.5    98.8    96.7    84.9    73.1    56.0    36.7
Compilation Options Score (%)                                     75.0    75.0    75.0    75.0    75.0    75.0    75.0    75.0    75.0
Array Access Efficiency (%)                                      100.0   100.0   100.0   100.0   100.0   100.0   100.0   100.0   100.0
Scalability - Gap                                                 1.00    1.08    1.42    1.85    3.31    2.85    3.24    1.65    3.75
Potential Speedups
  Perfect Flow Complexity                                         1.00    1.00    1.00    1.00    1.00    1.00    1.00    1.00    1.00
  Perfect OpenMP + MPI + Pthread                                  1.00    1.00    1.00    1.01    1.02    1.06    1.10    1.40    1.84
  Perfect OpenMP + MPI + Pthread + Perfect Load Distribution      1.00    1.01    1.01    1.02    1.04    1.21    1.42    2.09    3.11
  No Scalar Integer: Potential Speedup                            1.38    1.36    1.33    1.30    1.27    1.26    1.22    1.22    1.12
  No Scalar Integer: Nb Loops to get 80%                             1       1       1       1       1       1       1       1       1
  FP Vectorised: Potential Speedup                                2.10    2.07    2.00    1.94    1.86    1.74    1.58    1.46    1.25
  FP Vectorised: Nb Loops to get 80%                                 2       2       2       2       2       2       2       1       1
  Fully Vectorised: Potential Speedup                             5.12    4.96    4.68    4.36    3.91    3.01    2.36    1.87    1.43
  Fully Vectorised: Nb Loops to get 80%                              2       2       2       2       2       2       2       2       2
  Only FP Arithmetic: Potential Speedup                           1.72    1.69    1.61    1.54    1.47    1.47    1.38    1.37    1.20
  Only FP Arithmetic: Nb Loops to get 80%                            1       1       1       1       1       1       1       1       1
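
The Total Time row can be turned into the usual strong-scaling figures. The sketch below is not part of the MAQAO output: it takes the r0 run as the baseline, pairs each Total Time with the "Number of threads observed" value listed in the Experiment Summaries, and computes speedup and parallel efficiency with the conventional textbook definitions. The function name scaling_table is hypothetical. With these numbers, the inverse of the efficiency agrees with the reported Scalability - Gap row to two decimals, but that is an observation about this data set, not a statement of how MAQAO defines the metric.

```python
# Total Time (s) for runs r0..r8, copied from the Global Metrics table.
total_time_s = [593.56, 321.79, 210.16, 137.15, 122.86, 52.84, 37.02, 9.41, 10.69]
# Number of threads observed for runs r0..r8, from the Experiment Summaries.
threads = [1, 2, 4, 8, 16, 32, 52, 104, 208]


def scaling_table(times, thread_counts):
    """Return (threads, speedup, efficiency) tuples relative to the first run.

    speedup    = baseline time / run time
    efficiency = speedup / (run threads / baseline threads)
    These are the conventional definitions, assumed here for illustration.
    """
    base_time = times[0]
    base_threads = thread_counts[0]
    rows = []
    for run_time, n in zip(times, thread_counts):
        speedup = base_time / run_time
        efficiency = speedup / (n / base_threads)
        rows.append((n, speedup, efficiency))
    return rows


if __name__ == "__main__":
    for n, speedup, efficiency in scaling_table(total_time_s, threads):
        # 1 / efficiency is printed for comparison with "Scalability - Gap".
        print(f"{n:>4} threads  speedup {speedup:6.2f}  "
              f"efficiency {efficiency:5.2f}  1/efficiency {1 / efficiency:5.2f}")
```

Read this way, r7 (104 threads) is the fastest run, and r8 (208 threads) is slower than r7 in total time.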

Scalability Speedup

Cumulated Speedup If No Scalar Integer

Cumulated Speedup If FP Vectorized

Cumulated Speedup If Fully Vectorized

Cumulated Speedup If Only FP Arithmetic

Loop Based Profiles

Innermost / Single Loops

Inbetween Loops

Outermost Loops

Cumulated Coverage With All Loops

Innermost Loop Based Profiles

[Charts: Coverage, Count]

Application Categorization

[Charts: Time, Coverage]

Compilation Options

Source Object        Issue
spmxv.exe
  ooo_cmdline.h      -funroll-loops is missing.
  ooo_cmdline.cpp    -funroll-loops is missing.
  main.cpp           -funroll-loops is missing.

Path Count Profiles

[Charts: Coverage, Count]

Low Iteration Count Profiles

[Charts: Coverage, Count]

Experiment Summaries

Runs: r0, r1, r2, r3, r4, r5, r6, r7, r8. Unless per-run values are listed, r1 through r8 report "same as r0".

Experiment Name: (none)
Application: ./spmxv.exe
Timestamp: 2024-07-05 15:34:07
Experiment Type: r0 Sequential; r1 OpenMP; r2 through r8 same as r1
Machine: ifcp01.benchmarkcenter.megware.com
Architecture: x86_64
Micro Architecture: SAPPHIRE_RAPIDS
Model Name: Intel(R) Xeon(R) Platinum 8470
Cache Size: 107520 KB
Number of Cores: 52
Maximal Frequency: 3.8 GHz
OS Version: Linux 5.14.0-427.18.1.el9_4.x86_64 #1 SMP PREEMPT_DYNAMIC Tue May 28 06:27:02 EDT 2024
Architecture used during static analysis: x86_64
Micro Architecture used during static analysis: SAPPHIRE_RAPIDS
Compilation Options: spmxv.exe: GNU C++17 13.2.0 -march=sapphirerapids -mmmx -mpopcnt -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mavx -mavx2 -mno-sse4a -mno-fma4 -mno-xop -mfma -mavx512f -mbmi -mbmi2 -maes -mpclmul -mavx512vl -mavx512bw -mavx512dq -mavx512cd -mno-avx512er -mno-avx512pf -mavx512vbmi -mavx512ifma -mno-avx5124vnniw -mno-avx5124fmaps -mavx512vpopcntdq -mavx512vbmi2 -mgfni -mvpclmulqdq -mavx512vnni -mavx512bitalg -mavx512bf16 -mno-avx512vp2intersect -mno-3dnow -madx -mabm -mcldemote -mclflushopt -mclwb -mno-clzero -mcx16 -menqcmd -mf16c -mfsgsbase -mfxsr -mno-hle -msahf -mno-lwp -mlzcnt -mmovbe -mmovdir64b -mmovdiri -mno-mwaitx -mpconfig -mpku -mno-prefetchwt1 -mprfchw -mptwrite -mrdpid -mrdrnd -mrdseed -mno-rtm -mserialize -msgx -msha -mshstk -mno-tbm -mtsxldtrk -mvaes -mwaitpkg -mwbnoinvd -mxsave -mxsavec -mxsaveopt -mxsaves -mamx-tile -mamx-int8 -mamx-bf16 -muintr -mno-hreset -mno-kl -mno-widekl -mavxvnni -mavx512fp16 -mno-avxifma -mno-avxvnniint8 -mno-avxneconvert -mno-cmpccxadd -mno-amx-fp16 -mno-prefetchi -mno-raoint -mno-amx-complex --param=l1-cache-size=48 --param=l1-cache-line-size=64 --param=l2-cache-size=107520 -mtune=sapphirerapids -g -Ofast -fopenmp
Number of processes observed: 1
Number of threads observed: r0 1; r1 2; r2 4; r3 8; r4 16; r5 32; r6 52; r7 104; r8 208
Frequency Driver: intel_pstate
Frequency Governor: performance
Huge Pages: always
Hyperthreading: on
Number of sockets: 2
Number of cores per socket: 52
MAQAO version: 2.20.4
MAQAO build: 99be6ccf67bfe415248870770db4aebf733f1e82::20240704-151126
Comments: (none)
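
The thread counts used for the runs line up with the machine topology reported above. As a minimal sketch, the arithmetic below relates each "Number of threads observed" value to the 2 sockets and 52 cores per socket. It assumes that Hyperthreading "on" means two hardware threads per core. The names classify, physical_cores and logical_cpus are hypothetical, and the labels describe only how many hardware resources each count could occupy. Where the threads actually ran depends on OpenMP affinity settings such as OMP_PLACES and OMP_PROC_BIND, which the report does not record.

```python
# Topology values copied from the Experiment Summaries.
sockets = 2
cores_per_socket = 52
# Assumption: Hyperthreading "on" provides 2 hardware threads per core.
threads_per_core = 2

physical_cores = sockets * cores_per_socket
logical_cpus = physical_cores * threads_per_core

# Number of threads observed for runs r0..r8.
threads = [1, 2, 4, 8, 16, 32, 52, 104, 208]


def classify(n_threads):
    """Label a thread count by the smallest resource level that can hold it."""
    if n_threads <= cores_per_socket:
        return "fits on the physical cores of one socket"
    if n_threads <= physical_cores:
        return "fits on the physical cores of both sockets"
    if n_threads <= logical_cpus:
        return "needs hyperthreads"
    return "oversubscribes the machine"


if __name__ == "__main__":
    print(f"physical cores: {physical_cores}, logical CPUs: {logical_cpus}")
    for run, n in enumerate(threads):
        print(f"r{run}: {n:>3} threads, {classify(n)}")
```

Under that assumption, r6 matches one socket's core count, r7 matches the total physical core count, and r8 matches the total logical CPU count.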