options

Help is available by moving the cursor above any symbol or by checking MAQAO website.

Global Metrics

Metricr0r1r2r3r4r5r6
Total Time (s)54.1926.6315.148.416.164.333.54
Profiled Time (s)52.4724.1413.306.753.722.651.87
Time in analyzed loops (%)94.992.283.983.077.874.062.5
Time in analyzed innermost loops (%)89.486.378.377.472.969.057.0
Time in user code (%)93.690.782.481.476.572.761.0
Compilation Options Score (%)93.792.992.191.292.092.488.5
Array Access Efficiency (%)99.098.998.999.098.999.098.8
Scalability - Gap1.000.981.121.241.822.083.40
Potential Speedups
Perfect Flow Complexity1.001.001.001.001.001.001.00
Perfect OpenMP + MPI + Pthread1.001.031.091.061.081.031.04
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution1.001.051.171.191.301.451.78
No Scalar IntegerPotential Speedup1.021.021.021.021.021.021.02
Nb Loops to get 80%8788888
FP VectorisedPotential Speedup1.391.381.341.331.301.281.22
Nb Loops to get 80%2222222
Fully VectorisedPotential Speedup1.981.931.791.771.691.631.50
Nb Loops to get 80%5665555
Only FP ArithmeticPotential Speedup1.111.111.111.101.101.091.10
Nb Loops to get 80%10101010101010
OpenMP perfectly balancedPotential Speedup1.001.001.031.041.051.041.03
Nb Loops to get 80%1223332

Scalability Speedup

Cumulated Speedup If No Scalar Integer

Cumulated Speedup If FP Vectorized

Cumulated Speedup If Fully Vectorized

Cumulated Speedup If Only FP Arithmetic

Cumulated Speedup if OpenMP Perfectly Balanced

Loop Based Profiles

Innermost / Single Loops

Inbetween Loops

Outermost Loops

Cumulated Coverage With All Loops

Innermost Loop Based Profiles

Coverage

Count

Application Categorization

Time

Coverage

Compilation Options

Source ObjectIssue
libqmckl.so.0.0.0
qmckl_mo.c
qmckl_ao.c
bench_mos
-g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)

Path Count Profiles

Coverage

Count

Low Iteration Count Profiles

Coverage

Count

Experiment Summaries

r0r1r2r3r4r5r6
Experiment Namem1o1m1o1m1o1m1o1m1o1m1o1m1o1
Application./../qmckl_bench/build/bench_mossame as r0same as r0same as r0same as r0same as r0same as r0
Timestamp2024-02-13 11:52:43same as r0same as r0same as r0same as r0same as r0same as r0
Experiment TypeSequentialOpenMP; same as r1same as r1same as r1same as r1same as r1
Machineskylakesame as r0same as r0same as r0same as r0same as r0same as r0
Architecturex86_64same as r0same as r0same as r0same as r0same as r0same as r0
Micro ArchitectureSKYLAKEsame as r0same as r0same as r0same as r0same as r0same as r0
Model NameIntel(R) Xeon(R) Platinum 8170 CPU @ 2.10GHzsame as r0same as r0same as r0same as r0same as r0same as r0
Cache Size36608 KBsame as r0same as r0same as r0same as r0same as r0same as r0
Number of Cores26same as r0same as r0same as r0same as r0same as r0same as r0
Maximal Frequency2.1 GHzsame as r0same as r0same as r0same as r0same as r0same as r0
OS VersionLinux 6.5.7-arch1-1 #1 SMP PREEMPT_DYNAMIC Tue, 10 Oct 2023 21:10:21 +0000same as r0same as r0same as r0same as r0same as r0same as r0
Architecture used during static analysisx86_64same as r0same as r0same as r0same as r0same as r0same as r0
Micro Architecture used during static analysisSKYLAKEsame as r0same as r0same as r0same as r0same as r0same as r0
Compilation Options bench_mos:
libqmckl.so.0.0.0: Intel(R) C Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.8.0 Build 20221119_000000 -I. -I/home/kcamus/comparative/qmckl/qmckl -I./include -I./src -I./include -I/home/kcamus/comparative/qmckl/qmckl/src -I/home/kcamus/comparative/qmckl/qmckl/include -I/home/kcamus/comparative/qmckl/qmckl/share/qmckl/test_data/ -I/home/kcamus/comparative/qmckl/trexio/_install/include -DHAVE_CONFIG_H -DQMCKL_TEST_DIR=\"/home/kcamus/comparative/qmckl/qmckl/share/qmckl/test_data/\" -march=native -ip -Ofast -ftz -finline -fopenmp -mkl=sequential -g -fno-omit-frame-pointer -fopenmp -MT src/qmckl_mo.lo -MD -MP -MF src/.deps/qmckl_mo.Tpo -c -fPIC -DPIC -o src/.libs/qmckl_mo.o
libqmckl.so.0.0.0: Intel(R) C Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.8.0 Build 20221119_000000 -I. -I/home/kcamus/comparative/qmckl/qmckl -I./include -I./src -I./include -I/home/kcamus/comparative/qmckl/qmckl/src -I/home/kcamus/comparative/qmckl/qmckl/include -I/home/kcamus/comparative/qmckl/qmckl/share/qmckl/test_data/ -I/home/kcamus/comparative/qmckl/trexio/_install/include -DHAVE_CONFIG_H -DQMCKL_TEST_DIR=\"/home/kcamus/comparative/qmckl/qmckl/share/qmckl/test_data/\" -march=native -ip -Ofast -ftz -finline -fopenmp -mkl=sequential -g -fno-omit-frame-pointer -fopenmp -MT src/qmckl_mo.lo -MD -MP -MF src/.deps/qmckl_mo.Tpo -c -fPIC -DPIC -o src/.libs/qmckl_mo.o
bench_mos:
same as r1same as r1same as r1same as r1same as r1
Number of processes observed1same as r0same as r0same as r0same as r0same as r0same as r0
Number of threads observed1248162652
Frequency Driverintel_cpufreqsame as r0same as r0same as r0same as r0same as r0same as r0
Frequency Governorperformancesame as r0same as r0same as r0same as r0same as r0same as r0
Huge Pagesalwayssame as r0same as r0same as r0same as r0same as r0same as r0
Hyperthreadingoffsame as r0same as r0same as r0same as r0same as r0same as r0
Number of sockets2same as r0same as r0same as r0same as r0same as r0same as r0
Number of cores per socket26same as r0same as r0same as r0same as r0same as r0same as r0
MAQAO version2.19.0same as r0same as r0same as r0same as r0same as r0same as r0
MAQAO buildb37ee48e971324d4eaf9054a5a16e1bfd5003152::20240201-180403same as r0same as r0same as r0same as r0same as r0same as r0
Comments-------
×