Global Metrics

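| Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 |
|---|---|---|---|---|---|---|---|
| Total Time (s) | 19.53 | 9.63 | 5.24 | 2.79 | 1.57 | 1.14 | 0.87 |
| Profiled Time (s) | 19.25 | 9.41 | 5.02 | 2.57 | 1.35 | 0.91 | 0.63 |
| Time in analyzed loops (%) | 97.5 | 95.8 | 91.1 | 89.0 | 84.4 | 78.4 | 67.2 |
| Time in analyzed innermost loops (%) | 86.5 | 84.7 | 80.6 | 79.0 | 74.4 | 68.9 | 59.4 |
| Time in user code (%) | 93.0 | 91.1 | 86.3 | 84.3 | 79.3 | 73.8 | 60.3 |
| Compilation Options Score (%) | 93.1 | 92.1 | 92.1 | 92.2 | 91.2 | 91.2 | 87.1 |
| Array Access Efficiency (%) | 98.8 | 98.6 | 98.7 | 98.7 | 98.5 | 98.4 | 98.5 |
| Scalability - Gap | 1.00 | 0.99 | 1.07 | 1.14 | 1.29 | 1.51 | 2.31 |

Potential Speedups

| Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 |
|---|---|---|---|---|---|---|---|
| Perfect Flow Complexity | 1.00 | 1.00 | 1.00 | 1.01 | 1.00 | 1.01 | 1.01 |
| Perfect OpenMP + MPI + Pthread | 1.00 | 1.01 | 1.05 | 1.05 | 1.05 | 1.05 | 1.20 |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | 1.01 | 1.07 | 1.10 | 1.15 | 1.24 | 1.55 |
| No Scalar Integer: Potential Speedup | 1.05 | 1.05 | 1.04 | 1.04 | 1.05 | 1.04 | 1.03 |
| No Scalar Integer: Nb Loops to get 80% | 8 | 9 | 8 | 8 | 10 | 10 | 10 |
| FP Vectorised: Potential Speedup | 1.38 | 1.37 | 1.34 | 1.32 | 1.30 | 1.27 | 1.21 |
| FP Vectorised: Nb Loops to get 80% | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
| Fully Vectorised: Potential Speedup | 2.13 | 2.10 | 1.99 | 1.94 | 1.86 | 1.75 | 1.58 |
| Fully Vectorised: Nb Loops to get 80% | 9 | 10 | 10 | 9 | 10 | 9 | 8 |
| Only FP Arithmetic: Potential Speedup | 1.21 | 1.22 | 1.20 | 1.21 | 1.21 | 1.19 | 1.19 |
| Only FP Arithmetic: Nb Loops to get 80% | 10 | 11 | 11 | 11 | 11 | 11 | 9 |
| OpenMP perfectly balanced: Potential Speedup | 1.00 | 1.00 | 1.04 | 1.03 | 1.02 | 1.02 | 1.07 |
| OpenMP perfectly balanced: Nb Loops to get 80% | 1 | 3 | 2 | 3 | 2 | 3 | 2 |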
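
The scaling behaviour behind the Total Time and Scalability - Gap rows can be checked with a few lines of arithmetic. The sketch below takes the Total Time values from Global Metrics and the thread counts from Experiment Summaries (1, 2, 4, 8, 16, 26, 52) and derives speedup, parallel efficiency and a gap value. The gap formula used here (measured time divided by ideal time, i.e. the inverse of efficiency) is an assumption inferred from the reported numbers, not a definition taken from MAQAO documentation; the function name `scalability` is illustrative only.

```python
# Values copied from the report: Total Time (s) and threads observed per run.
runs = ["r0", "r1", "r2", "r3", "r4", "r5", "r6"]
total_time_s = [19.53, 9.63, 5.24, 2.79, 1.57, 1.14, 0.87]
threads = [1, 2, 4, 8, 16, 26, 52]
reported_gap = [1.00, 0.99, 1.07, 1.14, 1.29, 1.51, 2.31]


def scalability(times, thread_counts):
    """Return (speedup, efficiency, gap) lists relative to the first run.

    Assumption: gap = measured time / ideal time, where the ideal time is
    the reference time divided by the thread-count ratio.
    """
    t_ref, n_ref = times[0], thread_counts[0]
    speedup = [t_ref / t for t in times]
    efficiency = [s / (n / n_ref) for s, n in zip(speedup, thread_counts)]
    gap = [1.0 / e for e in efficiency]
    return speedup, efficiency, gap


speedup, efficiency, gap = scalability(total_time_s, threads)

for name, n, s, e, g, rep in zip(runs, threads, speedup, efficiency, gap, reported_gap):
    print(f"{name}: threads={n:2d} speedup={s:5.2f} efficiency={e:4.2f} "
          f"gap={g:4.2f} (reported {rep:4.2f})")
```

Under this assumption the computed gap stays within about 0.01 of the reported values for every run. At 52 threads the speedup is roughly 22x, which corresponds to a parallel efficiency of about 43%.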

Scalability Speedup

Cumulated Speedup If No Scalar Integer

Cumulated Speedup If FP Vectorized

Cumulated Speedup If Fully Vectorized

Cumulated Speedup If Only FP Arithmetic

Cumulated Speedup If OpenMP Perfectly Balanced

Loop Based Profiles

Innermost / Single Loops

Inbetween Loops

Outermost Loops

Cumulated Coverage With All Loops

Innermost Loop Based Profiles

Coverage

Count

Application Categorization

Time

Coverage

Compilation Options

| Source Object | Issue |
|---|---|
| libqmckl.so.0.0.0 | |
| qmckl_mo.c | |
| qmckl_ao.c | |
| bench_mos | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
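
A build script can apply the same kind of check before profiling. The sketch below scans a compile command line for the three flag groups named in the issue text. The helper `missing_recommended_flags` is hypothetical and is not part of MAQAO; treating `-Ofast` as satisfying the `-O2/-O3` recommendation is also an assumption.

```python
import shlex


def missing_recommended_flags(command_line):
    """Return the recommended flag groups absent from a compile command."""
    tokens = shlex.split(command_line)
    checks = {
        "-g": lambda t: t == "-g",
        # Assumption: -Ofast is accepted as at least as aggressive as -O2/-O3.
        "-O2/-O3": lambda t: t in ("-O2", "-O3", "-Ofast"),
        "-march=(target)": lambda t: t.startswith("-march="),
    }
    return [name for name, ok in checks.items() if not any(ok(t) for t in tokens)]


# Excerpt of the libqmckl.so.0.0.0 options recorded in Experiment Summaries.
libqmckl_flags = ("-march=native -ip -Ofast -ftz -finline -fopenmp "
                  "-mkl=sequential -g -fno-omit-frame-pointer")

print(missing_recommended_flags(libqmckl_flags))       # []
print(missing_recommended_flags("-O3 -march=native"))  # ['-g']
```

The check is purely textual: it only shows that a flag appears on the command line, not that every function in the resulting binary carries debug information.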

Path Count Profiles

Coverage

Count

Low Iteration Count Profiles

Coverage

Count

Experiment Summaries

| | r0 | r1 | r2 | r3 | r4 | r5 | r6 |
|---|---|---|---|---|---|---|---|
| Experiment Name | | | | | | | |
| Application | ./../qmckl_bench/build23feb/bench_mos | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Timestamp | 2024-02-27 14:59:18 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Experiment Type | Sequential | OpenMP; | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 |
| Machine | skylake | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Architecture | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Micro Architecture | SKYLAKE | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Model Name | Intel(R) Xeon(R) Platinum 8170 CPU @ 2.10GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Cache Size | 36608 KB | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of Cores | 26 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Maximal Frequency | 2.1 GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| OS Version | Linux 6.5.7-arch1-1 #1 SMP PREEMPT_DYNAMIC Tue, 10 Oct 2023 21:10:21 +0000 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Architecture used during static analysis | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Micro Architecture used during static analysis | SKYLAKE | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Compilation Options | bench_mos: (none listed); libqmckl.so.0.0.0: Intel(R) C Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.8.0 Build 20221119_000000 -I. -I./include -I./src -I./include -I./src -I./include -I/home/kcamus/comparative/qmckl/qmckl_bench/build23feb/libqmckl/src/libqmckl/share/qmckl/test_data/ -I/home/kcamus/comparative/qmckl/qmckl_bench/build23feb/libtrexio/__install/include -DHAVE_CONFIG_H -DQMCKL_TEST_DIR=\"/home/kcamus/comparative/qmckl/qmckl_bench/build23feb/libqmckl/src/libqmckl/share/qmckl/test_data/\" -march=native -ip -Ofast -ftz -finline -fopenmp -mkl=sequential -g -fno-omit-frame-pointer -fopenmp -MT src/qmckl_mo.lo -MD -MP -MF src/.deps/qmckl_mo.Tpo -c -fPIC -DPIC -o src/.libs/qmckl_mo.o | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of processes observed | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of threads observed | 1 | 2 | 4 | 8 | 16 | 26 | 52 |
| Frequency Driver | intel_cpufreq | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Frequency Governor | performance | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Huge Pages | always | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Hyperthreading | off | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of sockets | 2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of cores per socket | 26 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| MAQAO version | 2.19.2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| MAQAO build | 3bbb4852d00626c5c928d5040b0823ea18d42c9d::20240224-221406 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Comments | | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
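
When these summaries are post-processed, the "same as rN" placeholders usually need to be expanded into concrete values. A minimal sketch follows, assuming each placeholder has exactly the form `same as rN` and always points at an earlier run; the helper `resolve_row` is hypothetical and not part of MAQAO.

```python
import re


def resolve_row(cells, run_names):
    """Replace 'same as rN' cells with the value they point to."""
    resolved = {}
    for name, cell in zip(run_names, cells):
        match = re.fullmatch(r"same as (r\d+)", cell.strip())
        # Assumption: a reference points at an earlier run, already resolved.
        resolved[name] = resolved[match.group(1)] if match else cell
    return resolved


runs = ["r0", "r1", "r2", "r3", "r4", "r5", "r6"]

# The Experiment Type row: r0 is sequential, r1 to r6 are OpenMP runs.
experiment_type = ["Sequential", "OpenMP;"] + ["same as r1"] * 5
print(resolve_row(experiment_type, runs))
```

For the Experiment Type row this maps r0 to "Sequential" and r1 through r6 to "OpenMP;". A placeholder that names a later or unknown run raises a `KeyError`, which is deliberate in this sketch because it signals a malformed row.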