Detailed Application Categorization

ID               Time(s)  Binary(%)  MPI(%)  OMP(%)  TBB(%)  Math(%)  System(%)  Pthread(%)  IO(%)  String(%)  Memory(%)  libqmckl.so.0.0.0 (%)  Others(%)
run_0              12.00       0.04    0.00    0.12    0.00     0.46       0.50        0.04   0.00       0.00       0.29                  98.46       0.08
Node skylake       12.00       0.04    0.00    0.12    0.00     0.46       0.50        0.04   0.00       0.00       0.29                  98.46       0.08
Process 3091126    12.00       0.04    0.00    0.12    0.00     0.46       0.50        0.04   0.00       0.00       0.29                  98.46       0.08
Thread 3091126     12.00       0.04    0.00    0.12    0.00     0.46       0.50        0.04   0.00       0.00       0.29                  98.46       0.08
run_1              12.05       0.00    0.00    0.08    0.00     0.91       0.50        0.00   0.00       0.00       0.21                  98.22       0.08
Node skylake       12.05       0.00    0.00    0.08    0.00     0.91       0.50        0.00   0.00       0.00       0.21                  98.22       0.08
Process 3091185    12.05       0.00    0.00    0.08    0.00     0.91       0.50        0.00   0.00       0.00       0.21                  98.22       0.08
Thread 3091185     12.05       0.00    0.00    0.08    0.00     0.91       0.50        0.00   0.00       0.00       0.21                  98.22       0.08
run_2              11.99       0.00    0.00    0.17    0.00     0.92       0.46        0.08   0.00       0.00       0.17                  98.08       0.13
Node skylake       11.99       0.00    0.00    0.17    0.00     0.92       0.46        0.08   0.00       0.00       0.17                  98.08       0.13
Process 3091252    11.99       0.00    0.00    0.17    0.00     0.92       0.46        0.08   0.00       0.00       0.17                  98.08       0.13
Thread 3091252     11.99       0.00    0.00    0.17    0.00     0.92       0.46        0.08   0.00       0.00       0.17                  98.08       0.13
run_3              12.15       0.00    0.00    0.00    0.00     0.91       0.45        0.00   0.04       0.04       0.16                  98.35       0.04
Node skylake       12.15       0.00    0.00    0.00    0.00     0.91       0.45        0.00   0.04       0.04       0.16                  98.35       0.04
Process 3091329    12.15       0.00    0.00    0.00    0.00     0.91       0.45        0.00   0.04       0.04       0.16                  98.35       0.04
Thread 3091329     12.15       0.00    0.00    0.00    0.00     0.91       0.45        0.00   0.04       0.04       0.16                  98.35       0.04
run_4              12.15       0.00    0.00    0.16    0.00     0.82       0.41        0.04   0.00       0.00       0.25                  98.19       0.12
Node skylake       12.15       0.00    0.00    0.16    0.00     0.82       0.41        0.04   0.00       0.00       0.25                  98.19       0.12
Process 3091432    12.15       0.00    0.00    0.16    0.00     0.82       0.41        0.04   0.00       0.00       0.25                  98.19       0.12
Thread 3091432     12.15       0.00    0.00    0.16    0.00     0.82       0.41        0.04   0.00       0.00       0.25                  98.19       0.12
run_5              12.40       0.00    0.00    0.08    0.00     0.77       0.44        0.00   0.00       0.00       0.20                  98.47       0.04
Node skylake       12.40       0.00    0.00    0.08    0.00     0.77       0.44        0.00   0.00       0.00       0.20                  98.47       0.04
Process 3091573    12.40       0.00    0.00    0.08    0.00     0.77       0.44        0.00   0.00       0.00       0.20                  98.47       0.04
Thread 3091573     12.40       0.00    0.00    0.08    0.00     0.77       0.44        0.00   0.00       0.00       0.20                  98.47       0.04
run_6              12.43       0.00    0.00    0.08    0.00     0.97       0.44        0.00   0.00       0.00       0.16                  98.31       0.04
Node skylake       12.43       0.00    0.00    0.08    0.00     0.97       0.44        0.00   0.00       0.00       0.16                  98.31       0.04
Process 3091782    12.43       0.00    0.00    0.08    0.00     0.97       0.44        0.00   0.00       0.00       0.16                  98.31       0.04
Thread 3091782     12.43       0.00    0.00    0.08    0.00     0.97       0.44        0.00   0.00       0.00       0.16                  98.31       0.04

Detailed Function Times

Function Based Profile

POP metrics

Metric                                    run_0      run_1      run_2      run_3      run_4      run_5      run_6
Global Efficiency                       99.83 %    99.44 %    99.83 %    98.66 %    98.56 %    96.65 %    96.46 %
Parallel Efficiency                     99.83 %    99.92 %    99.75 %   100.00 %    99.79 %    99.92 %    99.92 %
Load Balance                           100.00 %   100.00 %   100.00 %   100.00 %   100.00 %   100.00 %   100.00 %
Communication Efficiency                99.83 %    99.92 %    99.75 %   100.00 %    99.79 %    99.92 %    99.92 %
Computation Scalability                100.00 %    99.52 %   100.08 %    98.66 %    98.76 %    96.72 %    96.54 %
IPC Scalability                        100.00 %    99.44 %    99.99 %    98.61 %    98.67 %    96.61 %    96.42 %
Useful Instructions scalability        100.00 %   100.09 %   100.09 %   100.06 %   100.09 %   100.12 %   100.12 %
Frequency Scalability                  100.00 %   100.00 %   100.00 %   100.00 %   100.00 %   100.00 %   100.00 %
All Instructions scalability           100.00 %   100.03 %   100.06 %   100.06 %   100.03 %   100.06 %   100.03 %
OPC Scalability                        100.00 %    96.97 %    99.18 %    98.96 %    99.66 %    95.58 %    95.67 %
Vectorization Efficiency Scalability   100.00 %    98.65 %    99.97 %    99.75 %   100.09 %    98.83 %    98.35 %
Vectorization Intensity Scalability    100.00 %    96.48 %    99.72 %   101.10 %   102.42 %    98.15 %    99.81 %

Raw computation metrics
IPC                                      1.3478     1.3402     1.3477     1.3290     1.3299     1.3021     1.2996
OPC                                      2.9358     2.8469     2.9117     2.9053     2.9259     2.8059     2.8087
Average Frequency                      2.09 GHz   2.09 GHz   2.09 GHz   2.09 GHz   2.09 GHz   2.09 GHz   2.09 GHz
Vectorization Efficiency              23.2029 %  22.8899 %  23.1950 %  23.1446 %  23.2232 %  22.9318 %  22.8194 %
Vectorization Intensity               26.7580 %  25.8158 %  26.6839 %  27.0533 %  27.4058 %  26.2618 %  26.7083 %
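
The POP metrics above appear to follow the usual multiplicative POP hierarchy, with run_0 as the reference run. The sketch below recomputes four of them from the other reported values so the relations can be checked. It assumes (this is not stated in the report itself) that Parallel Efficiency = Load Balance x Communication Efficiency, that Global Efficiency = Parallel Efficiency x Computation Scalability, that Computation Scalability = IPC Scalability x Useful Instructions scalability x Frequency Scalability, and that IPC Scalability is the raw IPC divided by the run_0 IPC. The helper names `pct_product` and `max_deviation` are illustrative only.

```python
# Values transcribed from the POP metrics table, in percent, ordered run_0..run_6.
load_balance      = [100.00] * 7
comm_eff          = [99.83, 99.92, 99.75, 100.00, 99.79, 99.92, 99.92]
parallel_eff      = [99.83, 99.92, 99.75, 100.00, 99.79, 99.92, 99.92]
comp_scal         = [100.00, 99.52, 100.08, 98.66, 98.76, 96.72, 96.54]
global_eff        = [99.83, 99.44, 99.83, 98.66, 98.56, 96.65, 96.46]
ipc_scal          = [100.00, 99.44, 99.99, 98.61, 98.67, 96.61, 96.42]
useful_instr_scal = [100.00, 100.09, 100.09, 100.06, 100.09, 100.12, 100.12]
freq_scal         = [100.00] * 7
ipc_raw           = [1.3478, 1.3402, 1.3477, 1.3290, 1.3299, 1.3021, 1.2996]


def pct_product(*factors):
    """Multiply percentages together and return the result as a percentage."""
    result = 100.0
    for factor in factors:
        result *= factor / 100.0
    return result


def max_deviation(reported, recomputed):
    """Largest absolute gap, in percentage points, between two series."""
    return max(abs(r - c) for r, c in zip(reported, recomputed))


# Assumed relations (standard POP hierarchy, not confirmed by the report).
recomputed_parallel = [pct_product(lb, ce) for lb, ce in zip(load_balance, comm_eff)]
recomputed_global = [pct_product(pe, cs) for pe, cs in zip(parallel_eff, comp_scal)]
recomputed_comp = [
    pct_product(i, u, f) for i, u, f in zip(ipc_scal, useful_instr_scal, freq_scal)
]
# run_0 is taken as the reference run for the scalability ratios.
recomputed_ipc_scal = [100.0 * ipc / ipc_raw[0] for ipc in ipc_raw]

deviations = {
    "Parallel Efficiency": max_deviation(parallel_eff, recomputed_parallel),
    "Global Efficiency": max_deviation(global_eff, recomputed_global),
    "Computation Scalability": max_deviation(comp_scal, recomputed_comp),
    "IPC Scalability": max_deviation(ipc_scal, recomputed_ipc_scal),
}

for name, dev in deviations.items():
    print(f"{name}: max deviation {dev:.3f} percentage points")
```

With the values in the table, every recomputed metric stays within 0.02 percentage points of the reported one, which is consistent with the inputs being rounded to two decimals.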

Libraries

  • green cell: the library was found in the profile of that run
  • red cell: the library does not appear in the profile of that run
Library  run_0  run_1  run_2  run_3  run_4  run_5  run_6
/home/kcamus/Trex/qmckl/qmckl_bench/build_pop/libqmckl/__install/lib/libqmckl.so.0.0.0
/home/kcamus/Trex/qmckl/qmckl_bench/build_pop/libtrexio/__install/lib/libtrexio.so.0.0.0
/opt/intel/oneapi/compiler/2024.2/lib/libarcher.so
/opt/intel/oneapi/compiler/2024.2/lib/libifcoremt.so.5
/opt/intel/oneapi/compiler/2024.2/lib/libifport.so.5
/opt/intel/oneapi/compiler/2024.2/lib/libimf.so
/opt/intel/oneapi/compiler/2024.2/lib/libintlc.so.5
/opt/intel/oneapi/compiler/2024.2/lib/libiomp5.so
/opt/intel/oneapi/compiler/2024.2/lib/libirng.so
/opt/intel/oneapi/compiler/2024.2/lib/libsvml.so
/usr/lib/ld-linux-x86-64.so.2
/usr/lib/libblas.so.3.12.0
/usr/lib/libc.so.6
/usr/lib/libdl.so.2
/usr/lib/libgcc_s.so.1
/usr/lib/libgfortran.so.5.0.0
/usr/lib/libhdf5.so.320.0.0
/usr/lib/liblapack.so.3.12.0
/usr/lib/libm.so.6
/usr/lib/libpthread.so.0
/usr/lib/librt.so.1
/usr/lib/libsz.so.2.0.1
/usr/lib/libz.so.1.3.1