* Info: Detected 1 Lprof instances in skylake: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 617176)miniqmc not built from git repository
number of ranks : 1, number of accelerators : 0
Number of orbitals/splines = 384
Tile size = 384
Number of tiles = 1
Number of electrons = 768
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 1
OpenMP threads = 1
Number of walkers per rank = 1
SPO coefficients size = 196608000 bytes (187.5 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.1309 0.1309 1 0.130877018
Total 1.6151 0.0002 1 1.615062952
Diffusion 0.6007 0.0030 5 0.120130777
Accept move 0.0015 0.0015 1913 0.000000760
Complete Updates 0.0023 0.0000 5 0.000466824
DeterminantRef::update 0.0023 0.0023 10 0.000232458
Current Gradient 0.0198 0.0018 3840 0.000005163
DeterminantRef::ratio 0.0169 0.0169 3840 0.000004402
OneBodyJastrowRef 0.0007 0.0007 3840 0.000000169
TwoBodyJastrowRef 0.0005 0.0005 3840 0.000000132
Kinetic Energy 0.0061 0.0060 5 0.001214409
OneBodyJastrowRef 0.0000 0.0000 5 0.000006628
TwoBodyJastrowRef 0.0000 0.0000 5 0.000005007
Make move 0.0584 0.0584 3840 0.000015209
New Gradient 0.3409 0.0023 3840 0.000088780
DeterminantRef::ratio 0.0038 0.0038 3840 0.000000979
DeterminantRef::spovgl 0.2947 0.0210 3840 0.000076747
Single-Particle Orbitals 0.2737 0.2737 3840 0.000071288
OneBodyJastrowRef 0.0021 0.0021 3840 0.000000556
TwoBodyJastrowRef 0.0380 0.0380 3840 0.000009891
Set active 0.0719 0.0719 3840 0.000018721
Update 0.0967 0.0012 1913 0.000050575
DeterminantRef::update 0.0581 0.0581 1913 0.000030350
OneBodyJastrowRef 0.0003 0.0003 1913 0.000000162
TwoBodyJastrowRef 0.0371 0.0371 1913 0.000019413
Initialization 0.1318 0.0540 1 0.131827116
DeterminantRef::inverse 0.0204 0.0204 2 0.010208845
DeterminantRef::spovgl 0.0543 0.0022 2 0.027131557
Single-Particle Orbitals 0.0521 0.0521 768 0.000067854
OneBodyJastrowRef 0.0003 0.0003 1 0.000347137
TwoBodyJastrowRef 0.0028 0.0028 1 0.002774954
Pseudopotential 0.8824 0.0059 5 0.176483440
Make move 0.2068 0.2068 15792 0.000013095
Value 0.6698 0.0493 15792 0.000042411
DeterminantRef::ratio 0.0049 0.0049 15792 0.000000311
DeterminantRef::spoval 0.5659 0.0043 15792 0.000035834
Single-Particle Orbitals 0.5616 0.5616 15792 0.000035560
OneBodyJastrowRef 0.0312 0.0312 15792 0.000001976
TwoBodyJastrowRef 0.0185 0.0185 15792 0.000001171
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 2.80475e+08
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 7.54153e+08
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 668419
* Info: Process finished (host skylake, process 617176)
Your experiment path is /home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/orig/oneview_results_1693904207/tools/lprof_npsu_run_0
To display your profiling results:
#####################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#####################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/orig/oneview_results_1693904207/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/orig/oneview_results_1693904207/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/orig/oneview_results_1693904207/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/orig/oneview_results_1693904207/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/orig/oneview_results_1693904207/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/orig/oneview_results_1693904207/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/orig/oneview_results_1693904207/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/orig/oneview_results_1693904207/tools/lprof_npsu_run_0 #
#####################################################################################################################################################################################