* Info: Detected 1 Lprof instances in ip-172-31-47-249.ec2.internal.
If this is incorrect, rerun with number-processes-per-node=X
miniqmc not built from git repository
number of ranks : 1, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 1
OpenMP threads = 96
Number of walkers per rank = 96
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.0569 0.0569 1 0.056893916
ParticleSet:::update 0.0000 0.0000 1 0.000002200
Total 144.5607 1.0262 1 144.560721103
Diffusion 90.1608 0.0812 5 18.032155144
Complete Updates 0.9230 0.0000 5 0.184604399
DeterminantRef::update 0.9230 0.9230 10 0.092298029
Current Gradient 4.3271 0.0950 30720 0.000140856
DeterminantRef::ratio 4.1946 4.1946 30720 0.000136544
OneBodyJastrowRef 0.0225 0.0225 30720 0.000000731
TwoBodyJastrowRef 0.0150 0.0150 30720 0.000000488
Kinetic Energy 0.9104 0.9095 5 0.182086242
OneBodyJastrowRef 0.0005 0.0005 5 0.000101462
TwoBodyJastrowRef 0.0004 0.0004 5 0.000087238
New Gradient 13.3974 0.0980 30720 0.000436115
DeterminantRef::ratio 0.0989 0.0989 30720 0.000003219
DeterminantRef::spovgl 12.0078 0.2997 30720 0.000390878
Single-Particle Orbitals 11.7080 11.7080 30720 0.000381121
OneBodyJastrowRef 0.1718 0.1718 30720 0.000005594
TwoBodyJastrowRef 1.0210 1.0210 30720 0.000033236
ParticleSet:::acceptMove 12.9641 0.0426 15371 0.000843413
DTAAOMPTarget::update_e_e 12.8428 12.8428 15371 0.000835521
DTABOMPTarget::update_ion_e 0.0787 0.0787 15371 0.000005119
ParticleSet:::computeNewPosDT 2.4058 0.0486 30720 0.000078314
DTAAOMPTarget::move_e_e 2.1437 2.1437 30720 0.000069783
DTABOMPTarget::move_ion_e 0.2135 0.2135 30720 0.000006948
ParticleSet:::donePbyP 0.0000 0.0000 5 0.000001840
Update 55.1516 0.0408 15371 0.003588030
DeterminantRef::update 53.5275 53.5275 15371 0.003482373
OneBodyJastrowRef 0.0062 0.0062 15371 0.000000406
TwoBodyJastrowRef 1.5770 1.5770 15371 0.000102597
Initialization 11.2197 5.7556 1 11.219655124
DeterminantRef::inverse 2.7166 2.7166 2 1.358282035
DeterminantRef::spovgl 2.2558 0.0555 2 1.127908237
Single-Particle Orbitals 2.2004 2.2004 6144 0.000358131
OneBodyJastrowRef 0.0153 0.0153 1 0.015337988
ParticleSet:::update 0.3463 0.2050 2 0.173157316
DTAAOMPTarget::evaluate_e_e 0.1243 0.1243 1 0.124268800
DTABOMPTarget::evaluate_ion_e 0.0170 0.0004 1 0.017016050
DTABOMPTarget::offload_ion_e 0.0166 0.0166 1 0.016591899
TwoBodyJastrowRef 0.1300 0.1300 1 0.130024580
Pseudopotential 42.1541 0.2347 5 8.430810516
DeterminantRef::spoval 30.7197 0.4103 10215 0.003007310
Single-Particle Orbitals 30.3093 30.3093 122580 0.000247262
OneBodyJastrowRef 0.0959 0.0959 10215 0.000009386
ParticleSet:::update 8.1482 0.0350 10215 0.000797673
DTABOMPTarget::evaluate_e_virtual 7.2787 0.0178 10215 0.000712550
DTABOMPTarget::offload_e_virtual 7.2609 7.2609 10215 0.000710812
DTABOMPTarget::evaluate_ion_virtual 0.8346 0.0173 10215 0.000081700
DTABOMPTarget::offload_ion_virtual 0.8173 0.8173 10215 0.000080006
TwoBodyJastrowRef 2.9555 2.9555 10215 0.000289334
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 1.54019e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 2.46949e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 8.59675e+07
Your experiment path is /home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1741358813/tools/lprof_npsu_run_0
To display your profiling results:
####################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
####################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1741358813/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1741358813/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1741358813/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1741358813/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1741358813/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1741358813/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1741358813/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/hbollore/qaas/qaas-runs/174-135-6342/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1741358813/tools/lprof_npsu_run_0 #
####################################################################################################################################################################################################