options

Executable Output


* Info: Detected 1 Lprof instances in ip-172-31-42-13: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting

* Info: Process launched (host ip-172-31-42-13, process 764995)miniqmc not built from git repository

number of ranks : 1, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 1
OpenMP threads = 64
Number of walkers per rank = 64

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.0789     0.0789              1       0.078940874
  ParticleSet:::update                         0.0000     0.0000              1       0.000002384
Total                                        178.4467    21.1282              1     178.446692057
  Diffusion                                  104.0492     0.0798              5      20.809844055
    Complete Updates                           1.2251     0.0000              5       0.245016112
      DeterminantRef::update                   1.2250     1.2250             10       0.122503453
    Current Gradient                           5.1349     0.0923          30720       0.000167151
      DeterminantRef::ratio                    4.9765     4.9765          30720       0.000161997
      OneBodyJastrowRef                        0.0366     0.0366          30720       0.000001191
      TwoBodyJastrowRef                        0.0295     0.0295          30720       0.000000960
    Kinetic Energy                             0.8950     0.8941              5       0.178996669
      OneBodyJastrowRef                        0.0005     0.0005              5       0.000099057
      TwoBodyJastrowRef                        0.0004     0.0004              5       0.000080660
    New Gradient                              16.4956     0.0881          30720       0.000536966
      DeterminantRef::ratio                    0.1732     0.1732          30720       0.000005637
      DeterminantRef::spovgl                  14.7365     0.2544          30720       0.000479703
        Single-Particle Orbitals              14.4821    14.4821          30720       0.000471422
      OneBodyJastrowRef                        0.2063     0.2063          30720       0.000006716
      TwoBodyJastrowRef                        1.2915     1.2915          30720       0.000042042
    ParticleSet:::acceptMove                  13.8848     0.0525          15371       0.000903314
      DTAAOMPTarget::update_e_e               13.7522    13.7522          15371       0.000894682
      DTABOMPTarget::update_ion_e              0.0802     0.0802          15371       0.000005219
    ParticleSet:::computeNewPosDT              2.4122     0.0609          30720       0.000078523
      DTAAOMPTarget::move_e_e                  2.1284     2.1284          30720       0.000069284
      DTABOMPTarget::move_ion_e                0.2230     0.2230          30720       0.000007258
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000000421
    Update                                    63.9218     0.0356          15371       0.004158595
      DeterminantRef::update                  62.1253    62.1253          15371       0.004041720
      OneBodyJastrowRef                        0.0127     0.0127          15371       0.000000825
      TwoBodyJastrowRef                        1.7482     1.7482          15371       0.000113736
  Initialization                              11.8804     5.8924              1      11.880431322
    DeterminantRef::inverse                    2.6191     2.6191              2       1.309527398
    DeterminantRef::spovgl                     3.0597     0.0443              2       1.529856550
      Single-Particle Orbitals                 3.0154     3.0154           6144       0.000490791
    OneBodyJastrowRef                          0.0060     0.0060              1       0.005974795
    ParticleSet:::update                       0.2091     0.0556              2       0.104566214
      DTAAOMPTarget::evaluate_e_e              0.1224     0.1224              1       0.122411661
      DTABOMPTarget::evaluate_ion_e            0.0311     0.0001              1       0.031141467
        DTABOMPTarget::offload_ion_e           0.0310     0.0310              1       0.031026085
    TwoBodyJastrowRef                          0.0941     0.0941              1       0.094113789
  Pseudopotential                             41.3889     0.1984              5       8.277772242
    DeterminantRef::spoval                    31.9150     0.6582          10215       0.003124325
      Single-Particle Orbitals                31.2568    31.2568         122580       0.000254991
    OneBodyJastrowRef                          0.0989     0.0989          10215       0.000009684
    ParticleSet:::update                       7.2696     0.0347          10215       0.000711663
      DTABOMPTarget::evaluate_e_virtual        6.5186     0.0145          10215       0.000638141
        DTABOMPTarget::offload_e_virtual       6.5041     6.5041          10215       0.000636720
      DTABOMPTarget::evaluate_ion_virtual      0.7163     0.0116          10215       0.000070121
        DTABOMPTarget::offload_ion_virtual     0.7047     0.7047          10215       0.000068990
    TwoBodyJastrowRef                          1.9070     1.9070          10215       0.000186681

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 8.31812e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 1.42658e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 5.83712e+07




Your experiment path is /home/hbollore/qaas-runs/171-275-7410/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1712759427/tools/lprof_npsu_run_0

To display your profiling results:
################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                            COMMAND                                                                            #
################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/qaas-runs/171-275-7410/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1712759427/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/qaas-runs/171-275-7410/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1712759427/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/qaas-runs/171-275-7410/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1712759427/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/qaas-runs/171-275-7410/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1712759427/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/qaas-runs/171-275-7410/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1712759427/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/qaas-runs/171-275-7410/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1712759427/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/qaas-runs/171-275-7410/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1712759427/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/qaas-runs/171-275-7410/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1712759427/tools/lprof_npsu_run_0  #
################################################################################################################################################################################################

×