
Executable Output


* Info: Detected 2 Lprof instances in o404: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting

* Info: Selecting the 'perf-high-ppn' engine for node o404

* Info: Process launched (host o404, process 426573)
* Info: Process launched (host o404, process 426574)
miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 56
Number of walkers per rank = 56

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.1363     0.1362              1       0.136251656
  ParticleSet:::update                         0.0000     0.0000              1       0.000003503
Total                                         86.4514     0.0094              1      86.451412598
  Diffusion                                   44.6336     0.0446              5       8.926711205
    Complete Updates                           0.3400     0.0000              5       0.067996866
      DeterminantRef::update                   0.3400     0.3400             10       0.033996194
    Current Gradient                           2.4576     0.0367          30720       0.000079999
      DeterminantRef::ratio                    2.3994     2.3994          30720       0.000078105
      OneBodyJastrowRef                        0.0135     0.0135          30720       0.000000440
      TwoBodyJastrowRef                        0.0080     0.0080          30720       0.000000260
    Kinetic Energy                             0.6213     0.6209              5       0.124254079
      OneBodyJastrowRef                        0.0003     0.0003              5       0.000054577
      TwoBodyJastrowRef                        0.0001     0.0001              5       0.000029251
    New Gradient                              11.5775     0.0396          30720       0.000376871
      DeterminantRef::ratio                    0.3635     0.3635          30720       0.000011834
      DeterminantRef::spovgl                   9.8372     0.6692          30720       0.000320221
        Single-Particle Orbitals               9.1680     9.1680          30720       0.000298439
      OneBodyJastrowRef                        0.1386     0.1386          30720       0.000004510
      TwoBodyJastrowRef                        1.1986     1.1986          30720       0.000039018
    ParticleSet:::acceptMove                   4.5240     0.0239          15371       0.000294322
      DTAAOMPTarget::update_e_e                4.4496     4.4496          15371       0.000289480
      DTABOMPTarget::update_ion_e              0.0505     0.0505          15371       0.000003284
    ParticleSet:::computeNewPosDT              1.6108     0.0282          30720       0.000052436
      DTAAOMPTarget::move_e_e                  1.4279     1.4279          30720       0.000046482
      DTABOMPTarget::move_ion_e                0.1547     0.1547          30720       0.000005036
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000002298
    Update                                    23.4578     0.0200          15371       0.001526105
      DeterminantRef::update                  22.2183    22.2183          15371       0.001445468
      OneBodyJastrowRef                        0.0044     0.0044          15371       0.000000288
      TwoBodyJastrowRef                        1.2151     1.2151          15371       0.000079050
  Initialization                               8.2188     4.9207              1       8.218784539
    DeterminantRef::inverse                    1.2389     1.2389              2       0.619427970
    DeterminantRef::spovgl                     1.6582     0.1281              2       0.829105732
      Single-Particle Orbitals                 1.5301     1.5301           6144       0.000249034
    OneBodyJastrowRef                          0.0185     0.0185              1       0.018486082
    ParticleSet:::update                       0.2471     0.0732              2       0.123559225
      DTAAOMPTarget::evaluate_e_e              0.1404     0.1404              1       0.140387519
      DTABOMPTarget::evaluate_ion_e            0.0336     0.0001              1       0.033558400
        DTABOMPTarget::offload_ion_e           0.0334     0.0334              1       0.033448171
    TwoBodyJastrowRef                          0.1355     0.1355              1       0.135456133
  Pseudopotential                             33.5897     0.1087              5       6.717938853
    DeterminantRef::spoval                    24.1937     0.5641          10215       0.002368444
      Single-Particle Orbitals                23.6296    23.6296         122580       0.000192768
    OneBodyJastrowRef                          0.0626     0.0626          10215       0.000006127
    ParticleSet:::update                       7.9151     0.0246          10215       0.000774850
      DTABOMPTarget::evaluate_e_virtual        7.2619     0.0114          10215       0.000710903
        DTABOMPTarget::offload_e_virtual       7.2504     7.2504          10215       0.000709784
      DTABOMPTarget::evaluate_ion_virtual      0.6286     0.0084          10215       0.000061536
        DTABOMPTarget::offload_ion_virtual     0.6202     0.6202          10215       0.000060714
    TwoBodyJastrowRef                          1.3097     1.3097          10215       0.000128212
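The stack timer profile above nests timers by two-space indentation and reports inclusive and exclusive time for each. A minimal sketch of how such rows could be parsed and ranked by exclusive time follows. The regular expression, the helper names `parse_rows` and `exclusive_by_name`, and the choice to sum exclusive time across timers with the same name are assumptions made for illustration; none of this is part of miniqmc or MAQAO. Only a few rows are copied into `SAMPLE`, so the ranking reflects that subset rather than the full table.

```python
import re
from collections import defaultdict

# A few rows copied from the stack timer profile above (not the full table).
SAMPLE = """\
Total                                         86.4514     0.0094              1      86.451412598
  Diffusion                                   44.6336     0.0446              5       8.926711205
    New Gradient                              11.5775     0.0396          30720       0.000376871
      DeterminantRef::spovgl                   9.8372     0.6692          30720       0.000320221
        Single-Particle Orbitals               9.1680     9.1680          30720       0.000298439
    Update                                    23.4578     0.0200          15371       0.001526105
      DeterminantRef::update                  22.2183    22.2183          15371       0.001445468
  Pseudopotential                             33.5897     0.1087              5       6.717938853
    DeterminantRef::spoval                    24.1937     0.5641          10215       0.002368444
      Single-Particle Orbitals                23.6296    23.6296         122580       0.000192768
"""

# Hypothetical row pattern: indent, timer name (may contain spaces),
# inclusive time, exclusive time, call count, time per call.
ROW = re.compile(r"^(\s*)(.+?)\s+([\d.]+)\s+([\d.]+)\s+(\d+)\s+([\d.]+)\s*$")


def parse_rows(text):
    """Return one dict per timer row; lines that do not match are skipped."""
    rows = []
    for line in text.splitlines():
        match = ROW.match(line)
        if match is None:
            continue
        indent, name, inclusive, exclusive, calls, _per_call = match.groups()
        rows.append({
            "depth": len(indent) // 2,  # two spaces per nesting level
            "name": name,
            "inclusive": float(inclusive),
            "exclusive": float(exclusive),
            "calls": int(calls),
        })
    return rows


def exclusive_by_name(rows):
    """Sum exclusive time per timer name, largest first."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["name"]] += row["exclusive"]
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for name, seconds in exclusive_by_name(parse_rows(SAMPLE)):
        print(f"{name:<28s} {seconds:10.4f}")
```

Summing by name merges the same kernel reached through different call paths (for example `Single-Particle Orbitals` under both Diffusion and Pseudopotential), which is convenient for spotting hot kernels but hides which phase the time was spent in. Keeping the `depth` field allows the tree to be rebuilt if per-phase detail is needed.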

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 3.00469e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 5.81983e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.25868e+08
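The throughput figures above can be cross-checked against the run parameters and the inclusive times in the timer profile. The sketch below assumes that `N_walkers` in the formulas means the walker count summed over all MPI ranks (2 ranks x 56 walkers per rank = 112). That interpretation is an inference, not something the log states, but it is the one under which the recomputed values agree with the reported ones to within rounding of the four-decimal times. The function name `throughput` is hypothetical.

```python
# Values copied from the run summary and the stack timer profile above.
RANKS = 2
WALKERS_PER_RANK = 56
N_ELEC = 6144
TOTAL_TIME = 86.4514            # inclusive time of "Total", seconds
DIFFUSION_TIME = 44.6336        # inclusive time of "Diffusion", seconds
PSEUDOPOTENTIAL_TIME = 33.5897  # inclusive time of "Pseudopotential", seconds

# Assumption: N_walkers counts walkers across all ranks, not per rank.
N_WALKERS = RANKS * WALKERS_PER_RANK


def throughput(n_walkers, n_elec, power, seconds):
    """N_walkers * N_elec^power / time, as in the formulas printed by the run."""
    return n_walkers * n_elec ** power / seconds


total_tp = throughput(N_WALKERS, N_ELEC, 3, TOTAL_TIME)
diffusion_tp = throughput(N_WALKERS, N_ELEC, 3, DIFFUSION_TIME)
pseudopotential_tp = throughput(N_WALKERS, N_ELEC, 2, PSEUDOPOTENTIAL_TIME)

if __name__ == "__main__":
    print(f"Total throughput           = {total_tp:.5e}")
    print(f"Diffusion throughput       = {diffusion_tp:.5e}")
    print(f"Pseudopotential throughput = {pseudopotential_tp:.5e}")
```

Note that the pseudopotential figure scales with `N_elec^2` while the other two scale with `N_elec^3`, so the three numbers are not directly comparable with each other.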


* Info: Process finished (host o404, process 426573)
* Info: Process finished (host o404, process 426574)

Info: 1/2 lprof instances finished


Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-417-3180/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1714173782/tools/lprof_npsu_run_0

To display your profiling results:
############################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                                  COMMAND                                                                                  #
############################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-417-3180/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1714173782/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-417-3180/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1714173782/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-417-3180/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1714173782/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-417-3180/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1714173782/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-417-3180/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1714173782/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-417-3180/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1714173782/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-417-3180/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1714173782/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-417-3180/intel/miniqmc/run/oneview_runs/defaults/orig/oneview_results_1714173782/tools/lprof_npsu_run_0  #
############################################################################################################################################################################################################
