Executable Output


* Info: Detected 2 Lprof instances in o405: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting

* Info: Selecting the 'perf-high-ppn' engine for node o405

* Info: Process launched (host o405, process 152355)
* Info: Process launched (host o405, process 152356)
miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 56
Number of walkers per rank = 56

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.1418     0.1418              1       0.141775197
  ParticleSet:::update                         0.0000     0.0000              1       0.000005836
Total                                        118.9431     0.0237              1     118.943057722
  Diffusion                                   68.1321     0.0678              5      13.626428428
    Complete Updates                           0.3954     0.0000              5       0.079078291
      DeterminantRef::update                   0.3954     0.3954             10       0.039535657
    Current Gradient                           3.0235     0.0405          30720       0.000098420
      DeterminantRef::ratio                    2.9543     2.9543          30720       0.000096168
      OneBodyJastrowRef                        0.0176     0.0176          30720       0.000000573
      TwoBodyJastrowRef                        0.0111     0.0111          30720       0.000000361
    Kinetic Energy                             0.6076     0.6071              5       0.121514398
      OneBodyJastrowRef                        0.0003     0.0003              5       0.000053650
      TwoBodyJastrowRef                        0.0002     0.0002              5       0.000033864
    New Gradient                              20.4691     0.0476          30720       0.000666311
      DeterminantRef::ratio                    0.3774     0.3774          30720       0.000012285
      DeterminantRef::spovgl                  18.3546     0.8060          30720       0.000597480
        Single-Particle Orbitals              17.5485    17.5485          30720       0.000571241
      OneBodyJastrowRef                        0.1789     0.1789          30720       0.000005822
      TwoBodyJastrowRef                        1.5107     1.5107          30720       0.000049175
    ParticleSet:::acceptMove                   7.6591     0.0372          15371       0.000498282
      DTAAOMPTarget::update_e_e                7.5338     7.5338          15371       0.000490133
      DTABOMPTarget::update_ion_e              0.0881     0.0881          15371       0.000005729
    ParticleSet:::computeNewPosDT              1.9571     0.0364          30720       0.000063708
      DTAAOMPTarget::move_e_e                  1.7202     1.7202          30720       0.000055995
      DTABOMPTarget::move_ion_e                0.2005     0.2005          30720       0.000006527
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000002985
    Update                                    33.9526     0.0307          15371       0.002208874
      DeterminantRef::update                  31.8950    31.8950          15371       0.002075009
      OneBodyJastrowRef                        0.0051     0.0051          15371       0.000000334
      TwoBodyJastrowRef                        2.0219     2.0219          15371       0.000131537
  Initialization                              10.3918     4.9064              1      10.391806799
    DeterminantRef::inverse                    2.0350     2.0350              2       1.017503817
    DeterminantRef::spovgl                     2.7622     0.1382              2       1.381079981
      Single-Particle Orbitals                 2.6239     2.6239           6144       0.000427075
    OneBodyJastrowRef                          0.0159     0.0159              1       0.015932371
    ParticleSet:::update                       0.5442     0.0886              2       0.272102536
      DTAAOMPTarget::evaluate_e_e              0.4201     0.4201              1       0.420077375
      DTABOMPTarget::evaluate_ion_e            0.0356     0.0001              1       0.035564214
        DTABOMPTarget::offload_ion_e           0.0354     0.0354              1       0.035443174
    TwoBodyJastrowRef                          0.1281     0.1281              1       0.128098072
  Pseudopotential                             40.3954     0.1454              5       8.079084304
    DeterminantRef::spoval                    28.9537     0.7285          10215       0.002834434
      Single-Particle Orbitals                28.2252    28.2252         122580       0.000230259
    OneBodyJastrowRef                          0.0919     0.0919          10215       0.000008993
    ParticleSet:::update                       9.0609     0.0309          10215       0.000887014
      DTABOMPTarget::evaluate_e_virtual        8.2632     0.0131          10215       0.000808931
        DTABOMPTarget::offload_e_virtual       8.2501     8.2501          10215       0.000807645
      DTABOMPTarget::evaluate_ion_virtual      0.7667     0.0103          10215       0.000075058
        DTABOMPTarget::offload_ion_virtual     0.7564     0.7564          10215       0.000074053
    TwoBodyJastrowRef                          2.1436     2.1436          10215       0.000209845

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 2.1839e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 3.81259e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.04662e+08
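
The throughput figures above can be reproduced from values printed earlier in this output. The following is a minimal sketch, not miniQMC code. It assumes that N_walkers is the total over both MPI ranks (2 x 56 = 112), which is inferred from the reported numbers rather than stated in the log, and that the time terms are the Inclusive_time entries for Total, Diffusion and Pseudopotential. All variable names are illustrative.

```python
# Values copied from the run output above.
n_ranks = 2
walkers_per_rank = 56
n_elec = 6144

# Assumption: N_walkers counts walkers across all MPI ranks.
n_walkers = n_ranks * walkers_per_rank

# Inclusive times in seconds, taken from the stack timer profile.
total_time = 118.9431
diffusion_time = 68.1321
pseudopotential_time = 40.3954

# Formulas as printed in the Throughput section.
total_throughput = n_walkers * n_elec**3 / total_time
diffusion_throughput = n_walkers * n_elec**3 / diffusion_time
pseudopotential_throughput = n_walkers * n_elec**2 / pseudopotential_time

print(f"Total throughput           = {total_throughput:.4e}")
print(f"Diffusion throughput       = {diffusion_throughput:.5e}")
print(f"Pseudopotential throughput = {pseudopotential_throughput:.5e}")
```

With these inputs the computed values agree with the reported ones to within rounding of the printed times. Using 56 walkers instead of 112 would give half the reported throughput.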


* Info: Process finished (host o405, process 152355)
* Info: Process finished (host o405, process 152356)

Info: 1/2 lprof instances finished


Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-417-8059/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_1714192808/tools/lprof_npsu_run_0

To display your profiling results:
##############################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                                   COMMAND                                                                                   #
##############################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-417-8059/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_1714192808/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-417-8059/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_1714192808/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-417-8059/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_1714192808/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-417-8059/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_1714192808/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-417-8059/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_1714192808/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-417-8059/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_1714192808/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-417-8059/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_1714192808/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-417-8059/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_1714192808/tools/lprof_npsu_run_0  #
##############################################################################################################################################################################################################
