Executable Output


* Info: Detected 1 Lprof instances in skylake: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting

* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 645735)
miniqmc not built from git repository

number of ranks : 1, number of accelerators : 0
Number of orbitals/splines = 384
Tile size = 384
Number of tiles = 1
Number of electrons = 768
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 1
OpenMP threads = 1
Number of walkers per rank = 1

SPO coefficients size = 196608000 bytes (187.5 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 
Stack timer profile
Timer                             Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                0.1018     0.1018              1       0.101776123
Total                                0.6550     0.0001              1       0.655049086
  Diffusion                          0.2426     0.0053              5       0.048517799
    Accept move                      0.0015     0.0015           1913       0.000000763
    Complete Updates                 0.0022     0.0000              5       0.000437403
      DeterminantRef::update         0.0022     0.0022             10       0.000218225
    Current Gradient                 0.0183     0.0019           3840       0.000004754
      DeterminantRef::ratio          0.0153     0.0153           3840       0.000003982
      OneBodyJastrowRef              0.0006     0.0006           3840       0.000000161
      TwoBodyJastrowRef              0.0004     0.0004           3840       0.000000114
    Kinetic Energy                   0.0053     0.0053              5       0.001064825
      OneBodyJastrowRef              0.0000     0.0000              5       0.000005436
      TwoBodyJastrowRef              0.0000     0.0000              5       0.000003624
    Make move                        0.0174     0.0174           3840       0.000004541
    New Gradient                     0.1324     0.0023           3840       0.000034491
      DeterminantRef::ratio          0.0037     0.0037           3840       0.000000973
      DeterminantRef::spovgl         0.1147     0.0076           3840       0.000029862
        Single-Particle Orbitals     0.1071     0.1071           3840       0.000027894
      OneBodyJastrowRef              0.0019     0.0019           3840       0.000000484
      TwoBodyJastrowRef              0.0099     0.0099           3840       0.000002577
    Set active                       0.0190     0.0190           3840       0.000004953
    Update                           0.0412     0.0012           1913       0.000021536
      DeterminantRef::update         0.0311     0.0311           1913       0.000016256
      OneBodyJastrowRef              0.0003     0.0003           1913       0.000000180
      TwoBodyJastrowRef              0.0086     0.0086           1913       0.000004491
  Initialization                     0.0719     0.0285              1       0.071857929
    DeterminantRef::inverse          0.0127     0.0127              2       0.006327510
    DeterminantRef::spovgl           0.0282     0.0022              2       0.014075041
      Single-Particle Orbitals       0.0259     0.0259            768       0.000033737
    OneBodyJastrowRef                0.0003     0.0003              1       0.000308990
    TwoBodyJastrowRef                0.0023     0.0023              1       0.002251148
  Pseudopotential                    0.3405     0.0057              5       0.068096447
    Make move                        0.0720     0.0720          15792       0.000004558
    Value                            0.2628     0.0095          15792       0.000016639
      DeterminantRef::ratio          0.0036     0.0036          15792       0.000000227
      DeterminantRef::spoval         0.2302     0.0045          15792       0.000014577
        Single-Particle Orbitals     0.2257     0.2257          15792       0.000014295
      OneBodyJastrowRef              0.0043     0.0043          15792       0.000000273
      TwoBodyJastrowRef              0.0151     0.0151          15792       0.000000959

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 6.91528e+08
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 1.86729e+09
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.73232e+06


* Info: Process finished (host skylake, process 645735)

Your experiment path is /home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0

To display your profiling results:
##############################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                           COMMAND                                                                           #
##############################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0  #
##############################################################################################################################################################################################
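
The throughput figures in the log above can be cross-checked from the other values it prints. The sketch below is an illustration, not part of miniqmc or MAQAO. It assumes that each phase's inclusive time can be recovered as `Calls * Time_per_call` from the stack timer profile, which gives more digits than the rounded `Inclusive_time` column, and it applies the formulas exactly as written on the throughput lines. All variable names are hypothetical.

```python
# Values copied from the log above.
n_walkers = 1          # "Number of walkers per rank = 1"
n_elec = 768           # "Number of electrons = 768"

# Inclusive times in seconds, rebuilt as Calls * Time_per_call
# (assumption: this product matches the rounded Inclusive_time column).
total_time = 1 * 0.655049086
diffusion_time = 5 * 0.048517799
pseudopotential_time = 5 * 0.068096447

# Formulas as printed on the throughput lines of the log.
total_throughput = n_walkers * n_elec**3 / total_time
diffusion_throughput = n_walkers * n_elec**3 / diffusion_time
pseudopotential_throughput = n_walkers * n_elec**2 / pseudopotential_time

print(f"Total throughput           = {total_throughput:.5e}")
print(f"Diffusion throughput       = {diffusion_throughput:.5e}")
print(f"Pseudopotential throughput = {pseudopotential_throughput:.5e}")
```

The three results should agree with the logged values (6.91528e+08, 1.86729e+09 and 1.73232e+06) to within rounding of the timer output.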
