Executable Output


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 56981)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 56986)
miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 1
Number of walkers per rank = 1

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.1216     0.1216              1       0.121563821
  ParticleSet:::update                         0.0000     0.0000              1       0.000003488
Total                                         41.0901     0.0002              1      41.090138003
  Diffusion                                   21.3957     0.0310              5       4.279149120
    Complete Updates                           0.1687     0.0000              5       0.033747691
      DeterminantRef::update                   0.1687     0.1687             10       0.016872364
    Current Gradient                           1.1224     0.0300          30720       0.000036535
      DeterminantRef::ratio                    1.0766     1.0766          30720       0.000035046
      OneBodyJastrowRef                        0.0086     0.0086          30720       0.000000279
      TwoBodyJastrowRef                        0.0072     0.0072          30720       0.000000235
    Kinetic Energy                             0.2910     0.2908              5       0.058204901
      OneBodyJastrowRef                        0.0001     0.0001              5       0.000028718
      TwoBodyJastrowRef                        0.0001     0.0001              5       0.000017759
    New Gradient                               5.3153     0.0452          30720       0.000173024
      DeterminantRef::ratio                    0.1623     0.1623          30720       0.000005284
      DeterminantRef::spovgl                   4.4317     0.2651          30720       0.000144260
        Single-Particle Orbitals               4.1666     4.1666          30720       0.000135631
      OneBodyJastrowRef                        0.1085     0.1085          30720       0.000003533
      TwoBodyJastrowRef                        0.5675     0.5675          30720       0.000018475
    ParticleSet:::acceptMove                   1.7243     0.0167          15371       0.000112178
      DTAAOMPTarget::update_e_e                1.6892     1.6892          15371       0.000109897
      DTABOMPTarget::update_ion_e              0.0184     0.0184          15371       0.000001197
    ParticleSet:::computeNewPosDT              0.7248     0.0222          30720       0.000023595
      DTAAOMPTarget::move_e_e                  0.6224     0.6224          30720       0.000020259
      DTABOMPTarget::move_ion_e                0.0803     0.0803          30720       0.000002614
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000002534
    Update                                    12.0182     0.0180          15371       0.000781876
      DeterminantRef::update                  11.4449    11.4449          15371       0.000744580
      OneBodyJastrowRef                        0.0035     0.0035          15371       0.000000229
      TwoBodyJastrowRef                        0.5518     0.5518          15371       0.000035896
  Initialization                               1.9651     0.1664              1       1.965094128
    DeterminantRef::inverse                    0.7916     0.7916              2       0.395802906
    DeterminantRef::spovgl                     0.8355     0.0602              2       0.417732546
      Single-Particle Orbitals                 0.7753     0.7753           6144       0.000126186
    OneBodyJastrowRef                          0.0084     0.0084              1       0.008350388
    ParticleSet:::update                       0.0619     0.0074              2       0.030927490
      DTAAOMPTarget::evaluate_e_e              0.0389     0.0389              1       0.038885437
      DTABOMPTarget::evaluate_ion_e            0.0155     0.0001              1       0.015531206
        DTABOMPTarget::offload_ion_e           0.0155     0.0155              1       0.015480491
    TwoBodyJastrowRef                          0.1014     0.1014              1       0.101421842
  Pseudopotential                             17.7291     0.0447              5       3.545818692
    DeterminantRef::spoval                    13.1126     0.2675          10215       0.001283662
      Single-Particle Orbitals                12.8452    12.8452         122580       0.000104790
    OneBodyJastrowRef                          0.0198     0.0198          10215       0.000001938
    ParticleSet:::update                       4.1142     0.0104          10215       0.000402764
      DTABOMPTarget::evaluate_e_virtual        3.7733     0.0051          10215       0.000369391
        DTABOMPTarget::offload_e_virtual       3.7682     3.7682          10215       0.000368889
      DTABOMPTarget::evaluate_ion_virtual      0.3305     0.0051          10215       0.000032354
        DTABOMPTarget::offload_ion_virtual     0.3254     0.3254          10215       0.000031859
    TwoBodyJastrowRef                          0.4377     0.4377          10215       0.000042850

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 1.12888e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 2.16798e+10
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 4.25839e+06
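
Note: the three throughput figures above can be re-derived from the timer table. The following is a minimal Python sketch, not part of the miniqmc output. It assumes that N_walkers is the total over all MPI ranks (1 walker per rank * 2 ranks = 2); that is inferred from the printed numbers and is not stated in the log. The helper name `throughput` is hypothetical.

```python
# Re-derive the throughput lines of this run from values printed in the log.
# Assumption (inferred, not stated by miniqmc): N_walkers counts walkers
# across all MPI ranks, i.e. walkers per rank * MPI processes.

def throughput(n_walkers: int, n_elec: int, power: int, seconds: float) -> float:
    """Return n_walkers * n_elec**power / seconds."""
    return n_walkers * n_elec ** power / seconds

n_walkers = 1 * 2   # "Number of walkers per rank" * "MPI processes"
n_elec = 6144       # "Number of electrons"

total = throughput(n_walkers, n_elec, 3, 41.090138003)  # Total, Time_per_call
diffusion = throughput(n_walkers, n_elec, 3, 21.3957)   # Diffusion, Inclusive_time
pseudo = throughput(n_walkers, n_elec, 2, 17.7291)      # Pseudopotential, Inclusive_time

print(f"total={total:.5e} diffusion={diffusion:.5e} pseudopotential={pseudo:.5e}")
```

Because the timer table rounds the inclusive times to four decimals, the recomputed values agree with the printed ones only to about five significant digits.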


* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 56981)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 56986)

Info: 1/2 lprof instances finished


Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0

To display your profiling results:
###################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                             COMMAND                                                                              #
###################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0  #
###################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57031)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57036)
miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 2
Number of walkers per rank = 2

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.0630     0.0630              1       0.062997826
  ParticleSet:::update                         0.0000     0.0000              1       0.000003593
Total                                         42.4384     0.0008              1      42.438362982
  Diffusion                                   22.1619     0.0300              5       4.432383870
    Complete Updates                           0.1720     0.0000              5       0.034395042
      DeterminantRef::update                   0.1720     0.1720             10       0.017195653
    Current Gradient                           1.1230     0.0297          30720       0.000036555
      DeterminantRef::ratio                    1.0792     1.0792          30720       0.000035129
      OneBodyJastrowRef                        0.0079     0.0079          30720       0.000000259
      TwoBodyJastrowRef                        0.0061     0.0061          30720       0.000000200
    Kinetic Energy                             0.2915     0.2912              5       0.058295246
      OneBodyJastrowRef                        0.0002     0.0002              5       0.000031439
      TwoBodyJastrowRef                        0.0001     0.0001              5       0.000017312
    New Gradient                               6.2191     0.0418          30720       0.000202444
      DeterminantRef::ratio                    0.1608     0.1608          30720       0.000005235
      DeterminantRef::spovgl                   5.3515     0.2578          30720       0.000174202
        Single-Particle Orbitals               5.0937     5.0937          30720       0.000165811
      OneBodyJastrowRef                        0.1031     0.1031          30720       0.000003355
      TwoBodyJastrowRef                        0.5620     0.5620          30720       0.000018293
    ParticleSet:::acceptMove                   1.7077     0.0170          15371       0.000111096
      DTAAOMPTarget::update_e_e                1.6718     1.6718          15371       0.000108760
      DTABOMPTarget::update_ion_e              0.0189     0.0189          15371       0.000001231
    ParticleSet:::computeNewPosDT              0.6753     0.0200          30720       0.000021981
      DTAAOMPTarget::move_e_e                  0.5847     0.5847          30720       0.000019035
      DTABOMPTarget::move_ion_e                0.0705     0.0705          30720       0.000002295
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000001800
    Update                                    11.9435     0.0168          15371       0.000777015
      DeterminantRef::update                  11.3758    11.3758          15371       0.000740085
      OneBodyJastrowRef                        0.0035     0.0035          15371       0.000000227
      TwoBodyJastrowRef                        0.5474     0.5474          15371       0.000035613
  Initialization                               2.2090     0.1656              1       2.208973781
    DeterminantRef::inverse                    0.7729     0.7729              2       0.386457514
    DeterminantRef::spovgl                     1.0873     0.0759              2       0.543631869
      Single-Particle Orbitals                 1.0113     1.0113           6144       0.000164607
    OneBodyJastrowRef                          0.0083     0.0083              1       0.008324765
    ParticleSet:::update                       0.0734     0.0074              2       0.036699550
      DTAAOMPTarget::evaluate_e_e              0.0503     0.0503              1       0.050321382
      DTABOMPTarget::evaluate_ion_e            0.0156     0.0000              1       0.015643469
        DTABOMPTarget::offload_ion_e           0.0156     0.0156              1       0.015594035
    TwoBodyJastrowRef                          0.1015     0.1015              1       0.101457980
  Pseudopotential                             18.0667     0.0429              5       3.613339434
    DeterminantRef::spoval                    13.4404     0.2459          10215       0.001315755
      Single-Particle Orbitals                13.1945    13.1945         122580       0.000107640
    OneBodyJastrowRef                          0.0183     0.0183          10215       0.000001787
    ParticleSet:::update                       4.1292     0.0094          10215       0.000404226
      DTABOMPTarget::evaluate_e_virtual        3.7902     0.0049          10215       0.000371046
        DTABOMPTarget::offload_e_virtual       3.7853     3.7853          10215       0.000370566
      DTABOMPTarget::evaluate_ion_virtual      0.3295     0.0047          10215       0.000032257
        DTABOMPTarget::offload_ion_virtual     0.3248     0.3248          10215       0.000031801
    TwoBodyJastrowRef                          0.4359     0.4359          10215       0.000042676

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 2.18602e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 4.18607e+10
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 8.35764e+06


* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57036)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57031)

Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1

To display your profiling results:
###################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                             COMMAND                                                                              #
###################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1  #
###################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57108)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57113)
miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 4
Number of walkers per rank = 4

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.0357     0.0357              1       0.035687058
  ParticleSet:::update                         0.0000     0.0000              1       0.000003257
Total                                         40.9043     0.0003              1      40.904336385
  Diffusion                                   21.4597     0.0293              5       4.291946918
    Complete Updates                           0.1718     0.0000              5       0.034354547
      DeterminantRef::update                   0.1718     0.1718             10       0.017175555
    Current Gradient                           1.1166     0.0272          30720       0.000036347
      DeterminantRef::ratio                    1.0756     1.0756          30720       0.000035014
      OneBodyJastrowRef                        0.0076     0.0076          30720       0.000000247
      TwoBodyJastrowRef                        0.0062     0.0062          30720       0.000000201
    Kinetic Energy                             0.2917     0.2915              5       0.058346392
      OneBodyJastrowRef                        0.0002     0.0002              5       0.000030081
      TwoBodyJastrowRef                        0.0001     0.0001              5       0.000017607
    New Gradient                               5.4380     0.0412          30720       0.000177017
      DeterminantRef::ratio                    0.1561     0.1561          30720       0.000005082
      DeterminantRef::spovgl                   4.5895     0.2485          30720       0.000149399
        Single-Particle Orbitals               4.3411     4.3411          30720       0.000141311
      OneBodyJastrowRef                        0.0979     0.0979          30720       0.000003188
      TwoBodyJastrowRef                        0.5532     0.5532          30720       0.000018007
    ParticleSet:::acceptMove                   1.7217     0.0155          15371       0.000112007
      DTAAOMPTarget::update_e_e                1.6882     1.6882          15371       0.000109830
      DTABOMPTarget::update_ion_e              0.0180     0.0180          15371       0.000001170
    ParticleSet:::computeNewPosDT              0.6663     0.0192          30720       0.000021690
      DTAAOMPTarget::move_e_e                  0.5797     0.5797          30720       0.000018871
      DTABOMPTarget::move_ion_e                0.0674     0.0674          30720       0.000002194
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000001938
    Update                                    12.0244     0.0171          15371       0.000782281
      DeterminantRef::update                  11.4501    11.4501          15371       0.000744915
      OneBodyJastrowRef                        0.0035     0.0035          15371       0.000000231
      TwoBodyJastrowRef                        0.5537     0.5537          15371       0.000036021
  Initialization                               2.1632     0.2804              1       2.163225937
    DeterminantRef::inverse                    0.7722     0.7722              2       0.386113349
    DeterminantRef::spovgl                     0.9353     0.0783              2       0.467653551
      Single-Particle Orbitals                 0.8570     0.8570           6144       0.000139483
    OneBodyJastrowRef                          0.0084     0.0084              1       0.008379650
    ParticleSet:::update                       0.0639     0.0078              2       0.031934930
      DTAAOMPTarget::evaluate_e_e              0.0404     0.0404              1       0.040396612
      DTABOMPTarget::evaluate_ion_e            0.0156     0.0001              1       0.015631537
        DTABOMPTarget::offload_ion_e           0.0156     0.0156              1       0.015580766
    TwoBodyJastrowRef                          0.1030     0.1030              1       0.103048386
  Pseudopotential                             17.2811     0.0418              5       3.456217333
    DeterminantRef::spoval                    12.6512     0.2525          10215       0.001238497
      Single-Particle Orbitals                12.3988    12.3988         122580       0.000101148
    OneBodyJastrowRef                          0.0180     0.0180          10215       0.000001759
    ParticleSet:::update                       4.1246     0.0093          10215       0.000403778
      DTABOMPTarget::evaluate_e_virtual        3.7841     0.0048          10215       0.000370444
        DTABOMPTarget::offload_e_virtual       3.7793     3.7793          10215       0.000369978
      DTABOMPTarget::evaluate_ion_virtual      0.3312     0.0044          10215       0.000032422
        DTABOMPTarget::offload_ion_virtual     0.3268     0.3268          10215       0.000031991
    TwoBodyJastrowRef                          0.4455     0.4455          10215       0.000043608

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 4.53601e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 8.64608e+10
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.74752e+07


* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57113)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57108)

Info: 1/2 lprof instances finished


Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2

To display your profiling results:
###################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                             COMMAND                                                                              #
###################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2  #
###################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57172)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57177)
miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 8
Number of walkers per rank = 8

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.0350     0.0350              1       0.035020667
  ParticleSet:::update                         0.0000     0.0000              1       0.000003426
Total                                         41.4529     0.1836              1      41.452877377
  Diffusion                                   21.6889     0.0270              5       4.337770257
    Complete Updates                           0.1748     0.0000              5       0.034966078
      DeterminantRef::update                   0.1748     0.1748             10       0.017481214
    Current Gradient                           1.1010     0.0259          30720       0.000035841
      DeterminantRef::ratio                    1.0632     1.0632          30720       0.000034609
      OneBodyJastrowRef                        0.0065     0.0065          30720       0.000000210
      TwoBodyJastrowRef                        0.0055     0.0055          30720       0.000000179
    Kinetic Energy                             0.2922     0.2920              5       0.058440011
      OneBodyJastrowRef                        0.0002     0.0002              5       0.000031158
      TwoBodyJastrowRef                        0.0001     0.0001              5       0.000018451
    New Gradient                               5.3688     0.0370          30720       0.000174766
      DeterminantRef::ratio                    0.1517     0.1517          30720       0.000004937
      DeterminantRef::spovgl                   4.5590     0.2541          30720       0.000148406
        Single-Particle Orbitals               4.3049     4.3049          30720       0.000140134
      OneBodyJastrowRef                        0.0914     0.0914          30720       0.000002974
      TwoBodyJastrowRef                        0.5298     0.5298          30720       0.000017247
    ParticleSet:::acceptMove                   1.6896     0.0153          15371       0.000109922
      DTAAOMPTarget::update_e_e                1.6565     1.6565          15371       0.000107766
      DTABOMPTarget::update_ion_e              0.0178     0.0178          15371       0.000001160
    ParticleSet:::computeNewPosDT              0.6637     0.0181          30720       0.000021606
      DTAAOMPTarget::move_e_e                  0.5777     0.5777          30720       0.000018805
      DTABOMPTarget::move_ion_e                0.0679     0.0679          30720       0.000002210
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000002050
    Update                                    12.3716     0.0167          15371       0.000804868
      DeterminantRef::update                  11.8034    11.8034          15371       0.000767900
      OneBodyJastrowRef                        0.0027     0.0027          15371       0.000000177
      TwoBodyJastrowRef                        0.5488     0.5488          15371       0.000035703
  Initialization                               2.2138     0.3079              1       2.213822780
    DeterminantRef::inverse                    0.8015     0.8015              2       0.400739877
    DeterminantRef::spovgl                     0.9200     0.0731              2       0.459982100
      Single-Particle Orbitals                 0.8469     0.8469           6144       0.000137834
    OneBodyJastrowRef                          0.0084     0.0084              1       0.008358757
    ParticleSet:::update                       0.0735     0.0082              2       0.036747305
      DTAAOMPTarget::evaluate_e_e              0.0496     0.0496              1       0.049635030
      DTABOMPTarget::evaluate_ion_e            0.0157     0.0001              1       0.015690543
        DTABOMPTarget::offload_ion_e           0.0156     0.0156              1       0.015573308
    TwoBodyJastrowRef                          0.1026     0.1026              1       0.102580777
  Pseudopotential                             17.3666     0.0428              5       3.473317265
    DeterminantRef::spoval                    12.6500     0.2384          10215       0.001238375
      Single-Particle Orbitals                12.4116    12.4116         122580       0.000101253
    OneBodyJastrowRef                          0.0194     0.0194          10215       0.000001895
    ParticleSet:::update                       4.1242     0.0093          10215       0.000403739
      DTABOMPTarget::evaluate_e_virtual        3.7846     0.0050          10215       0.000370493
        DTABOMPTarget::offload_e_virtual       3.7795     3.7795          10215       0.000369999
      DTABOMPTarget::evaluate_ion_virtual      0.3303     0.0042          10215       0.000032334
        DTABOMPTarget::offload_ion_virtual     0.3261     0.3261          10215       0.000031924
    TwoBodyJastrowRef                          0.5302     0.5302          10215       0.000051905

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 8.95198e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 1.71095e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 3.47783e+07


* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57177)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57172)

Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3

To display your profiling results:
###################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                             COMMAND                                                                              #
###################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3  #
###################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57269)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57274)
miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 16
Number of walkers per rank = 16

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.0470     0.0470              1       0.046969219
  ParticleSet:::update                         0.0000     0.0000              1       0.000003158
Total                                         43.4415     0.2984              1      43.441528864
  Diffusion                                   23.0528     0.0283              5       4.610556262
    Complete Updates                           0.1858     0.0000              5       0.037150277
      DeterminantRef::update                   0.1857     0.1857             10       0.018573472
    Current Gradient                           1.1579     0.0258          30720       0.000037692
      DeterminantRef::ratio                    1.1193     1.1193          30720       0.000036435
      OneBodyJastrowRef                        0.0069     0.0069          30720       0.000000225
      TwoBodyJastrowRef                        0.0059     0.0059          30720       0.000000191
    Kinetic Energy                             0.2988     0.2986              5       0.059765917
      OneBodyJastrowRef                        0.0002     0.0002              5       0.000032310
      TwoBodyJastrowRef                        0.0001     0.0001              5       0.000017885
    New Gradient                               5.7080     0.0394          30720       0.000185809
      DeterminantRef::ratio                    0.1576     0.1576          30720       0.000005129
      DeterminantRef::spovgl                   4.8661     0.2678          30720       0.000158402
        Single-Particle Orbitals               4.5983     4.5983          30720       0.000149684
      OneBodyJastrowRef                        0.0898     0.0898          30720       0.000002925
      TwoBodyJastrowRef                        0.5551     0.5551          30720       0.000018070
    ParticleSet:::acceptMove                   1.8946     0.0165          15371       0.000123255
      DTAAOMPTarget::update_e_e                1.8595     1.8595          15371       0.000120977
      DTABOMPTarget::update_ion_e              0.0186     0.0186          15371       0.000001208
    ParticleSet:::computeNewPosDT              0.6985     0.0189          30720       0.000022738
      DTAAOMPTarget::move_e_e                  0.6079     0.6079          30720       0.000019788
      DTABOMPTarget::move_ion_e                0.0717     0.0717          30720       0.000002335
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000001833
    Update                                    13.0808     0.0171          15371       0.000851006
      DeterminantRef::update                  12.4740    12.4740          15371       0.000811530
      OneBodyJastrowRef                        0.0031     0.0031          15371       0.000000204
      TwoBodyJastrowRef                        0.5865     0.5865          15371       0.000038158
  Initialization                               2.3551     0.4078              1       2.355138083
    DeterminantRef::inverse                    0.8358     0.8358              2       0.417898280
    DeterminantRef::spovgl                     0.9266     0.0638              2       0.463305512
      Single-Particle Orbitals                 0.8629     0.8629           6144       0.000140439
    OneBodyJastrowRef                          0.0084     0.0084              1       0.008419974
    ParticleSet:::update                       0.0749     0.0091              2       0.037457610
      DTAAOMPTarget::evaluate_e_e              0.0500     0.0500              1       0.049961866
      DTABOMPTarget::evaluate_ion_e            0.0158     0.0002              1       0.015823971
        DTABOMPTarget::offload_ion_e           0.0156     0.0156              1       0.015580366
    TwoBodyJastrowRef                          0.1016     0.1016              1       0.101569782
  Pseudopotential                             17.7352     0.0473              5       3.547047781
    DeterminantRef::spoval                    12.8147     0.2700          10215       0.001254501
      Single-Particle Orbitals                12.5448    12.5448         122580       0.000102339
    OneBodyJastrowRef                          0.0240     0.0240          10215       0.000002350
    ParticleSet:::update                       4.2144     0.0114          10215       0.000412573
      DTABOMPTarget::evaluate_e_virtual        3.8654     0.0056          10215       0.000378408
        DTABOMPTarget::offload_e_virtual       3.8598     3.8598          10215       0.000377856
      DTABOMPTarget::evaluate_ion_virtual      0.3376     0.0043          10215       0.000033052
        DTABOMPTarget::offload_ion_virtual     0.3334     0.3334          10215       0.000032635
    TwoBodyJastrowRef                          0.6348     0.6348          10215       0.000062143

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 1.70844e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 3.21944e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 6.81107e+07


* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57269)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57274)

Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4

To display your profiling results:
###################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                             COMMAND                                                                              #
###################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4  #
###################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57380)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57385)
miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 32
Number of walkers per rank = 32

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.0495     0.0495              1       0.049491403
  ParticleSet:::update                         0.0000     0.0000              1       0.000003765
Total                                         53.7105     0.3550              1      53.710494028
  Diffusion                                   28.8915     0.0348              5       5.778298715
    Complete Updates                           0.2211     0.0000              5       0.044217617
      DeterminantRef::update                   0.2211     0.2211             10       0.022106851
    Current Gradient                           1.4740     0.0302          30720       0.000047980
      DeterminantRef::ratio                    1.4292     1.4292          30720       0.000046523
      OneBodyJastrowRef                        0.0081     0.0081          30720       0.000000262
      TwoBodyJastrowRef                        0.0065     0.0065          30720       0.000000211
    Kinetic Energy                             0.3387     0.3385              5       0.067748930
      OneBodyJastrowRef                        0.0002     0.0002              5       0.000036828
      TwoBodyJastrowRef                        0.0001     0.0001              5       0.000020831
    New Gradient                               6.9904     0.0464          30720       0.000227552
      DeterminantRef::ratio                    0.2164     0.2164          30720       0.000007043
      DeterminantRef::spovgl                   5.8891     0.3616          30720       0.000191703
        Single-Particle Orbitals               5.5275     5.5275          30720       0.000179932
      OneBodyJastrowRef                        0.1052     0.1052          30720       0.000003423
      TwoBodyJastrowRef                        0.7334     0.7334          30720       0.000023874
    ParticleSet:::acceptMove                   2.5594     0.0196          15371       0.000166511
      DTAAOMPTarget::update_e_e                2.5155     2.5155          15371       0.000163654
      DTABOMPTarget::update_ion_e              0.0244     0.0244          15371       0.000001586
    ParticleSet:::computeNewPosDT              0.9132     0.0220          30720       0.000029726
      DTAAOMPTarget::move_e_e                  0.8043     0.8043          30720       0.000026182
      DTABOMPTarget::move_ion_e                0.0868     0.0868          30720       0.000002827
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000002040
    Update                                    16.3599     0.0200          15371       0.001064333
      DeterminantRef::update                  15.5962    15.5962          15371       0.001014650
      OneBodyJastrowRef                        0.0032     0.0032          15371       0.000000206
      TwoBodyJastrowRef                        0.7405     0.7405          15371       0.000048174
  Initialization                               2.8612     0.4130              1       2.861200708
    DeterminantRef::inverse                    1.0264     1.0264              2       0.513193276
    DeterminantRef::spovgl                     1.2158     0.1264              2       0.607883092
      Single-Particle Orbitals                 1.0894     1.0894           6144       0.000177310
    OneBodyJastrowRef                          0.0094     0.0094              1       0.009396581
    ParticleSet:::update                       0.0819     0.0134              2       0.040939113
      DTAAOMPTarget::evaluate_e_e              0.0498     0.0498              1       0.049842724
      DTABOMPTarget::evaluate_ion_e            0.0187     0.0002              1       0.018675217
        DTABOMPTarget::offload_ion_e           0.0185     0.0185              1       0.018519542
    TwoBodyJastrowRef                          0.1148     0.1148              1       0.114750116
  Pseudopotential                             21.6028     0.0634              5       4.320554829
    DeterminantRef::spoval                    15.3045     0.3520          10215       0.001498239
      Single-Particle Orbitals                14.9525    14.9525         122580       0.000121982
    OneBodyJastrowRef                          0.0361     0.0361          10215       0.000003533
    ParticleSet:::update                       5.3399     0.0145          10215       0.000522750
      DTABOMPTarget::evaluate_e_virtual        4.9022     0.0071          10215       0.000479900
        DTABOMPTarget::offload_e_virtual       4.8951     4.8951          10215       0.000479207
      DTABOMPTarget::evaluate_ion_virtual      0.4233     0.0053          10215       0.000041436
        DTABOMPTarget::offload_ion_virtual     0.4180     0.4180          10215       0.000040921
    TwoBodyJastrowRef                          0.8589     0.8589          10215       0.000084082

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 2.7636e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 5.13764e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.11834e+08


* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57380)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57385)

Info: 1/2 lprof instances finished


Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5

To display your profiling results:
###################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                             COMMAND                                                                              #
###################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5  #
###################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57584)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57589)
miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 48
Number of walkers per rank = 48

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.0426     0.0426              1       0.042590638
  ParticleSet:::update                         0.0000     0.0000              1       0.000003432
Total                                         72.2472     0.0093              1      72.247245661
  Diffusion                                   39.8850     0.0473              5       7.977004592
    Complete Updates                           0.2947     0.0000              5       0.058943054
      DeterminantRef::update                   0.2947     0.2947             10       0.029469036
    Current Gradient                           2.1686     0.0474          30720       0.000070594
      DeterminantRef::ratio                    2.1036     2.1036          30720       0.000068478
      OneBodyJastrowRef                        0.0108     0.0108          30720       0.000000351
      TwoBodyJastrowRef                        0.0068     0.0068          30720       0.000000221
    Kinetic Energy                             0.4264     0.4259              5       0.085279103
      OneBodyJastrowRef                        0.0003     0.0003              5       0.000055613
      TwoBodyJastrowRef                        0.0002     0.0002              5       0.000037898
    New Gradient                               9.5807     0.0580          30720       0.000311873
      DeterminantRef::ratio                    0.3102     0.3102          30720       0.000010097
      DeterminantRef::spovgl                   8.0289     0.5336          30720       0.000261356
        Single-Particle Orbitals               7.4953     7.4953          30720       0.000243988
      OneBodyJastrowRef                        0.1403     0.1403          30720       0.000004566
      TwoBodyJastrowRef                        1.0435     1.0435          30720       0.000033968
    ParticleSet:::acceptMove                   3.5986     0.0286          15371       0.000234115
      DTAAOMPTarget::update_e_e                3.5130     3.5130          15371       0.000228546
      DTABOMPTarget::update_ion_e              0.0570     0.0570          15371       0.000003711
    ParticleSet:::computeNewPosDT              1.3616     0.0286          30720       0.000044324
      DTAAOMPTarget::move_e_e                  1.2082     1.2082          30720       0.000039328
      DTABOMPTarget::move_ion_e                0.1249     0.1249          30720       0.000004066
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000002356
    Update                                    22.4070     0.0280          15371       0.001457742
      DeterminantRef::update                  21.3719    21.3719          15371       0.001390404
      OneBodyJastrowRef                        0.0044     0.0044          15371       0.000000283
      TwoBodyJastrowRef                        1.0027     1.0027          15371       0.000065233
  Initialization                               4.0700     0.7762              1       4.070008238
    DeterminantRef::inverse                    1.3612     1.3612              2       0.680621773
    DeterminantRef::spovgl                     1.6645     0.1316              2       0.832269128
      Single-Particle Orbitals                 1.5329     1.5329           6144       0.000249499
    OneBodyJastrowRef                          0.0100     0.0100              1       0.009994917
    ParticleSet:::update                       0.1251     0.0486              2       0.062535036
      DTAAOMPTarget::evaluate_e_e              0.0477     0.0477              1       0.047743709
      DTABOMPTarget::evaluate_ion_e            0.0287     0.0006              1       0.028741747
        DTABOMPTarget::offload_ion_e           0.0281     0.0281              1       0.028128506
    TwoBodyJastrowRef                          0.1330     0.1330              1       0.132993742
  Pseudopotential                             28.2829     0.0820              5       5.656585443
    DeterminantRef::spoval                    19.7880     0.5058          10215       0.001937152
      Single-Particle Orbitals                19.2822    19.2822         122580       0.000157303
    OneBodyJastrowRef                          0.0459     0.0459          10215       0.000004489
    ParticleSet:::update                       7.3149     0.0217          10215       0.000716099
      DTABOMPTarget::evaluate_e_virtual        6.6902     0.0090          10215       0.000654938
        DTABOMPTarget::offload_e_virtual       6.6812     6.6812          10215       0.000654054
      DTABOMPTarget::evaluate_ion_virtual      0.6031     0.0076          10215       0.000059039
        DTABOMPTarget::offload_ion_virtual     0.5955     0.5955          10215       0.000058293
    TwoBodyJastrowRef                          1.0521     1.0521          10215       0.000102998

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 3.08179e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 5.58232e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.2813e+08


* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57589)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57584)

Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6

To display your profiling results:
###################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                             COMMAND                                                                              #
###################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6  #
###################################################################################################################################################################################################
