options

Executable Output


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 112842)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 112847)miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 1
Number of walkers per rank = 1

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.3270     0.3270              1       0.326982379
  ParticleSet:::update                         0.0000     0.0000              1       0.000004383
Total                                         42.3765     0.0003              1      42.376504688
  Diffusion                                   22.2184     0.0174              5       4.443688108
    Complete Updates                           0.1707     0.0000              5       0.034147655
      DeterminantRef::update                   0.1707     0.1707             10       0.017072748
    Current Gradient                           0.9853     0.0132          30720       0.000032074
      DeterminantRef::ratio                    0.9646     0.9646          30720       0.000031399
      OneBodyJastrowRef                        0.0044     0.0044          30720       0.000000144
      TwoBodyJastrowRef                        0.0031     0.0031          30720       0.000000101
    Kinetic Energy                             0.2368     0.2365              5       0.047352978
      OneBodyJastrowRef                        0.0001     0.0001              5       0.000027305
      TwoBodyJastrowRef                        0.0001     0.0001              5       0.000016840
    New Gradient                               5.7037     0.0195          30720       0.000185666
      DeterminantRef::ratio                    0.1425     0.1425          30720       0.000004638
      DeterminantRef::spovgl                   4.9398     0.2763          30720       0.000160802
        Single-Particle Orbitals               4.6636     4.6636          30720       0.000151809
      OneBodyJastrowRef                        0.0546     0.0546          30720       0.000001779
      TwoBodyJastrowRef                        0.5472     0.5472          30720       0.000017813
    ParticleSet:::acceptMove                   2.4571     0.0209          15371       0.000159856
      DTAAOMPTarget::update_e_e                2.4017     2.4017          15371       0.000156249
      DTABOMPTarget::update_ion_e              0.0345     0.0345          15371       0.000002246
    ParticleSet:::computeNewPosDT              0.7233     0.0136          30720       0.000023546
      DTAAOMPTarget::move_e_e                  0.6170     0.6170          30720       0.000020085
      DTABOMPTarget::move_ion_e                0.0927     0.0927          30720       0.000003017
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000001525
    Update                                    11.9241     0.0110          15371       0.000775751
      DeterminantRef::update                  11.3936    11.3936          15371       0.000741241
      OneBodyJastrowRef                        0.0016     0.0016          15371       0.000000107
      TwoBodyJastrowRef                        0.5179     0.5179          15371       0.000033690
  Initialization                               2.5927     0.3555              1       2.592695627
    DeterminantRef::inverse                    0.9849     0.9849              2       0.492473422
    DeterminantRef::spovgl                     1.0287     0.1093              2       0.514360128
      Single-Particle Orbitals                 0.9194     0.9194           6144       0.000149638
    OneBodyJastrowRef                          0.0072     0.0072              1       0.007192175
    ParticleSet:::update                       0.1205     0.0205              2       0.060259089
      DTAAOMPTarget::evaluate_e_e              0.0812     0.0812              1       0.081233726
      DTABOMPTarget::evaluate_ion_e            0.0187     0.0001              1       0.018741625
        DTABOMPTarget::offload_ion_e           0.0187     0.0187              1       0.018674736
    TwoBodyJastrowRef                          0.0959     0.0959              1       0.095857051
  Pseudopotential                             17.5651     0.0287              5       3.513020973
    DeterminantRef::spoval                    12.6068     0.2454          10215       0.001234149
      Single-Particle Orbitals                12.3614    12.3614         122580       0.000100844
    OneBodyJastrowRef                          0.0133     0.0133          10215       0.000001298
    ParticleSet:::update                       4.4489     0.0058          10215       0.000435522
      DTABOMPTarget::evaluate_e_virtual        4.0952     0.0021          10215       0.000400896
        DTABOMPTarget::offload_e_virtual       4.0931     4.0931          10215       0.000400693
      DTABOMPTarget::evaluate_ion_virtual      0.3479     0.0023          10215       0.000034059
        DTABOMPTarget::offload_ion_virtual     0.3456     0.3456          10215       0.000033830
    TwoBodyJastrowRef                          0.4674     0.4674          10215       0.000045761

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 1.09461e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 2.08771e+10
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 4.29815e+06


* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 112847)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 112842)

Info: 1/2 lprof instances finished


Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0

To display your profiling results:
###################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                             COMMAND                                                                              #
###################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0  #
###################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 112889)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 112894)miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 2
Number of walkers per rank = 2

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.1934     0.1934              1       0.193427395
  ParticleSet:::update                         0.0000     0.0000              1       0.000004714
Total                                         44.9653     0.0008              1      44.965347594
  Diffusion                                   23.9697     0.0201              5       4.793940323
    Complete Updates                           0.1783     0.0000              5       0.035661945
      DeterminantRef::update                   0.1783     0.1783             10       0.017829476
    Current Gradient                           1.0094     0.0155          30720       0.000032857
      DeterminantRef::ratio                    0.9865     0.9865          30720       0.000032113
      OneBodyJastrowRef                        0.0047     0.0047          30720       0.000000153
      TwoBodyJastrowRef                        0.0027     0.0027          30720       0.000000088
    Kinetic Energy                             0.2456     0.2453              5       0.049118265
      OneBodyJastrowRef                        0.0002     0.0002              5       0.000045645
      TwoBodyJastrowRef                        0.0001     0.0001              5       0.000015824
    New Gradient                               6.9855     0.0199          30720       0.000227392
      DeterminantRef::ratio                    0.1431     0.1431          30720       0.000004660
      DeterminantRef::spovgl                   6.2215     0.2951          30720       0.000202524
        Single-Particle Orbitals               5.9264     5.9264          30720       0.000192918
      OneBodyJastrowRef                        0.0549     0.0549          30720       0.000001787
      TwoBodyJastrowRef                        0.5460     0.5460          30720       0.000017774
    ParticleSet:::acceptMove                   2.4587     0.0211          15371       0.000159957
      DTAAOMPTarget::update_e_e                2.4035     2.4035          15371       0.000156363
      DTABOMPTarget::update_ion_e              0.0341     0.0341          15371       0.000002218
    ParticleSet:::computeNewPosDT              0.9472     0.0147          30720       0.000030833
      DTAAOMPTarget::move_e_e                  0.8395     0.8395          30720       0.000027327
      DTABOMPTarget::move_ion_e                0.0930     0.0930          30720       0.000003028
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000001219
    Update                                    12.1249     0.0119          15371       0.000788818
      DeterminantRef::update                  11.5872    11.5872          15371       0.000753833
      OneBodyJastrowRef                        0.0015     0.0015          15371       0.000000100
      TwoBodyJastrowRef                        0.5244     0.5244          15371       0.000034113
  Initialization                               2.9373     0.3632              1       2.937284381
    DeterminantRef::inverse                    1.0550     1.0550              2       0.527521101
    DeterminantRef::spovgl                     1.2947     0.1367              2       0.647364808
      Single-Particle Orbitals                 1.1580     1.1580           6144       0.000188475
    OneBodyJastrowRef                          0.0077     0.0077              1       0.007711939
    ParticleSet:::update                       0.1210     0.0205              2       0.060503032
      DTAAOMPTarget::evaluate_e_e              0.0821     0.0821              1       0.082080402
      DTABOMPTarget::evaluate_ion_e            0.0184     0.0001              1       0.018412011
        DTABOMPTarget::offload_ion_e           0.0183     0.0183              1       0.018346678
    TwoBodyJastrowRef                          0.0956     0.0956              1       0.095571281
  Pseudopotential                             18.0576     0.0317              5       3.611520465
    DeterminantRef::spoval                    13.0365     0.2414          10215       0.001276211
      Single-Particle Orbitals                12.7951    12.7951         122580       0.000104381
    OneBodyJastrowRef                          0.0134     0.0134          10215       0.000001311
    ParticleSet:::update                       4.5020     0.0062          10215       0.000440720
      DTABOMPTarget::evaluate_e_virtual        4.1452     0.0022          10215       0.000405800
        DTABOMPTarget::offload_e_virtual       4.1431     4.1431          10215       0.000405588
      DTABOMPTarget::evaluate_ion_virtual      0.3505     0.0024          10215       0.000034316
        DTABOMPTarget::offload_ion_virtual     0.3482     0.3482          10215       0.000034082
    TwoBodyJastrowRef                          0.4741     0.4741          10215       0.000046413

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 2.06317e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 3.87036e+10
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 8.36185e+06


* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 112894)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 112889)

Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1

To display your profiling results:
###################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                             COMMAND                                                                              #
###################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1  #
###################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 112968)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 112973)miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 4
Number of walkers per rank = 4

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.1057     0.1057              1       0.105698166
  ParticleSet:::update                         0.0000     0.0000              1       0.000005669
Total                                         43.3824     0.5277              1      43.382426670
  Diffusion                                   22.7186     0.0218              5       4.543717029
    Complete Updates                           0.1774     0.0000              5       0.035475395
      DeterminantRef::update                   0.1774     0.1774             10       0.017736625
    Current Gradient                           1.0281     0.0150          30720       0.000033467
      DeterminantRef::ratio                    1.0059     1.0059          30720       0.000032743
      OneBodyJastrowRef                        0.0041     0.0041          30720       0.000000133
      TwoBodyJastrowRef                        0.0031     0.0031          30720       0.000000102
    Kinetic Energy                             0.2496     0.2494              5       0.049923323
      OneBodyJastrowRef                        0.0002     0.0002              5       0.000034812
      TwoBodyJastrowRef                        0.0001     0.0001              5       0.000017503
    New Gradient                               5.7590     0.0211          30720       0.000187466
      DeterminantRef::ratio                    0.1453     0.1453          30720       0.000004728
      DeterminantRef::spovgl                   4.9864     0.3152          30720       0.000162318
        Single-Particle Orbitals               4.6712     4.6712          30720       0.000152057
      OneBodyJastrowRef                        0.0557     0.0557          30720       0.000001812
      TwoBodyJastrowRef                        0.5505     0.5505          30720       0.000017920
    ParticleSet:::acceptMove                   2.3681     0.0179          15371       0.000154061
      DTAAOMPTarget::update_e_e                2.3160     2.3160          15371       0.000150676
      DTABOMPTarget::update_ion_e              0.0342     0.0342          15371       0.000002224
    ParticleSet:::computeNewPosDT              1.0390     0.0164          30720       0.000033823
      DTAAOMPTarget::move_e_e                  0.9283     0.9283          30720       0.000030217
      DTABOMPTarget::move_ion_e                0.0944     0.0944          30720       0.000003074
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000001362
    Update                                    12.0756     0.0120          15371       0.000785610
      DeterminantRef::update                  11.5242    11.5242          15371       0.000749737
      OneBodyJastrowRef                        0.0015     0.0015          15371       0.000000098
      TwoBodyJastrowRef                        0.5379     0.5379          15371       0.000034993
  Initialization                               2.7977     0.6954              1       2.797670594
    DeterminantRef::inverse                    0.9207     0.9207              2       0.460328774
    DeterminantRef::spovgl                     0.9601     0.0950              2       0.480033883
      Single-Particle Orbitals                 0.8650     0.8650           6144       0.000140795
    OneBodyJastrowRef                          0.0077     0.0077              1       0.007654826
    ParticleSet:::update                       0.1187     0.0207              2       0.059350259
      DTAAOMPTarget::evaluate_e_e              0.0795     0.0795              1       0.079456313
      DTABOMPTarget::evaluate_ion_e            0.0186     0.0001              1       0.018584611
        DTABOMPTarget::offload_ion_e           0.0185     0.0185              1       0.018526706
    TwoBodyJastrowRef                          0.0952     0.0952              1       0.095239758
  Pseudopotential                             17.3385     0.0305              5       3.467695513
    DeterminantRef::spoval                    12.3557     0.2324          10215       0.001209565
      Single-Particle Orbitals                12.1233    12.1233         122580       0.000098901
    OneBodyJastrowRef                          0.0130     0.0130          10215       0.000001276
    ParticleSet:::update                       4.4694     0.0058          10215       0.000437532
      DTABOMPTarget::evaluate_e_virtual        4.1122     0.0021          10215       0.000402561
        DTABOMPTarget::offload_e_virtual       4.1101     4.1101          10215       0.000402359
      DTABOMPTarget::evaluate_ion_virtual      0.3514     0.0025          10215       0.000034405
        DTABOMPTarget::offload_ion_virtual     0.3489     0.3489          10215       0.000034156
    TwoBodyJastrowRef                          0.4699     0.4699          10215       0.000045998

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 4.27691e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 8.167e+10
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.74173e+07


* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 112968)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 112973)

Info: 1/2 lprof instances finished


Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2

To display your profiling results:
###################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                             COMMAND                                                                              #
###################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2  #
###################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 113040)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 113045)miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 8
Number of walkers per rank = 8

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.0714     0.0714              1       0.071419500
  ParticleSet:::update                         0.0000     0.0000              1       0.000004271
Total                                         44.9975     0.4686              1      44.997495777
  Diffusion                                   23.7474     0.0209              5       4.749477106
    Complete Updates                           0.1966     0.0000              5       0.039318672
      DeterminantRef::update                   0.1966     0.1966             10       0.019657799
    Current Gradient                           1.0368     0.0143          30720       0.000033750
      DeterminantRef::ratio                    1.0153     1.0153          30720       0.000033050
      OneBodyJastrowRef                        0.0042     0.0042          30720       0.000000138
      TwoBodyJastrowRef                        0.0030     0.0030          30720       0.000000097
    Kinetic Energy                             0.2654     0.2651              5       0.053080173
      OneBodyJastrowRef                        0.0002     0.0002              5       0.000033901
      TwoBodyJastrowRef                        0.0001     0.0001              5       0.000019325
    New Gradient                               5.8018     0.0212          30720       0.000188861
      DeterminantRef::ratio                    0.1420     0.1420          30720       0.000004622
      DeterminantRef::spovgl                   5.0528     0.2804          30720       0.000164480
        Single-Particle Orbitals               4.7724     4.7724          30720       0.000155352
      OneBodyJastrowRef                        0.0547     0.0547          30720       0.000001779
      TwoBodyJastrowRef                        0.5312     0.5312          30720       0.000017291
    ParticleSet:::acceptMove                   2.4966     0.0213          15371       0.000162420
      DTAAOMPTarget::update_e_e                2.4422     2.4422          15371       0.000158882
      DTABOMPTarget::update_ion_e              0.0331     0.0331          15371       0.000002155
    ParticleSet:::computeNewPosDT              0.8688     0.0151          30720       0.000028282
      DTAAOMPTarget::move_e_e                  0.7640     0.7640          30720       0.000024871
      DTABOMPTarget::move_ion_e                0.0896     0.0896          30720       0.000002918
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000000789
    Update                                    13.0605     0.0133          15371       0.000849687
      DeterminantRef::update                  12.4937    12.4937          15371       0.000812808
      OneBodyJastrowRef                        0.0017     0.0017          15371       0.000000107
      TwoBodyJastrowRef                        0.5519     0.5519          15371       0.000035909
  Initialization                               2.8181     0.4923              1       2.818070391
    DeterminantRef::inverse                    1.0637     1.0637              2       0.531851775
    DeterminantRef::spovgl                     1.0282     0.1124              2       0.514102669
      Single-Particle Orbitals                 0.9158     0.9158           6144       0.000149054
    OneBodyJastrowRef                          0.0077     0.0077              1       0.007687805
    ParticleSet:::update                       0.1303     0.0219              2       0.065167320
      DTAAOMPTarget::evaluate_e_e              0.0898     0.0898              1       0.089799057
      DTABOMPTarget::evaluate_ion_e            0.0186     0.0001              1       0.018612597
        DTABOMPTarget::offload_ion_e           0.0185     0.0185              1       0.018548890
    TwoBodyJastrowRef                          0.0958     0.0958              1       0.095818948
  Pseudopotential                             17.9635     0.0319              5       3.592695909
    DeterminantRef::spoval                    12.8554     0.2195          10215       0.001258478
      Single-Particle Orbitals                12.6359    12.6359         122580       0.000103083
    OneBodyJastrowRef                          0.0151     0.0151          10215       0.000001478
    ParticleSet:::update                       4.5304     0.0072          10215       0.000443505
      DTABOMPTarget::evaluate_e_virtual        4.1616     0.0025          10215       0.000407404
        DTABOMPTarget::offload_e_virtual       4.1592     4.1592          10215       0.000407161
      DTABOMPTarget::evaluate_ion_virtual      0.3616     0.0029          10215       0.000035400
        DTABOMPTarget::offload_ion_virtual     0.3588     0.3588          10215       0.000035121
    TwoBodyJastrowRef                          0.5308     0.5308          10215       0.000051960

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 8.2468e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 1.56264e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 3.36226e+07


* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 113045)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 113040)

Info: 1/2 lprof instances finished


Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3

To display your profiling results:
###################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                             COMMAND                                                                              #
###################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3  #
###################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 113144)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 113149)miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 16
Number of walkers per rank = 16

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.0668     0.0668              1       0.066807586
  ParticleSet:::update                         0.0000     0.0000              1       0.000004194
Total                                         51.4021     0.7414              1      51.402070551
  Diffusion                                   27.4747     0.0205              5       5.494941222
    Complete Updates                           0.2323     0.0000              5       0.046455391
      DeterminantRef::update                   0.2323     0.2323             10       0.023226010
    Current Gradient                           1.1426     0.0158          30720       0.000037195
      DeterminantRef::ratio                    1.1193     1.1193          30720       0.000036436
      OneBodyJastrowRef                        0.0044     0.0044          30720       0.000000144
      TwoBodyJastrowRef                        0.0031     0.0031          30720       0.000000102
    Kinetic Energy                             0.3076     0.3073              5       0.061517052
      OneBodyJastrowRef                        0.0002     0.0002              5       0.000032285
      TwoBodyJastrowRef                        0.0001     0.0001              5       0.000018519
    New Gradient                               6.6775     0.0193          30720       0.000217366
      DeterminantRef::ratio                    0.1465     0.1465          30720       0.000004767
      DeterminantRef::spovgl                   5.8941     0.2783          30720       0.000191867
        Single-Particle Orbitals               5.6158     5.6158          30720       0.000182808
      OneBodyJastrowRef                        0.0580     0.0580          30720       0.000001887
      TwoBodyJastrowRef                        0.5596     0.5596          30720       0.000018217
    ParticleSet:::acceptMove                   3.1231     0.0255          15371       0.000203180
      DTAAOMPTarget::update_e_e                3.0635     3.0635          15371       0.000199304
      DTABOMPTarget::update_ion_e              0.0341     0.0341          15371       0.000002219
    ParticleSet:::computeNewPosDT              0.7777     0.0148          30720       0.000025316
      DTAAOMPTarget::move_e_e                  0.6684     0.6684          30720       0.000021758
      DTABOMPTarget::move_ion_e                0.0945     0.0945          30720       0.000003075
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000001637
    Update                                    15.1934     0.0141          15371       0.000988446
      DeterminantRef::update                  14.5033    14.5033          15371       0.000943550
      OneBodyJastrowRef                        0.0016     0.0016          15371       0.000000101
      TwoBodyJastrowRef                        0.6745     0.6745          15371       0.000043878
  Initialization                               3.4351     0.8485              1       3.435084857
    DeterminantRef::inverse                    1.1304     1.1304              2       0.565204658
    DeterminantRef::spovgl                     1.1878     0.1288              2       0.593908241
      Single-Particle Orbitals                 1.0590     1.0590           6144       0.000172360
    OneBodyJastrowRef                          0.0083     0.0083              1       0.008285255
    ParticleSet:::update                       0.1514     0.0210              2       0.075708665
      DTAAOMPTarget::evaluate_e_e              0.1101     0.1101              1       0.110121997
      DTABOMPTarget::evaluate_ion_e            0.0203     0.0001              1       0.020336798
        DTABOMPTarget::offload_ion_e           0.0203     0.0203              1       0.020266407
    TwoBodyJastrowRef                          0.1087     0.1087              1       0.108688709
  Pseudopotential                             19.7509     0.0407              5       3.950177296
    DeterminantRef::spoval                    14.1817     0.2771          10215       0.001388323
      Single-Particle Orbitals                13.9046    13.9046         122580       0.000113433
    OneBodyJastrowRef                          0.0219     0.0219          10215       0.000002142
    ParticleSet:::update                       4.7974     0.0103          10215       0.000469646
      DTABOMPTarget::evaluate_e_virtual        4.3966     0.0039          10215       0.000430404
        DTABOMPTarget::offload_e_virtual       4.3927     4.3927          10215       0.000430020
      DTABOMPTarget::evaluate_ion_virtual      0.3905     0.0033          10215       0.000038231
        DTABOMPTarget::offload_ion_virtual     0.3872     0.3872          10215       0.000037904
    TwoBodyJastrowRef                          0.7092     0.7092          10215       0.000069424

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 1.44385e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 2.70129e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 6.11598e+07


* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 113144)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 113149)

Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4

To display your profiling results:
###################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                             COMMAND                                                                              #
###################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4  #
###################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 113274)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 113279)miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 32
Number of walkers per rank = 32

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.0624     0.0624              1       0.062409120
  ParticleSet:::update                         0.0000     0.0000              1       0.000004199
Total                                         74.5545     1.1318              1      74.554523638
  Diffusion                                   41.5660     0.0328              5       8.313199207
    Complete Updates                           0.3603     0.0000              5       0.072069559
      DeterminantRef::update                   0.3603     0.3603             10       0.036032264
    Current Gradient                           1.7267     0.0234          30720       0.000056208
      DeterminantRef::ratio                    1.6926     1.6926          30720       0.000055098
      OneBodyJastrowRef                        0.0065     0.0065          30720       0.000000212
      TwoBodyJastrowRef                        0.0042     0.0042          30720       0.000000137
    Kinetic Energy                             0.4461     0.4457              5       0.089215936
      OneBodyJastrowRef                        0.0002     0.0002              5       0.000043055
      TwoBodyJastrowRef                        0.0001     0.0001              5       0.000024284
    New Gradient                              10.0496     0.0279          30720       0.000327135
      DeterminantRef::ratio                    0.2205     0.2205          30720       0.000007178
      DeterminantRef::spovgl                   8.8648     0.4327          30720       0.000288567
        Single-Particle Orbitals               8.4321     8.4321          30720       0.000274481
      OneBodyJastrowRef                        0.0880     0.0880          30720       0.000002865
      TwoBodyJastrowRef                        0.8484     0.8484          30720       0.000027616
    ParticleSet:::acceptMove                   5.6717     0.0379          15371       0.000368988
      DTAAOMPTarget::update_e_e                5.5908     5.5908          15371       0.000363721
      DTABOMPTarget::update_ion_e              0.0431     0.0431          15371       0.000002802
    ParticleSet:::computeNewPosDT              1.1776     0.0197          30720       0.000038333
      DTAAOMPTarget::move_e_e                  1.0302     1.0302          30720       0.000033534
      DTABOMPTarget::move_ion_e                0.1277     0.1277          30720       0.000004158
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000002399
    Update                                    22.1012     0.0194          15371       0.001437849
      DeterminantRef::update                  20.9085    20.9085          15371       0.001360258
      OneBodyJastrowRef                        0.0021     0.0021          15371       0.000000136
      TwoBodyJastrowRef                        1.1712     1.1712          15371       0.000076195
  Initialization                               4.7395     1.0510              1       4.739470701
    DeterminantRef::inverse                    1.5892     1.5892              2       0.794613049
    DeterminantRef::spovgl                     1.7197     0.1758              2       0.859845106
      Single-Particle Orbitals                 1.5439     1.5439           6144       0.000251289
    OneBodyJastrowRef                          0.0103     0.0103              1       0.010297933
    ParticleSet:::update                       0.2473     0.0904              2       0.123629081
      DTAAOMPTarget::evaluate_e_e              0.1287     0.1287              1       0.128688133
      DTABOMPTarget::evaluate_ion_e            0.0281     0.0031              1       0.028120763
        DTABOMPTarget::offload_ion_e           0.0250     0.0250              1       0.025004214
    TwoBodyJastrowRef                          0.1220     0.1220              1       0.122019323
  Pseudopotential                             27.1172     0.0756              5       5.423441918
    DeterminantRef::spoval                    19.6306     0.4108          10215       0.001921738
      Single-Particle Orbitals                19.2198    19.2198         122580       0.000156794
    OneBodyJastrowRef                          0.0422     0.0422          10215       0.000004127
    ParticleSet:::update                       6.1264     0.0183          10215       0.000599746
      DTABOMPTarget::evaluate_e_virtual        5.6166     0.0069          10215       0.000549837
        DTABOMPTarget::offload_e_virtual       5.6097     5.6097          10215       0.000549164
      DTABOMPTarget::evaluate_ion_virtual      0.4916     0.0076          10215       0.000048122
        DTABOMPTarget::offload_ion_virtual     0.4839     0.4839          10215       0.000047376
    TwoBodyJastrowRef                          1.2425     1.2425          10215       0.000121631

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 1.99095e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 3.57105e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 8.90917e+07


* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 113279)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 113274)

Info: 1/2 lprof instances finished


Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5

To display your profiling results:
###################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                             COMMAND                                                                              #
###################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5  #
###################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 113478)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 113483)miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 48
Number of walkers per rank = 48

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.0874     0.0874              1       0.087375768
  ParticleSet:::update                         0.0000     0.0000              1       0.000006093
Total                                        106.2653     0.2648              1     106.265251368
  Diffusion                                   63.4955     0.0463              5      12.699109602
    Complete Updates                           0.3674     0.0000              5       0.073473642
      DeterminantRef::update                   0.3673     0.3673             10       0.036733776
    Current Gradient                           2.8390     0.0383          30720       0.000092415
      DeterminantRef::ratio                    2.7828     2.7828          30720       0.000090585
      OneBodyJastrowRef                        0.0114     0.0114          30720       0.000000372
      TwoBodyJastrowRef                        0.0065     0.0065          30720       0.000000213
    Kinetic Energy                             0.5897     0.5891              5       0.117936987
      OneBodyJastrowRef                        0.0004     0.0004              5       0.000075926
      TwoBodyJastrowRef                        0.0002     0.0002              5       0.000041192
    New Gradient                              15.2225     0.0441          30720       0.000495523
      DeterminantRef::ratio                    0.3757     0.3757          30720       0.000012231
      DeterminantRef::spovgl                  13.2822     0.7859          30720       0.000432362
        Single-Particle Orbitals              12.4963    12.4963          30720       0.000406781
      OneBodyJastrowRef                        0.1437     0.1437          30720       0.000004678
      TwoBodyJastrowRef                        1.3768     1.3768          30720       0.000044817
    ParticleSet:::acceptMove                   8.6575     0.0243          15371       0.000563234
      DTAAOMPTarget::update_e_e                8.5621     8.5621          15371       0.000557031
      DTABOMPTarget::update_ion_e              0.0711     0.0711          15371       0.000004624
    ParticleSet:::computeNewPosDT              1.8938     0.0244          30720       0.000061646
      DTAAOMPTarget::move_e_e                  1.6771     1.6771          30720       0.000054594
      DTABOMPTarget::move_ion_e                0.1922     0.1922          30720       0.000006256
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000002495
    Update                                    33.8795     0.0157          15371       0.002204118
      DeterminantRef::update                  32.1439    32.1439          15371       0.002091204
      OneBodyJastrowRef                        0.0069     0.0069          15371       0.000000448
      TwoBodyJastrowRef                        1.7130     1.7130          15371       0.000111444
  Initialization                               6.4792     1.5664              1       6.479216213
    DeterminantRef::inverse                    2.2005     2.2005              2       1.100271850
    DeterminantRef::spovgl                     2.2997     0.1676              2       1.149850815
      Single-Particle Orbitals                 2.1321     2.1321           6144       0.000347025
    OneBodyJastrowRef                          0.0164     0.0164              1       0.016353389
    ParticleSet:::update                       0.2252     0.1021              2       0.112589045
      DTAAOMPTarget::evaluate_e_e              0.0889     0.0889              1       0.088856692
      DTABOMPTarget::evaluate_ion_e            0.0343     0.0080              1       0.034266822
        DTABOMPTarget::offload_ion_e           0.0262     0.0262              1       0.026219456
    TwoBodyJastrowRef                          0.1711     0.1711              1       0.171064253
  Pseudopotential                             36.0257     0.1055              5       7.205130145
    DeterminantRef::spoval                    25.9077     0.6149          10215       0.002536239
      Single-Particle Orbitals                25.2928    25.2928         122580       0.000206337
    OneBodyJastrowRef                          0.0622     0.0622          10215       0.000006089
    ParticleSet:::update                       8.2552     0.0268          10215       0.000808142
      DTABOMPTarget::evaluate_e_virtual        7.5218     0.0088          10215       0.000736349
        DTABOMPTarget::offload_e_virtual       7.5130     7.5130          10215       0.000735488
      DTABOMPTarget::evaluate_ion_virtual      0.7066     0.0097          10215       0.000069169
        DTABOMPTarget::offload_ion_virtual     0.6968     0.6968          10215       0.000068216
    TwoBodyJastrowRef                          1.6951     1.6951          10215       0.000165939

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 2.09524e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 3.50656e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.00592e+08


* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 113478)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 113483)

Info: 1/2 lprof instances finished


Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6

To display your profiling results:
###################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                             COMMAND                                                                              #
###################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6  #
###################################################################################################################################################################################################

×