options

Executable Output

* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 56981)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 56986)miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 1
Number of walkers per rank = 1

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.1216     0.1216              1       0.121563821
  ParticleSet:::update                         0.0000     0.0000              1       0.000003488
Total                                         41.0901     0.0002              1      41.090138003
  Diffusion                                   21.3957     0.0310              5       4.279149120
    Complete Updates                           0.1687     0.0000              5       0.033747691
      DeterminantRef::update                   0.1687     0.1687             10       0.016872364
    Current Gradient                           1.1224     0.0300          30720       0.000036535
      DeterminantRef::ratio                    1.0766     1.0766          30720       0.000035046
      OneBodyJastrowRef                        0.0086     0.0086          30720       0.000000279
      TwoBodyJastrowRef                        0.0072     0.0072          30720       0.000000235
    Kinetic Energy                             0.2910     0.2908              5       0.058204901
      OneBodyJastrowRef                        0.0001     0.0001              5       0.000028718
      TwoBodyJastrowRef                        0.0001     0.0001              5       0.000017759
    New Gradient                               5.3153     0.0452          30720       0.000173024
      DeterminantRef::ratio                    0.1623     0.1623          30720       0.000005284
      DeterminantRef::spovgl                   4.4317     0.2651          30720       0.000144260
        Single-Particle Orbitals               4.1666     4.1666          30720       0.000135631
      OneBodyJastrowRef                        0.1085     0.1085          30720       0.000003533
      TwoBodyJastrowRef                        0.5675     0.5675          30720       0.000018475
    ParticleSet:::acceptMove                   1.7243     0.0167          15371       0.000112178
      DTAAOMPTarget::update_e_e                1.6892     1.6892          15371       0.000109897
      DTABOMPTarget::update_ion_e              0.0184     0.0184          15371       0.000001197
    ParticleSet:::computeNewPosDT              0.7248     0.0222          30720       0.000023595
      DTAAOMPTarget::move_e_e                  0.6224     0.6224          30720       0.000020259
      DTABOMPTarget::move_ion_e                0.0803     0.0803          30720       0.000002614
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000002534
    Update                                    12.0182     0.0180          15371       0.000781876
      DeterminantRef::update                  11.4449    11.4449          15371       0.000744580
      OneBodyJastrowRef                        0.0035     0.0035          15371       0.000000229
      TwoBodyJastrowRef                        0.5518     0.5518          15371       0.000035896
  Initialization                               1.9651     0.1664              1       1.965094128
    DeterminantRef::inverse                    0.7916     0.7916              2       0.395802906
    DeterminantRef::spovgl                     0.8355     0.0602              2       0.417732546
      Single-Particle Orbitals                 0.7753     0.7753           6144       0.000126186
    OneBodyJastrowRef                          0.0084     0.0084              1       0.008350388
    ParticleSet:::update                       0.0619     0.0074              2       0.030927490
      DTAAOMPTarget::evaluate_e_e              0.0389     0.0389              1       0.038885437
      DTABOMPTarget::evaluate_ion_e            0.0155     0.0001              1       0.015531206
        DTABOMPTarget::offload_ion_e           0.0155     0.0155              1       0.015480491
    TwoBodyJastrowRef                          0.1014     0.1014              1       0.101421842
  Pseudopotential                             17.7291     0.0447              5       3.545818692
    DeterminantRef::spoval                    13.1126     0.2675          10215       0.001283662
      Single-Particle Orbitals                12.8452    12.8452         122580       0.000104790
    OneBodyJastrowRef                          0.0198     0.0198          10215       0.000001938
    ParticleSet:::update                       4.1142     0.0104          10215       0.000402764
      DTABOMPTarget::evaluate_e_virtual        3.7733     0.0051          10215       0.000369391
        DTABOMPTarget::offload_e_virtual       3.7682     3.7682          10215       0.000368889
      DTABOMPTarget::evaluate_ion_virtual      0.3305     0.0051          10215       0.000032354
        DTABOMPTarget::offload_ion_virtual     0.3254     0.3254          10215       0.000031859
    TwoBodyJastrowRef                          0.4377     0.4377          10215       0.000042850

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 1.12888e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 2.16798e+10
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 4.25839e+06


* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 56981)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 56986)

Info: 1/2 lprof instances finished


Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0

To display your profiling results:
###################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                             COMMAND                                                                              #
###################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0  #
###################################################################################################################################################################################################

×