options

Executable Output


* Info: Selecting the 'perf-low-ppn' engine for node turpancomp0

* Info: "ref-cycles" not supported on turpancomp0: fallback to "cpu-clock"
* Info: Process launched (host turpancomp0, process 673783)miniqmc not built from git repository

number of ranks : 1, number of accelerators : 0
Number of orbitals/splines = 1536
Tile size = 1536
Number of tiles = 1
Number of electrons = 3072
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 1
OpenMP threads = 80
Number of walkers per rank = 80

SPO coefficients size = 786432000 bytes (750 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.0184     0.0184              1       0.018449605
  ParticleSet:::update                         0.0000     0.0000              1       0.000001200
Total                                         56.3573     0.4237              1      56.357334906
  Diffusion                                   33.5689     0.0711              5       6.713789141
    Complete Updates                           0.4785     0.0001              5       0.095701593
      DeterminantRef::update                   0.4784     0.4784             10       0.047842597
    Current Gradient                           3.0184     0.0460          15360       0.000196513
      DeterminantRef::ratio                    2.9456     2.9456          15360       0.000191769
      OneBodyJastrowRef                        0.0130     0.0130          15360       0.000000844
      TwoBodyJastrowRef                        0.0139     0.0139          15360       0.000000904
    Kinetic Energy                             0.4449     0.4441              5       0.088977676
      OneBodyJastrowRef                        0.0005     0.0005              5       0.000090761
      TwoBodyJastrowRef                        0.0003     0.0003              5       0.000061681
    New Gradient                               8.9137     0.0629          15360       0.000580318
      DeterminantRef::ratio                    0.0835     0.0835          15360       0.000005437
      DeterminantRef::spovgl                   7.7016     0.5069          15360       0.000501404
        Single-Particle Orbitals               7.1946     7.1946          15360       0.000468400
      OneBodyJastrowRef                        0.1744     0.1744          15360       0.000011353
      TwoBodyJastrowRef                        0.8913     0.8913          15360       0.000058027
    ParticleSet:::acceptMove                   4.1287     0.0339           7611       0.000542469
      DTAAOMPTarget::update_e_e                4.0363     4.0363           7611       0.000530326
      DTABOMPTarget::update_ion_e              0.0585     0.0585           7611       0.000007691
    ParticleSet:::computeNewPosDT              1.3728     0.0333          15360       0.000089375
      DTAAOMPTarget::move_e_e                  1.1733     1.1733          15360       0.000076389
      DTABOMPTarget::move_ion_e                0.1662     0.1662          15360       0.000010819
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000003216
    Update                                    15.1407     0.0289           7611       0.001989322
      DeterminantRef::update                  13.7954    13.7954           7611       0.001812567
      OneBodyJastrowRef                        0.0049     0.0049           7611       0.000000643
      TwoBodyJastrowRef                        1.3115     1.3115           7611       0.000172317
  Initialization                               2.5915     0.7048              1       2.591505074
    DeterminantRef::inverse                    0.5657     0.5657              2       0.282871163
    DeterminantRef::spovgl                     1.0872     0.0849              2       0.543587608
      Single-Particle Orbitals                 1.0023     1.0023           3072       0.000326253
    OneBodyJastrowRef                          0.0065     0.0065              1       0.006531458
    ParticleSet:::update                       0.1861     0.1291              2       0.093030090
      DTAAOMPTarget::evaluate_e_e              0.0287     0.0287              1       0.028689376
      DTABOMPTarget::evaluate_ion_e            0.0283     0.0001              1       0.028297212
        DTABOMPTarget::offload_ion_e           0.0282     0.0282              1       0.028219812
    TwoBodyJastrowRef                          0.0412     0.0412              1       0.041240888
  Pseudopotential                             19.7732     0.1101              5       3.954632642
    DeterminantRef::spoval                    16.1216     0.2795           5359       0.003008315
      Single-Particle Orbitals                15.8421    15.8421          64308       0.000246347
    OneBodyJastrowRef                          0.0833     0.0833           5359       0.000015538
    ParticleSet:::update                       2.4871     0.0201           5359       0.000464094
      DTABOMPTarget::evaluate_e_virtual        2.0799     0.0107           5359       0.000388113
        DTABOMPTarget::offload_e_virtual       2.0692     2.0692           5359       0.000386121
      DTABOMPTarget::evaluate_ion_virtual      0.3871     0.0078           5359       0.000072225
        DTABOMPTarget::offload_ion_virtual     0.3793     0.3793           5359       0.000070779
    TwoBodyJastrowRef                          0.9711     0.9711           5359       0.000181211

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 4.11532e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 6.90901e+10
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 3.81818e+07


* Info: Process finished (host turpancomp0, process 673783)
* Info: Dumping samples (host turpancomp0, process 673783)
* Info: Dumping source info for callchain nodes (host turpancomp0, process 673783)
* Info: Building/writing metadata (host turpancomp0)
* Info: Finished collect step (host turpancomp0, process 673783)

Your experiment path is /work/m23012/camus/qaas-runs/171-102-6087/intel/miniqmc/run/oneview_runs/compilers/armclang_5/oneview_results_1711027961/tools/lprof_npsu_run_0

To display your profiling results:
###########################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                                 COMMAND                                                                                  #
###########################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/work/m23012/camus/qaas-runs/171-102-6087/intel/miniqmc/run/oneview_runs/compilers/armclang_5/oneview_results_1711027961/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/work/m23012/camus/qaas-runs/171-102-6087/intel/miniqmc/run/oneview_runs/compilers/armclang_5/oneview_results_1711027961/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/work/m23012/camus/qaas-runs/171-102-6087/intel/miniqmc/run/oneview_runs/compilers/armclang_5/oneview_results_1711027961/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/work/m23012/camus/qaas-runs/171-102-6087/intel/miniqmc/run/oneview_runs/compilers/armclang_5/oneview_results_1711027961/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/work/m23012/camus/qaas-runs/171-102-6087/intel/miniqmc/run/oneview_runs/compilers/armclang_5/oneview_results_1711027961/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/work/m23012/camus/qaas-runs/171-102-6087/intel/miniqmc/run/oneview_runs/compilers/armclang_5/oneview_results_1711027961/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/work/m23012/camus/qaas-runs/171-102-6087/intel/miniqmc/run/oneview_runs/compilers/armclang_5/oneview_results_1711027961/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/work/m23012/camus/qaas-runs/171-102-6087/intel/miniqmc/run/oneview_runs/compilers/armclang_5/oneview_results_1711027961/tools/lprof_npsu_run_0  #
###########################################################################################################################################################################################################

×