options

Executable Output


* Info: Detected 8 Lprof instances in gmz10.benchmarkcenter.megware.com. 
If this is incorrect, rerun with number-processes-per-node=X
[0] MPI startup(): Intel(R) MPI Library, Version 2021.14  Build 20240911 (id: b3fc682)
[0] MPI startup(): Copyright (C) 2003-2024 Intel Corporation.  All rights reserved.
[0] MPI startup(): library kind: release
[0] MPI startup(): Load tuning file: "/cluster/intel/oneapi/2025.0.0/mpi/2021.14/opt/mpi/etc/tuning_generic_shm.dat"
[0] MPI startup(): ===== CPU pinning =====
[0] MPI startup(): Rank    Pid      Node name                          Pin cpu
[0] MPI startup(): 0       456937   gmz10.benchmarkcenter.megware.com  {0}
[0] MPI startup(): 1       456948   gmz10.benchmarkcenter.megware.com  {32}
[0] MPI startup(): 2       456946   gmz10.benchmarkcenter.megware.com  {64}
[0] MPI startup(): 3       456945   gmz10.benchmarkcenter.megware.com  {96}
[0] MPI startup(): 4       456943   gmz10.benchmarkcenter.megware.com  {128}
[0] MPI startup(): 5       456941   gmz10.benchmarkcenter.megware.com  {160}
[0] MPI startup(): 6       456964   gmz10.benchmarkcenter.megware.com  {192}
[0] MPI startup(): 7       456942   gmz10.benchmarkcenter.megware.com  {224}
miniqmc not built from git repository

number of ranks : 8, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 8
OpenMP threads = 32
Number of walkers per rank = 32

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.0600     0.0600              1       0.060007061
  ParticleSet:::update                         0.0000     0.0000              1       0.000000550
Total                                        167.0595     0.0021              1     167.059526840
  Diffusion                                  107.4603     0.0349              5      21.492058748
    Complete Updates                           1.1361     0.0000              5       0.227220949
      DeterminantRef::update                   1.1361     1.1361             10       0.113607001
    Current Gradient                           2.5839     0.0202          30720       0.000084112
      DeterminantRef::ratio                    2.5550     2.5550          30720       0.000083169
      OneBodyJastrowRef                        0.0053     0.0053          30720       0.000000172
      TwoBodyJastrowRef                        0.0035     0.0035          30720       0.000000114
    Kinetic Energy                             1.0781     1.0771              5       0.215621380
      OneBodyJastrowRef                        0.0006     0.0006              5       0.000123034
      TwoBodyJastrowRef                        0.0004     0.0004              5       0.000083440
    New Gradient                              15.7443     0.0232          30720       0.000512509
      DeterminantRef::ratio                    0.0801     0.0801          30720       0.000002609
      DeterminantRef::spovgl                  14.8456     0.3445          30720       0.000483257
        Single-Particle Orbitals              14.5012    14.5012          30720       0.000472043
      OneBodyJastrowRef                        0.0882     0.0882          30720       0.000002873
      TwoBodyJastrowRef                        0.7071     0.7071          30720       0.000023016
    ParticleSet:::acceptMove                   3.8409     0.0215          15371       0.000249882
      DTAAOMPTarget::update_e_e                3.7503     3.7503          15371       0.000243986
      DTABOMPTarget::update_ion_e              0.0691     0.0691          15371       0.000004496
    ParticleSet:::computeNewPosDT              1.4582     0.0116          30720       0.000047467
      DTAAOMPTarget::move_e_e                  1.2792     1.2792          30720       0.000041641
      DTABOMPTarget::move_ion_e                0.1674     0.1674          30720       0.000005449
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000001626
    Update                                    81.5838     0.0173          15371       0.005307646
      DeterminantRef::update                  79.3881    79.3881          15371       0.005164795
      OneBodyJastrowRef                        0.0020     0.0020          15371       0.000000129
      TwoBodyJastrowRef                        2.1765     2.1765          15371       0.000141598
  Initialization                               9.2813     1.3024              1       9.281330344
    DeterminantRef::inverse                    4.9411     4.9411              2       2.470526148
    DeterminantRef::spovgl                     2.5268     0.1729              2       1.263420511
      Single-Particle Orbitals                 2.3540     2.3540           6144       0.000383132
    OneBodyJastrowRef                          0.0096     0.0096              1       0.009572802
    ParticleSet:::update                       0.3995     0.0487              2       0.199735158
      DTAAOMPTarget::evaluate_e_e              0.3115     0.3115              1       0.311499634
      DTABOMPTarget::evaluate_ion_e            0.0392     0.0001              1       0.039231449
        DTABOMPTarget::offload_ion_e           0.0391     0.0391              1       0.039095539
    TwoBodyJastrowRef                          0.1020     0.1020              1       0.101981815
  Pseudopotential                             50.3158     0.1332              5      10.063166156
    DeterminantRef::spoval                    40.5077     0.4034          10215       0.003965511
      Single-Particle Orbitals                40.1043    40.1043         122580       0.000327169
    OneBodyJastrowRef                          0.0637     0.0637          10215       0.000006236
    ParticleSet:::update                       7.7377     0.0216          10215       0.000757489
      DTABOMPTarget::evaluate_e_virtual        7.0688     0.0091          10215       0.000692005
        DTABOMPTarget::offload_e_virtual       7.0598     7.0598          10215       0.000691119
      DTABOMPTarget::evaluate_ion_virtual      0.6473     0.0083          10215       0.000063368
        DTABOMPTarget::offload_ion_virtual     0.6390     0.6390          10215       0.000062559
    TwoBodyJastrowRef                          1.8735     1.8735          10215       0.000183406

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 3.55404e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 5.52517e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.9206e+08



Info: 7/8 lprof instances finished


Your experiment path is /home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/compilers/gcc_3/oneview_results_1739399676/tools/lprof_npsu_run_0

To display your profiling results:
######################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                               COMMAND                                                                               #
######################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/compilers/gcc_3/oneview_results_1739399676/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/compilers/gcc_3/oneview_results_1739399676/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/compilers/gcc_3/oneview_results_1739399676/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/compilers/gcc_3/oneview_results_1739399676/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/compilers/gcc_3/oneview_results_1739399676/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/compilers/gcc_3/oneview_results_1739399676/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/compilers/gcc_3/oneview_results_1739399676/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/compilers/gcc_3/oneview_results_1739399676/tools/lprof_npsu_run_0  #
######################################################################################################################################################################################################

×