options

Executable Output


* Info: Detected 2 Lprof instances in itp09.benchmarkcenter.megware.com. 
If this is incorrect, rerun with number-processes-per-node=X
[0] MPI startup(): Intel(R) MPI Library, Version 2021.14  Build 20240911 (id: b3fc682)
[0] MPI startup(): Copyright (C) 2003-2024 Intel Corporation.  All rights reserved.
[0] MPI startup(): library kind: release
[0] MPI startup(): Load tuning file: "/cluster/intel/oneapi/2025.0.0/mpi/2021.14/opt/mpi/etc/tuning_icx_shm.dat"
[0] MPI startup(): ===== CPU pinning =====
[0] MPI startup(): Rank    Pid      Node name                          Pin cpu
[0] MPI startup(): 0       2323329  itp09.benchmarkcenter.megware.com  {0}
[0] MPI startup(): 1       2323415  itp09.benchmarkcenter.megware.com  {36}
miniqmc not built from git repository

number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 36
Number of walkers per rank = 36

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 

Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                          0.0574     0.0574              1       0.057408256
  ParticleSet:::update                         0.0000     0.0000              1       0.000003348
Total                                        133.8591     0.0204              1     133.859070391
  Diffusion                                   83.8194     0.0641              5      16.763877669
    Complete Updates                           0.7044     0.0000              5       0.140875101
      DeterminantRef::update                   0.7043     0.7043             10       0.070433125
    Current Gradient                           3.7641     0.0502          30720       0.000122528
      DeterminantRef::ratio                    3.6932     3.6932          30720       0.000120220
      OneBodyJastrowRef                        0.0120     0.0120          30720       0.000000389
      TwoBodyJastrowRef                        0.0087     0.0087          30720       0.000000284
    Kinetic Energy                             0.8767     0.8756              5       0.175346557
      OneBodyJastrowRef                        0.0005     0.0005              5       0.000102565
      TwoBodyJastrowRef                        0.0006     0.0006              5       0.000117510
    New Gradient                              21.7186     0.0576          30720       0.000706985
      DeterminantRef::ratio                    0.3270     0.3270          30720       0.000010644
      DeterminantRef::spovgl                  19.3431     1.1244          30720       0.000629658
        Single-Particle Orbitals              18.2187    18.2187          30720       0.000593057
      OneBodyJastrowRef                        0.2007     0.2007          30720       0.000006535
      TwoBodyJastrowRef                        1.7902     1.7902          30720       0.000058274
    ParticleSet:::acceptMove                   9.4109     0.0468          15371       0.000612249
      DTAAOMPTarget::update_e_e                9.2863     9.2863          15371       0.000604146
      DTABOMPTarget::update_ion_e              0.0778     0.0778          15371       0.000005060
    ParticleSet:::computeNewPosDT              3.2837     0.0326          30720       0.000106890
      DTAAOMPTarget::move_e_e                  3.0032     3.0032          30720       0.000097759
      DTABOMPTarget::move_ion_e                0.2479     0.2479          30720       0.000008071
    ParticleSet:::donePbyP                     0.0000     0.0000              5       0.000001887
    Update                                    43.9970     0.0233          15371       0.002862338
      DeterminantRef::update                  41.2809    41.2809          15371       0.002685632
      OneBodyJastrowRef                        0.0083     0.0083          15371       0.000000542
      TwoBodyJastrowRef                        2.6845     2.6845          15371       0.000174649
  Initialization                               8.0244     3.0509              1       8.024394253
    DeterminantRef::inverse                    1.6165     1.6165              2       0.808242599
    DeterminantRef::spovgl                     2.9002     0.1763              2       1.450115208
      Single-Particle Orbitals                 2.7240     2.7240           6144       0.000443354
    OneBodyJastrowRef                          0.0161     0.0161              1       0.016140430
    ParticleSet:::update                       0.2758     0.1458              2       0.137915366
      DTAAOMPTarget::evaluate_e_e              0.1011     0.1011              1       0.101097808
      DTABOMPTarget::evaluate_ion_e            0.0290     0.0051              1       0.028977271
        DTABOMPTarget::offload_ion_e           0.0239     0.0239              1       0.023855423
    TwoBodyJastrowRef                          0.1648     0.1648              1       0.164790573
  Pseudopotential                             41.9949     0.1287              5       8.398972834
    DeterminantRef::spoval                    30.3064     0.6462          10215       0.002966855
      Single-Particle Orbitals                29.6602    29.6602         122580       0.000241966
    OneBodyJastrowRef                          0.0768     0.0768          10215       0.000007519
    ParticleSet:::update                       9.1529     0.0294          10215       0.000896025
      DTABOMPTarget::evaluate_e_virtual        8.3124     0.0120          10215       0.000813749
        DTABOMPTarget::offload_e_virtual       8.3004     8.3004          10215       0.000812571
      DTABOMPTarget::evaluate_ion_virtual      0.8111     0.0092          10215       0.000079399
        DTABOMPTarget::offload_ion_virtual     0.8019     0.8019          10215       0.000078500
    TwoBodyJastrowRef                          2.3301     2.3301          10215       0.000228104

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 1.24749e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 1.99224e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 6.472e+07



Info: 1/2 lprof instances finished


Your experiment path is /beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9252/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1744121461/tools/lprof_npsu_run_0

To display your profiling results:
##########################################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                                         COMMAND                                                                                         #
##########################################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9252/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1744121461/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9252/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1744121461/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9252/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1744121461/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9252/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1744121461/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9252/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1744121461/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9252/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1744121461/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9252/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1744121461/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9252/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1744121461/tools/lprof_npsu_run_0  #
##########################################################################################################################################################################################################################

×