* Info: Detected 8 Lprof instances in gmz10.benchmarkcenter.megware.com.
If this is incorrect, rerun with number-processes-per-node=X
[0] MPI startup(): Intel(R) MPI Library, Version 2021.14 Build 20240911 (id: b3fc682)
[0] MPI startup(): Copyright (C) 2003-2024 Intel Corporation. All rights reserved.
[0] MPI startup(): library kind: release
[0] MPI startup(): Load tuning file: "/cluster/intel/oneapi/2025.0.0/mpi/2021.14/opt/mpi/etc/tuning_generic_shm.dat"
[0] MPI startup(): ===== CPU pinning =====
[0] MPI startup(): Rank  Pid     Node name                          Pin cpu
[0] MPI startup(): 0     397577  gmz10.benchmarkcenter.megware.com  {0}
[0] MPI startup(): 1     397553  gmz10.benchmarkcenter.megware.com  {32}
[0] MPI startup(): 2     397559  gmz10.benchmarkcenter.megware.com  {64}
[0] MPI startup(): 3     397555  gmz10.benchmarkcenter.megware.com  {96}
[0] MPI startup(): 4     397550  gmz10.benchmarkcenter.megware.com  {128}
[0] MPI startup(): 5     397557  gmz10.benchmarkcenter.megware.com  {160}
[0] MPI startup(): 6     397551  gmz10.benchmarkcenter.megware.com  {192}
[0] MPI startup(): 7     397556  gmz10.benchmarkcenter.megware.com  {224}
miniqmc not built from git repository
number of ranks : 8, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 8
OpenMP threads = 32
Number of walkers per rank = 32
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer                                       Inclusive_time  Exclusive_time  Calls   Time_per_call
Setup                                       0.0577          0.0577          1       0.057685486
  ParticleSet:::update                      0.0000          0.0000          1       0.000000380
Total                                       167.5654        0.0791          1       167.565422126
  Diffusion                                 107.4806        0.0351          5       21.496125225
    Complete Updates                        1.1446          0.0000          5       0.228928441
      DeterminantRef::update                1.1446          1.1446          10      0.114460212
    Current Gradient                        2.5607          0.0204          30720   0.000083355
      DeterminantRef::ratio                 2.5315          2.5315          30720   0.000082405
      OneBodyJastrowRef                     0.0053          0.0053          30720   0.000000171
      TwoBodyJastrowRef                     0.0036          0.0036          30720   0.000000116
    Kinetic Energy                          1.0750          1.0740          5       0.215000606
      OneBodyJastrowRef                     0.0006          0.0006          5       0.000127964
      TwoBodyJastrowRef                     0.0004          0.0004          5       0.000081690
    New Gradient                            16.0963         0.0250          30720   0.000523968
      DeterminantRef::ratio                 0.0509          0.0509          30720   0.000001656
      DeterminantRef::spovgl                15.3470         0.3830          30720   0.000499577
        Single-Particle Orbitals            14.9640         14.9640         30720   0.000487110
      OneBodyJastrowRef                     0.0731          0.0731          30720   0.000002380
      TwoBodyJastrowRef                     0.6003          0.6003          30720   0.000019540
    ParticleSet:::acceptMove                3.9211          0.0228          15371   0.000255096
      DTAAOMPTarget::update_e_e             3.8280          3.8280          15371   0.000249039
      DTABOMPTarget::update_ion_e           0.0702          0.0702          15371   0.000004570
    ParticleSet:::computeNewPosDT           1.2237          0.0121          30720   0.000039835
      DTAAOMPTarget::move_e_e               1.0491          1.0491          30720   0.000034151
      DTABOMPTarget::move_ion_e             0.1625          0.1625          30720   0.000005289
    ParticleSet:::donePbyP                  0.0000          0.0000          5       0.000001470
    Update                                  81.4241         0.0166          15371   0.005297256
      DeterminantRef::update                79.2259         79.2259         15371   0.005154244
      OneBodyJastrowRef                     0.0018          0.0018          15371   0.000000116
      TwoBodyJastrowRef                     2.1798          2.1798          15371   0.000141813
  Initialization                            9.4968          1.1902          1       9.496829463
    DeterminantRef::inverse                 5.1958          5.1958          2       2.597875685
    DeterminantRef::spovgl                  2.6335          0.1782          2       1.316753817
      Single-Particle Orbitals              2.4553          2.4553          6144    0.000399623
    OneBodyJastrowRef                       0.0139          0.0139          1       0.013921360
    ParticleSet:::update                    0.3873          0.0448          2       0.193645350
      DTAAOMPTarget::evaluate_e_e           0.3017          0.3017          1       0.301669622
      DTABOMPTarget::evaluate_ion_e         0.0408          0.0002          1       0.040785923
        DTABOMPTarget::offload_ion_e        0.0406          0.0406          1       0.040628913
    TwoBodyJastrowRef                       0.0762          0.0762          1       0.076156546
  Pseudopotential                           50.5089         0.1305          5       10.101770031
    DeterminantRef::spoval                  40.6916         0.3797          10215   0.003983515
      Single-Particle Orbitals              40.3119         40.3119         122580  0.000328862
    OneBodyJastrowRef                       0.0628          0.0628          10215   0.000006147
    ParticleSet:::update                    7.6997          0.0250          10215   0.000753768
      DTABOMPTarget::evaluate_e_virtual     7.0186          0.0097          10215   0.000687088
        DTABOMPTarget::offload_e_virtual    7.0089          7.0089          10215   0.000686141
      DTABOMPTarget::evaluate_ion_virtual   0.6562          0.0086          10215   0.000064237
        DTABOMPTarget::offload_ion_virtual  0.6476          0.6476          10215   0.000063396
    TwoBodyJastrowRef                       1.9243          1.9243          10215   0.000188375
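
The timer columns can be cross-checked with a short script. The sketch below is not part of the miniQMC or MAQAO output. It assumes that the eight timers listed are the direct children of Diffusion (inferred only from the fact that the numbers add up), that Inclusive_time equals Exclusive_time plus the children's inclusive times, and that Time_per_call is Inclusive_time divided by Calls. All variable names are illustrative.

```python
# Times in seconds, copied from the "Stack timer profile" in this log.
diffusion_inclusive = 107.4806
diffusion_exclusive = 0.0351
diffusion_calls = 5

# Assumption: these are the direct children of the Diffusion timer.
children = {
    "Complete Updates": 1.1446,
    "Current Gradient": 2.5607,
    "Kinetic Energy": 1.0750,
    "New Gradient": 16.0963,
    "ParticleSet:::acceptMove": 3.9211,
    "ParticleSet:::computeNewPosDT": 1.2237,
    "ParticleSet:::donePbyP": 0.0000,
    "Update": 81.4241,
}

# Inclusive time should be exclusive time plus the children's inclusive times.
reconstructed = diffusion_exclusive + sum(children.values())
residual = diffusion_inclusive - reconstructed

# Time per call should be inclusive time divided by the call count.
time_per_call = diffusion_inclusive / diffusion_calls

print(f"reconstructed inclusive time: {reconstructed:.4f} s")
print(f"residual vs. logged value:    {residual:+.4f} s")
print(f"time per call:                {time_per_call:.6f} s")

# Share of Diffusion spent in each child, largest first.
for name, seconds in sorted(children.items(), key=lambda kv: -kv[1]):
    print(f"{name:32s} {100.0 * seconds / diffusion_inclusive:6.2f} %")
```

Any residual should be no larger than the rounding of the four-decimal columns.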
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 3.54331e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 5.52412e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.91326e+08
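
The three throughput figures can be reproduced from values printed earlier in this log. The sketch below is not part of the miniQMC output. It assumes that N_walkers is the walker count summed over all MPI ranks (8 ranks x 32 walkers per rank), which is the reading that matches the logged numbers, and it uses the four-decimal inclusive times from the timer profile. The helper name `throughput` is illustrative.

```python
# Run parameters copied from the log.
mpi_ranks = 8            # "MPI processes = 8"
walkers_per_rank = 32    # "Number of walkers per rank = 32"
n_elec = 6144            # "Number of electrons = 6144"

# Inclusive times in seconds from the stack timer profile.
total_time = 167.5654
diffusion_time = 107.4806
pseudopotential_time = 50.5089

# Assumption: N_walkers in the throughput formulas counts walkers on all ranks.
n_walkers = mpi_ranks * walkers_per_rank


def throughput(power: int, seconds: float) -> float:
    """Return N_walkers * N_elec**power / seconds."""
    return n_walkers * n_elec**power / seconds


total_tp = throughput(3, total_time)
diffusion_tp = throughput(3, diffusion_time)
pseudo_tp = throughput(2, pseudopotential_time)

print(f"Total throughput           = {total_tp:.5e}")
print(f"Diffusion throughput       = {diffusion_tp:.5e}")
print(f"Pseudopotential throughput = {pseudo_tp:.5e}")
```

Small differences in the last printed digit are possible because the times used here are rounded to four decimals.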
Your experiment path is /home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1739383532/tools/lprof_npsu_run_0
To display your profiling results:
###################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1739383532/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1739383532/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1739383532/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1739383532/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1739383532/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1739383532/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1739383532/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_ZEN5/173-938-0042/intel/miniqmc/run/oneview_runs/defaults/gcc/oneview_results_1739383532/tools/lprof_npsu_run_0 #
###################################################################################################################################################################################################