* Info: Detected 2 Lprof instances in icp01.benchmarkcenter.megware.com.
If this is incorrect, rerun with number-processes-per-node=X
[0] MPI startup(): Intel(R) MPI Library, Version 2021.14 Build 20240911 (id: b3fc682)
[0] MPI startup(): Copyright (C) 2003-2024 Intel Corporation. All rights reserved.
[0] MPI startup(): library kind: release
[0] MPI startup(): Load tuning file: "/cluster/intel/oneapi/2025.0.0/mpi/2021.14/opt/mpi/etc/tuning_icx_shm.dat"
[0] MPI startup(): ===== CPU pinning =====
[0] MPI startup(): Rank Pid Node name Pin cpu
[0] MPI startup(): 0 26219 icp01.benchmarkcenter.megware.com {0}
[0] MPI startup(): 1 26304 icp01.benchmarkcenter.megware.com {36}
Tue Apr 8 16:13:51 2025: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: icp01.benchmarkcenter.megware.com
kernel name: 'Linux'
kernel release: '5.14.0-503.16.1.el9_5.x86_64'
processor: 'x86_64'
Build:
CC: '/cluster/intel/oneapi/2025.0.0/mpi/2021.14/bin/mpiicc'
compiler version: 'unknown'
CFLAGS: '-O3 -march=native -fsave-optimization-record -DDO_MPI -O3 -march=icelake-server -fno-tree-vectorize -fno-openmp-simd -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -cc=gcc -fopenmp -funroll-loops'
LDFLAGS: ' '
using MPI: true
Threading: OpenMP (36 threads)
Double Precision: true
Run Date/Time: 2025-04-08, 16:13:51
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 100
ny: 100
nz: 100
xproc: 2
yproc: 1
zproc: 1
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 4000000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 361.5000000000, 361.5000000000, 361.5000000000 ]
Decomposition data:
Processors : 2, 1, 1
Local boxes : 31, 62, 62 = 119164
Box size : [ 5.8306451613, 5.8306451613, 5.8306451613 ]
Box factor : [ 1.0074548875, 1.0074548875, 1.0074548875 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063303512, atom count : 4000000
Tue Apr 8 16:13:52 2025: Initialization Finished
Tue Apr 8 16:13:52 2025: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063303512 -1.243619295112 0.077555991600 600.0000 0.0000 4000000
10 10.00 -1.166059650500 -1.233157709949 0.067098059449 519.0938 0.0805 4000000
20 20.00 -1.166048438416 -1.208183014318 0.042134575902 325.9677 0.0845 4000000
30 30.00 -1.166037590737 -1.186586197151 0.020548606414 158.9711 0.0841 4000000
40 40.00 -1.166042093134 -1.183625399859 0.017583306724 136.0305 0.0836 4000000
50 50.00 -1.166051684893 -1.193713710258 0.027662025365 214.0030 0.0839 4000000
60 60.00 -1.166054646931 -1.202662201513 0.036607554582 283.2087 0.0838 4000000
70 70.00 -1.166052143011 -1.204911990844 0.038859847833 300.6332 0.0839 4000000
80 80.00 -1.166048803912 -1.203635015020 0.037586211108 290.7799 0.0838 4000000
90 90.00 -1.166048006780 -1.203820491599 0.037772484818 292.2210 0.0838 4000000
100 100.00 -1.166049793504 -1.206862845060 0.040813051556 315.7439 0.0838 4000000
Tue Apr 8 16:14:09 2025: Ending simulation
Simulation Validation:
Initial energy : -1.166063303512
Final energy : -1.166049793504
eFinal/eInitial : 0.999988
Final atom count : 4000000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 17.1761 17.1761 102.76
loop 1 16.7140 16.7140 100.00
timestep 10 1.6713 16.7132 99.99
position 100 0.0017 0.1685 1.01
velocity 200 0.0016 0.3146 1.88
redistribute 101 0.0479 4.8336 28.92
atomHalo 101 0.0206 2.0806 12.45
force 101 0.1142 11.5312 68.99
commHalo 303 0.0019 0.5763 3.45
commReduce 39 0.0002 0.0082 0.05
Timing Statistics Across 2 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 0: 17.1761 1: 17.1764 17.1762 0.0001
loop 0: 16.7140 1: 16.7141 16.7140 0.0000
timestep 0: 16.7132 1: 16.7134 16.7133 0.0001
position 1: 0.1661 0: 0.1685 0.1673 0.0012
velocity 1: 0.3076 0: 0.3146 0.3111 0.0035
redistribute 1: 4.7317 0: 4.8336 4.7827 0.0510
atomHalo 1: 2.0008 0: 2.0806 2.0407 0.0399
force 0: 11.5312 1: 11.6506 11.5909 0.0597
commHalo 1: 0.5246 0: 0.5763 0.5505 0.0259
commReduce 1: 0.0043 0: 0.0082 0.0063 0.0020
---------------------------------------------------
Average atom update rate: 0.08 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 0.04 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 23.93 atoms/us
---------------------------------------------------
Tue Apr 8 16:14:09 2025: CoMD Ending
Your experiment path is /beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9946/intel/CoMD/run/oneview_runs/compilers/gcc_5/oneview_results_1744121627/tools/lprof_npsu_run_0
To display your profiling results:
##########################################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
##########################################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9946/intel/CoMD/run/oneview_runs/compilers/gcc_5/oneview_results_1744121627/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9946/intel/CoMD/run/oneview_runs/compilers/gcc_5/oneview_results_1744121627/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9946/intel/CoMD/run/oneview_runs/compilers/gcc_5/oneview_results_1744121627/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9946/intel/CoMD/run/oneview_runs/compilers/gcc_5/oneview_results_1744121627/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9946/intel/CoMD/run/oneview_runs/compilers/gcc_5/oneview_results_1744121627/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9946/intel/CoMD/run/oneview_runs/compilers/gcc_5/oneview_results_1744121627/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9946/intel/CoMD/run/oneview_runs/compilers/gcc_5/oneview_results_1744121627/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_CPU_8360Y/174-411-9946/intel/CoMD/run/oneview_runs/compilers/gcc_5/oneview_results_1744121627/tools/lprof_npsu_run_0 #
##########################################################################################################################################################################################################################