* Info: Detected 8 Lprof instances in gmz10.benchmarkcenter.megware.com.
If this is incorrect, rerun with number-processes-per-node=X
[0] MPI startup(): Intel(R) MPI Library, Version 2021.14 Build 20240911 (id: b3fc682)
[0] MPI startup(): Copyright (C) 2003-2024 Intel Corporation. All rights reserved.
[0] MPI startup(): library kind: release
[0] MPI startup(): Load tuning file: "/cluster/intel/oneapi/2025.0.0/mpi/2021.14/opt/mpi/etc/tuning_generic_shm.dat"
[0] MPI startup(): ===== CPU pinning =====
[0] MPI startup(): Rank Pid Node name Pin cpu
[0] MPI startup(): 0 40948 gmz10.benchmarkcenter.megware.com {0}
[0] MPI startup(): 1 40955 gmz10.benchmarkcenter.megware.com {32}
[0] MPI startup(): 2 40973 gmz10.benchmarkcenter.megware.com {64}
[0] MPI startup(): 3 40954 gmz10.benchmarkcenter.megware.com {96}
[0] MPI startup(): 4 40947 gmz10.benchmarkcenter.megware.com {128}
[0] MPI startup(): 5 40957 gmz10.benchmarkcenter.megware.com {160}
[0] MPI startup(): 6 40946 gmz10.benchmarkcenter.megware.com {192}
[0] MPI startup(): 7 40949 gmz10.benchmarkcenter.megware.com {224}
Mon Feb 24 23:02:09 2025: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: gmz10.benchmarkcenter.megware.com
kernel name: 'Linux'
kernel release: '5.14.0-503.19.1.el9_5.x86_64'
processor: 'x86_64'
Build:
CC: '/cluster/intel/oneapi/2025.0.0/mpi/2021.14/bin/mpiicc'
compiler version: 'unknown'
CFLAGS: '-O3 -march=native -DDO_MPI -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -cc=gcc -fopenmp -funroll-loops'
LDFLAGS: ' '
using MPI: true
Threading: OpenMP (32 threads)
Double Precision: true
Run Date/Time: 2025-02-24, 23:02:09
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 200
ny: 200
nz: 200
xproc: 2
yproc: 2
zproc: 2
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 32000000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 723.0000000000, 723.0000000000, 723.0000000000 ]
Decomposition data:
Processors : 2, 2, 2
Local boxes : 62, 62, 62 = 238328
Box size : [ 5.8306451613, 5.8306451613, 5.8306451613 ]
Box factor : [ 1.0074548875, 1.0074548875, 1.0074548875 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063303487, atom count : 32000000
Mon Feb 24 23:02:09 2025: Initialization Finished
Mon Feb 24 23:02:09 2025: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063303487 -1.243619295087 0.077555991600 600.0000 0.0000 32000000
10 10.00 -1.166059648980 -1.233154817498 0.067095168517 519.0715 0.0541 32000000
20 20.00 -1.166048431576 -1.208173842947 0.042125411370 325.8968 0.0691 32000000
30 30.00 -1.166037581951 -1.186576153828 0.020538571877 158.8935 0.0718 32000000
40 40.00 -1.166042092491 -1.183622817462 0.017580724971 136.0106 0.0725 32000000
50 50.00 -1.166051684603 -1.193715522562 0.027663837959 214.0170 0.0728 32000000
60 60.00 -1.166054640401 -1.202662241274 0.036607600874 283.2091 0.0728 32000000
70 70.00 -1.166052133313 -1.204912537669 0.038860404356 300.6375 0.0727 32000000
80 80.00 -1.166048797816 -1.203644675872 0.037595878056 290.8547 0.0725 32000000
90 90.00 -1.166048009496 -1.203841392163 0.037793382667 292.3827 0.0724 32000000
100 100.00 -1.166049798760 -1.206885628636 0.040835829876 315.9201 0.0722 32000000
Mon Feb 24 23:02:37 2025: Ending simulation
Simulation Validation:
Initial energy : -1.166063303487
Final energy : -1.166049798760
eFinal/eInitial : 0.999988
Final atom count : 32000000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 28.5471 28.5471 101.53
loop 1 28.1170 28.1170 100.00
timestep 10 2.8116 28.1156 100.00
position 100 0.0046 0.4628 1.65
velocity 200 0.0028 0.5519 1.96
redistribute 101 0.0630 6.3593 22.62
atomHalo 101 0.0170 1.7137 6.10
force 101 0.2068 20.8859 74.28
commHalo 303 0.0016 0.4938 1.76
commReduce 39 0.0005 0.0208 0.07
Timing Statistics Across 8 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 0: 28.5471 5: 28.5472 28.5471 0.0001
loop 0: 28.1170 1: 28.1170 28.1170 0.0000
timestep 0: 28.1156 5: 28.1158 28.1158 0.0001
position 4: 0.4618 2: 0.4640 0.4631 0.0007
velocity 6: 0.5486 5: 0.5537 0.5505 0.0016
redistribute 1: 6.2333 0: 6.3593 6.3140 0.0393
atomHalo 1: 1.6216 0: 1.7137 1.6865 0.0299
force 0: 20.8859 1: 21.0196 20.9364 0.0412
commHalo 7: 0.3853 3: 0.5438 0.4395 0.0505
commReduce 1: 0.0094 3: 0.0313 0.0157 0.0069
---------------------------------------------------
Average atom update rate: 0.07 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 0.01 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 113.82 atoms/us
---------------------------------------------------
Mon Feb 24 23:02:37 2025: CoMD Ending
Your experiment path is /home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0
To display your profiling results:
################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0 #
################################################################################################################################################################################################