* Info: Detected 1 Lprof instances in ip-172-31-68-94: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting
* Warning: perf-events measurements are not allowed on node ip-172-31-68-94: selecting the no-perf engine. Try:
sudo sysctl -w kernel.perf_event_paranoid=1 (*)
To persist across reboots:
sudo sh -c 'echo kernel.perf_event_paranoid=1 >> /etc/sysctl.d/local.conf' (*)
(*) requires sudo permissions. If missing, contact administrators.
=1 allows both kernel+user-space measurements (=2: only user-space)
* Warning: The 'no-perf' engine is feature-limited and suffers higher overhead than other engines. It should be used only when perf-events are not available on the running Linux kernel - for instance with WSL1 (Windows Subsystem for Linux version 1) - or when the paranoid level (as displayed by 'sysctl kernel.perf_event_paranoid') cannot be lowered to 2 or less.
* Info: Process launched (host ip-172-31-68-94, process 544326)Wed Oct 18 15:50:15 2023: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: ip-172-31-68-94
kernel name: 'Linux'
kernel release: '6.2.0-1013-aws'
processor: 'x86_64'
Build:
CC: '/opt/intel/oneapi/mpi/2021.10.0/bin/mpiicc'
compiler version: 'icc: remark #10441: The Intel(R) C++ Compiler Classic (ICC) is deprecated and will be removed from product release in the second half of 2023. The Intel(R) oneAPI DPC++/C++ Compiler (ICX) is the recommended compiler moving forward. Please transition to use this compiler. Use '-diag-disable=10441' to disable this message.'
CFLAGS: '-O3 -march=native -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cc=clang -fopenmp=libomp'
LDFLAGS: ' '
using MPI: false
Threading: OpenMP (1 threads)
Double Precision: true
Run Date/Time: 2023-10-18, 15:50:15
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 40
ny: 40
nz: 40
xproc: 1
yproc: 1
zproc: 1
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 256000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 144.6000000000, 144.6000000000, 144.6000000000 ]
Decomposition data:
Processors : 1, 1, 1
Local boxes : 24, 24, 24 = 13824
Box size : [ 6.0250000000, 6.0250000000, 6.0250000000 ]
Box factor : [ 1.0410367171, 1.0410367171, 1.0410367171 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063303668, atom count : 256000
Wed Oct 18 15:50:15 2023: Initialization Finished
Wed Oct 18 15:50:15 2023: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063303668 -1.243619295268 0.077555991600 600.0000 0.0000 256000
10 10.00 -1.166059671303 -1.233152596044 0.067092924741 519.0541 0.9698 256000
20 20.00 -1.166048444584 -1.208161794319 0.042113349735 325.8035 1.2810 256000
30 30.00 -1.166037581449 -1.186571716246 0.020534134797 158.8592 1.4448 256000
40 40.00 -1.166042139511 -1.183655183678 0.017613044167 136.2606 1.4905 256000
50 50.00 -1.166051731640 -1.193748648207 0.027696916567 214.2729 1.5014 256000
60 60.00 -1.166054642856 -1.202645604514 0.036590961658 283.0803 1.5001 256000
70 70.00 -1.166052104338 -1.204856516141 0.038804411803 300.2044 1.4941 256000
80 80.00 -1.166048785823 -1.203606172709 0.037557386886 290.5569 1.4861 256000
90 90.00 -1.166048047012 -1.203854970624 0.037806923612 292.4874 1.4776 256000
100 100.00 -1.166049848389 -1.206925106266 0.040875257877 316.2251 1.4660 256000
Wed Oct 18 15:50:51 2023: Ending simulation
Simulation Validation:
Initial energy : -1.166063303668
Final energy : -1.166049848389
eFinal/eInitial : 0.999988
Final atom count : 256000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 36.4491 36.4491 100.90
loop 1 36.1252 36.1252 100.00
timestep 10 3.6125 36.1251 100.00
position 100 0.0024 0.2363 0.65
velocity 200 0.0011 0.2193 0.61
redistribute 101 0.0146 1.4754 4.08
atomHalo 101 0.0051 0.5126 1.42
force 101 0.3408 34.4216 95.28
commHalo 303 0.0005 0.1497 0.41
commReduce 39 0.0000 0.0000 0.00
Timing Statistics Across 1 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 0: 36.4491 0: 36.4491 36.4491 0.0000
loop 0: 36.1252 0: 36.1252 36.1252 0.0000
timestep 0: 36.1251 0: 36.1251 36.1251 0.0000
position 0: 0.2363 0: 0.2363 0.2363 0.0000
velocity 0: 0.2193 0: 0.2193 0.2193 0.0000
redistribute 0: 1.4754 0: 1.4754 1.4754 0.0000
atomHalo 0: 0.5126 0: 0.5126 0.5126 0.0000
force 0: 34.4216 0: 34.4216 34.4216 0.0000
commHalo 0: 0.1497 0: 0.1497 0.1497 0.0000
commReduce 0: 0.0000 0: 0.0000 0.0000 0.0000
---------------------------------------------------
Average atom update rate: 1.41 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 1.41 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 0.71 atoms/us
---------------------------------------------------
Wed Oct 18 15:50:51 2023: CoMD Ending
* Info: Process finished (host ip-172-31-68-94, process 544326)
* Warning: Restricted access to kernel symbols:
to see kernel functions in profiling results, reprofile as root
or execute sudo sysctl -w kernel.kptr_restrict=0.
To make kptr_restrict=0 persist across reboots:
sudo sh -c "echo kernel.kptr_restrict=0 >> /etc/sysctl.d/local.conf"
Your experiment path is /home/kcamus/qaas_runs/169-764-3754/intel/CoMD/run/oneview_runs/orig/oneview_results_1697644214/tools/lprof_npsu_run_0
To display your profiling results:
##################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
##################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/qaas_runs/169-764-3754/intel/CoMD/run/oneview_runs/orig/oneview_results_1697644214/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/qaas_runs/169-764-3754/intel/CoMD/run/oneview_runs/orig/oneview_results_1697644214/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/qaas_runs/169-764-3754/intel/CoMD/run/oneview_runs/orig/oneview_results_1697644214/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/qaas_runs/169-764-3754/intel/CoMD/run/oneview_runs/orig/oneview_results_1697644214/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/qaas_runs/169-764-3754/intel/CoMD/run/oneview_runs/orig/oneview_results_1697644214/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/qaas_runs/169-764-3754/intel/CoMD/run/oneview_runs/orig/oneview_results_1697644214/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/qaas_runs/169-764-3754/intel/CoMD/run/oneview_runs/orig/oneview_results_1697644214/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/qaas_runs/169-764-3754/intel/CoMD/run/oneview_runs/orig/oneview_results_1697644214/tools/lprof_npsu_run_0 #
##################################################################################################################################################################################