options

Executable Output


* Info: Detected 8 Lprof instances in gmz10.benchmarkcenter.megware.com. 
If this is incorrect, rerun with number-processes-per-node=X
[0] MPI startup(): Intel(R) MPI Library, Version 2021.14  Build 20240911 (id: b3fc682)
[0] MPI startup(): Copyright (C) 2003-2024 Intel Corporation.  All rights reserved.
[0] MPI startup(): library kind: release
[0] MPI startup(): Load tuning file: "/cluster/intel/oneapi/2025.0.0/mpi/2021.14/opt/mpi/etc/tuning_generic_shm.dat"
[0] MPI startup(): ===== CPU pinning =====
[0] MPI startup(): Rank    Pid      Node name                          Pin cpu
[0] MPI startup(): 0       40948    gmz10.benchmarkcenter.megware.com  {0}
[0] MPI startup(): 1       40955    gmz10.benchmarkcenter.megware.com  {32}
[0] MPI startup(): 2       40973    gmz10.benchmarkcenter.megware.com  {64}
[0] MPI startup(): 3       40954    gmz10.benchmarkcenter.megware.com  {96}
[0] MPI startup(): 4       40947    gmz10.benchmarkcenter.megware.com  {128}
[0] MPI startup(): 5       40957    gmz10.benchmarkcenter.megware.com  {160}
[0] MPI startup(): 6       40946    gmz10.benchmarkcenter.megware.com  {192}
[0] MPI startup(): 7       40949    gmz10.benchmarkcenter.megware.com  {224}
Mon Feb 24 23:02:09 2025: Starting Initialization


Mini-Application Name    : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
  hostname: gmz10.benchmarkcenter.megware.com
  kernel name: 'Linux'
  kernel release: '5.14.0-503.19.1.el9_5.x86_64'
  processor: 'x86_64'
Build:
  CC: '/cluster/intel/oneapi/2025.0.0/mpi/2021.14/bin/mpiicc'
  compiler version: 'unknown'
  CFLAGS: '-O3 -march=native -DDO_MPI -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -cc=gcc -fopenmp -funroll-loops'
  LDFLAGS: ' '
  using MPI: true
  Threading: OpenMP (32 threads) 
  Double Precision: true
Run Date/Time: 2025-02-24, 23:02:09

Command Line Parameters:
  doeam: 0
  potDir: pots
  potName: Cu_u6.eam
  potType: funcfl
  nx: 200
  ny: 200
  nz: 200
  xproc: 2
  yproc: 2
  zproc: 2
  Lattice constant: -1 Angstroms
  nSteps: 100
  printRate: 10
  Time step: 1 fs
  Initial Temperature: 600 K
  Initial Delta: 0 Angstroms

Simulation data: 
  Total atoms        : 32000000
  Min global bounds  : [   0.0000000000,   0.0000000000,   0.0000000000 ]
  Max global bounds  : [ 723.0000000000, 723.0000000000, 723.0000000000 ]

Decomposition data: 
  Processors         :      2,     2,     2
  Local boxes        :     62,    62,    62 =   238328
  Box size           : [   5.8306451613,   5.8306451613,   5.8306451613 ]
  Box factor         : [   1.0074548875,   1.0074548875,   1.0074548875 ] 
  Max Link Cell Occupancy: 32 of 64

Potential data: 
  Potential type   : Lennard-Jones
  Species name     : Cu
  Atomic number    : 29
  Mass             : 63.55 amu
  Lattice Type     : FCC
  Lattice spacing  : 3.615 Angstroms
  Cutoff           : 5.7875 Angstroms
  Epsilon          : 0.167 eV
  Sigma            : 2.315 Angstroms


Initial energy : -1.166063303487, atom count : 32000000 

Mon Feb 24 23:02:09 2025: Initialization Finished

Mon Feb 24 23:02:09 2025: Starting simulation

#                                                                                         Performance
#  Loop   Time(fs)       Total Energy   Potential Energy     Kinetic Energy  Temperature   (us/atom)     # Atoms
      0       0.00    -1.166063303487    -1.243619295087     0.077555991600     600.0000     0.0000     32000000
     10      10.00    -1.166059648980    -1.233154817498     0.067095168517     519.0715     0.0541     32000000
     20      20.00    -1.166048431576    -1.208173842947     0.042125411370     325.8968     0.0691     32000000
     30      30.00    -1.166037581951    -1.186576153828     0.020538571877     158.8935     0.0718     32000000
     40      40.00    -1.166042092491    -1.183622817462     0.017580724971     136.0106     0.0725     32000000
     50      50.00    -1.166051684603    -1.193715522562     0.027663837959     214.0170     0.0728     32000000
     60      60.00    -1.166054640401    -1.202662241274     0.036607600874     283.2091     0.0728     32000000
     70      70.00    -1.166052133313    -1.204912537669     0.038860404356     300.6375     0.0727     32000000
     80      80.00    -1.166048797816    -1.203644675872     0.037595878056     290.8547     0.0725     32000000
     90      90.00    -1.166048009496    -1.203841392163     0.037793382667     292.3827     0.0724     32000000
    100     100.00    -1.166049798760    -1.206885628636     0.040835829876     315.9201     0.0722     32000000
Mon Feb 24 23:02:37 2025: Ending simulation



Simulation Validation:
  Initial energy  : -1.166063303487
  Final energy    : -1.166049798760
  eFinal/eInitial : 0.999988
  Final atom count : 32000000, no atoms lost


Timings for Rank 0
        Timer        # Calls    Avg/Call (s)   Total (s)    % Loop
___________________________________________________________________
total                      1      28.5471       28.5471      101.53
loop                       1      28.1170       28.1170      100.00
timestep                  10       2.8116       28.1156      100.00
  position               100       0.0046        0.4628        1.65
  velocity               200       0.0028        0.5519        1.96
  redistribute           101       0.0630        6.3593       22.62
    atomHalo             101       0.0170        1.7137        6.10
  force                  101       0.2068       20.8859       74.28
commHalo                 303       0.0016        0.4938        1.76
commReduce                39       0.0005        0.0208        0.07

Timing Statistics Across 8 Ranks:
        Timer        Rank: Min(s)       Rank: Max(s)      Avg(s)    Stdev(s)
_____________________________________________________________________________
total                0:   28.5471       5:   28.5472     28.5471      0.0001
loop                 0:   28.1170       1:   28.1170     28.1170      0.0000
timestep             0:   28.1156       5:   28.1158     28.1158      0.0001
  position           4:    0.4618       2:    0.4640      0.4631      0.0007
  velocity           6:    0.5486       5:    0.5537      0.5505      0.0016
  redistribute       1:    6.2333       0:    6.3593      6.3140      0.0393
    atomHalo         1:    1.6216       0:    1.7137      1.6865      0.0299
  force              0:   20.8859       1:   21.0196     20.9364      0.0412
commHalo             7:    0.3853       3:    0.5438      0.4395      0.0505
commReduce           1:    0.0094       3:    0.0313      0.0157      0.0069

---------------------------------------------------
 Average atom update rate:       0.07 us/atom/task
---------------------------------------------------


---------------------------------------------------
 Average all atom update rate:   0.01 us/atom
---------------------------------------------------


---------------------------------------------------
 Average atom rate:            113.82 atoms/us
---------------------------------------------------

Mon Feb 24 23:02:37 2025: CoMD Ending



Your experiment path is /home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0

To display your profiling results:
################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                            COMMAND                                                                            #
################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_ZEN5/174-043-3878/intel/CoMD/run/oneview_runs/defaults/gcc/oneview_results_1740434525/tools/lprof_npsu_run_0  #
################################################################################################################################################################################################

×