options

Executable Output


* Info: Detected 1 Lprof instances in ip-172-31-47-249.ec2.internal. 
If this is incorrect, rerun with number-processes-per-node=X
Tue Mar  4 14:18:42 2025: Starting Initialization


Mini-Application Name    : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
  hostname: ip-172-31-47-249.ec2.internal
  kernel name: 'Linux'
  kernel release: '6.1.109-118.189.amzn2023.aarch64'
  processor: 'aarch64'
Build:
  CC: '/home/hbollore/soft/openmpi-5.0.6-gnu-14.2.0/bin/mpicc'
  compiler version: 'gcc (GCC) 14.2.0'
  CFLAGS: '-O3 -mcpu=native -DDO_MPI -O3 -mcpu=neoverse-v2 -larmpl -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -fopenmp -funroll-loops'
  LDFLAGS: ' '
  using MPI: true
  Threading: OpenMP (1 threads) 
  Double Precision: true
Run Date/Time: 2025-03-04, 14:18:42

Command Line Parameters:
  doeam: 0
  potDir: pots
  potName: Cu_u6.eam
  potType: funcfl
  nx: 100
  ny: 100
  nz: 100
  xproc: 1
  yproc: 1
  zproc: 1
  Lattice constant: -1 Angstroms
  nSteps: 100
  printRate: 10
  Time step: 1 fs
  Initial Temperature: 600 K
  Initial Delta: 0 Angstroms

Simulation data: 
  Total atoms        : 4000000
  Min global bounds  : [   0.0000000000,   0.0000000000,   0.0000000000 ]
  Max global bounds  : [ 361.5000000000, 361.5000000000, 361.5000000000 ]

Decomposition data: 
  Processors         :      1,     1,     1
  Local boxes        :     62,    62,    62 =   238328
  Box size           : [   5.8306451613,   5.8306451613,   5.8306451613 ]
  Box factor         : [   1.0074548875,   1.0074548875,   1.0074548875 ] 
  Max Link Cell Occupancy: 32 of 64

Potential data: 
  Potential type   : Lennard-Jones
  Species name     : Cu
  Atomic number    : 29
  Mass             : 63.55 amu
  Lattice Type     : FCC
  Lattice spacing  : 3.615 Angstroms
  Cutoff           : 5.7875 Angstroms
  Epsilon          : 0.167 eV
  Sigma            : 2.315 Angstroms


Initial energy : -1.166063303159, atom count : 4000000 

Tue Mar  4 14:18:47 2025: Initialization Finished

Tue Mar  4 14:18:47 2025: Starting simulation

#                                                                                         Performance
#  Loop   Time(fs)       Total Energy   Potential Energy     Kinetic Energy  Temperature   (us/atom)     # Atoms
      0       0.00    -1.166063303159    -1.243619294759     0.077555991600     600.0000     0.0000      4000000
     10      10.00    -1.166059650500    -1.233157709949     0.067098059449     519.0938     1.0950      4000000
     20      20.00    -1.166048438416    -1.208183014318     0.042134575902     325.9677     1.2986      4000000
     30      30.00    -1.166037590737    -1.186586197151     0.020548606414     158.9711     1.3542      4000000
     40      40.00    -1.166042093136    -1.183625399860     0.017583306724     136.0305     1.3696      4000000
     50      50.00    -1.166051684893    -1.193713710258     0.027662025365     214.0030     1.3737      4000000
     60      60.00    -1.166054646931    -1.202662201513     0.036607554582     283.2087     1.3728      4000000
     70      70.00    -1.166052143011    -1.204911990844     0.038859847833     300.6332     1.3704      4000000
     80      80.00    -1.166048803911    -1.203635015019     0.037586211108     290.7799     1.3669      4000000
     90      90.00    -1.166048006782    -1.203820491600     0.037772484818     292.2210     1.3635      4000000
    100     100.00    -1.166049793505    -1.206862845061     0.040813051556     315.7439     1.3597      4000000
Tue Mar  4 14:27:40 2025: Ending simulation



Simulation Validation:
  Initial energy  : -1.166063303159
  Final energy    : -1.166049793505
  eFinal/eInitial : 0.999988
  Final atom count : 4000000, no atoms lost


Timings for Rank 0
        Timer        # Calls    Avg/Call (s)   Total (s)    % Loop
___________________________________________________________________
total                      1     537.8877      537.8877      100.92
loop                       1     532.9816      532.9816      100.00
timestep                  10      53.2981      532.9805      100.00
  position               100       0.0152        1.5156        0.28
  velocity               200       0.0133        2.6623        0.50
  redistribute           101       0.0924        9.3292        1.75
    atomHalo             101       0.0177        1.7869        0.34
  force                  101       5.1820      523.3834       98.20
commHalo                 303       0.0013        0.4044        0.08
commReduce                39       0.0000        0.0001        0.00

Timing Statistics Across 1 Ranks:
        Timer        Rank: Min(s)       Rank: Max(s)      Avg(s)    Stdev(s)
_____________________________________________________________________________
total                0:  537.8877       0:  537.8877    537.8877      0.0000
loop                 0:  532.9816       0:  532.9816    532.9816      0.0000
timestep             0:  532.9805       0:  532.9805    532.9805      0.0000
  position           0:    1.5156       0:    1.5156      1.5156      0.0000
  velocity           0:    2.6623       0:    2.6623      2.6623      0.0000
  redistribute       0:    9.3292       0:    9.3292      9.3292      0.0000
    atomHalo         0:    1.7869       0:    1.7869      1.7869      0.0000
  force              0:  523.3834       0:  523.3834    523.3834      0.0000
commHalo             0:    0.4044       0:    0.4044      0.4044      0.0000
commReduce           0:    0.0001       0:    0.0001      0.0001      0.0000

---------------------------------------------------
 Average atom update rate:       1.33 us/atom/task
---------------------------------------------------


---------------------------------------------------
 Average all atom update rate:   1.33 us/atom
---------------------------------------------------


---------------------------------------------------
 Average atom rate:              0.75 atoms/us
---------------------------------------------------

Tue Mar  4 14:27:40 2025: CoMD Ending



Your experiment path is /home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_0

To display your profiling results:
####################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                              COMMAND                                                                              #
####################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_0  #
####################################################################################################################################################################################################


* Info: Detected 1 Lprof instances in ip-172-31-47-249.ec2.internal. 
If this is incorrect, rerun with number-processes-per-node=X
Tue Mar  4 14:27:46 2025: Starting Initialization


Mini-Application Name    : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
  hostname: ip-172-31-47-249.ec2.internal
  kernel name: 'Linux'
  kernel release: '6.1.109-118.189.amzn2023.aarch64'
  processor: 'aarch64'
Build:
  CC: '/home/hbollore/soft/openmpi-5.0.6-gnu-14.2.0/bin/mpicc'
  compiler version: 'gcc (GCC) 14.2.0'
  CFLAGS: '-O3 -mcpu=native -DDO_MPI -O3 -mcpu=neoverse-v2 -larmpl -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -fopenmp -funroll-loops'
  LDFLAGS: ' '
  using MPI: true
  Threading: OpenMP (2 threads) 
  Double Precision: true
Run Date/Time: 2025-03-04, 14:27:46

Command Line Parameters:
  doeam: 0
  potDir: pots
  potName: Cu_u6.eam
  potType: funcfl
  nx: 100
  ny: 100
  nz: 100
  xproc: 1
  yproc: 1
  zproc: 1
  Lattice constant: -1 Angstroms
  nSteps: 100
  printRate: 10
  Time step: 1 fs
  Initial Temperature: 600 K
  Initial Delta: 0 Angstroms

Simulation data: 
  Total atoms        : 4000000
  Min global bounds  : [   0.0000000000,   0.0000000000,   0.0000000000 ]
  Max global bounds  : [ 361.5000000000, 361.5000000000, 361.5000000000 ]

Decomposition data: 
  Processors         :      1,     1,     1
  Local boxes        :     62,    62,    62 =   238328
  Box size           : [   5.8306451613,   5.8306451613,   5.8306451613 ]
  Box factor         : [   1.0074548875,   1.0074548875,   1.0074548875 ] 
  Max Link Cell Occupancy: 32 of 64

Potential data: 
  Potential type   : Lennard-Jones
  Species name     : Cu
  Atomic number    : 29
  Mass             : 63.55 amu
  Lattice Type     : FCC
  Lattice spacing  : 3.615 Angstroms
  Cutoff           : 5.7875 Angstroms
  Epsilon          : 0.167 eV
  Sigma            : 2.315 Angstroms


Initial energy : -1.166063303588, atom count : 4000000 

Tue Mar  4 14:27:48 2025: Initialization Finished

Tue Mar  4 14:27:48 2025: Starting simulation

#                                                                                         Performance
#  Loop   Time(fs)       Total Energy   Potential Energy     Kinetic Energy  Temperature   (us/atom)     # Atoms
      0       0.00    -1.166063303588    -1.243619295188     0.077555991600     600.0000     0.0000      4000000
     10      10.00    -1.166059650499    -1.233157709948     0.067098059449     519.0938     0.5507      4000000
     20      20.00    -1.166048438417    -1.208183014318     0.042134575902     325.9677     0.6517      4000000
     30      30.00    -1.166037590737    -1.186586197151     0.020548606414     158.9711     0.6792      4000000
     40      40.00    -1.166042093135    -1.183625399859     0.017583306724     136.0305     0.6868      4000000
     50      50.00    -1.166051684893    -1.193713710257     0.027662025365     214.0030     0.6888      4000000
     60      60.00    -1.166054646931    -1.202662201513     0.036607554582     283.2087     0.6886      4000000
     70      70.00    -1.166052143011    -1.204911990845     0.038859847833     300.6332     0.6874      4000000
     80      80.00    -1.166048803912    -1.203635015020     0.037586211108     290.7799     0.6857      4000000
     90      90.00    -1.166048006781    -1.203820491599     0.037772484818     292.2210     0.6841      4000000
    100     100.00    -1.166049793504    -1.206862845060     0.040813051556     315.7439     0.6821      4000000
Tue Mar  4 14:32:16 2025: Ending simulation



Simulation Validation:
  Initial energy  : -1.166063303588
  Final energy    : -1.166049793504
  eFinal/eInitial : 0.999988
  Final atom count : 4000000, no atoms lost


Timings for Rank 0
        Timer        # Calls    Avg/Call (s)   Total (s)    % Loop
___________________________________________________________________
total                      1     270.0490      270.0490      100.99
loop                       1     267.3988      267.3988      100.00
timestep                  10      26.7398      267.3977      100.00
  position               100       0.0079        0.7857        0.29
  velocity               200       0.0068        1.3590        0.51
  redistribute           101       0.0624        6.3058        2.36
    atomHalo             101       0.0177        1.7880        0.67
  force                  101       2.5833      260.9138       97.57
commHalo                 303       0.0014        0.4129        0.15
commReduce                39       0.0000        0.0001        0.00

Timing Statistics Across 1 Ranks:
        Timer        Rank: Min(s)       Rank: Max(s)      Avg(s)    Stdev(s)
_____________________________________________________________________________
total                0:  270.0490       0:  270.0490    270.0490      0.0000
loop                 0:  267.3988       0:  267.3988    267.3988      0.0000
timestep             0:  267.3977       0:  267.3977    267.3977      0.0000
  position           0:    0.7857       0:    0.7857      0.7857      0.0000
  velocity           0:    1.3590       0:    1.3590      1.3590      0.0000
  redistribute       0:    6.3058       0:    6.3058      6.3058      0.0000
    atomHalo         0:    1.7880       0:    1.7880      1.7880      0.0000
  force              0:  260.9138       0:  260.9138    260.9138      0.0000
commHalo             0:    0.4129       0:    0.4129      0.4129      0.0000
commReduce           0:    0.0001       0:    0.0001      0.0001      0.0000

---------------------------------------------------
 Average atom update rate:       0.67 us/atom/task
---------------------------------------------------


---------------------------------------------------
 Average all atom update rate:   0.67 us/atom
---------------------------------------------------


---------------------------------------------------
 Average atom rate:              1.50 atoms/us
---------------------------------------------------

Tue Mar  4 14:32:16 2025: CoMD Ending



Your experiment path is /home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_1

To display your profiling results:
####################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                              COMMAND                                                                              #
####################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_1  #
####################################################################################################################################################################################################


* Info: Detected 1 Lprof instances in ip-172-31-47-249.ec2.internal. 
If this is incorrect, rerun with number-processes-per-node=X
Tue Mar  4 14:32:21 2025: Starting Initialization


Mini-Application Name    : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
  hostname: ip-172-31-47-249.ec2.internal
  kernel name: 'Linux'
  kernel release: '6.1.109-118.189.amzn2023.aarch64'
  processor: 'aarch64'
Build:
  CC: '/home/hbollore/soft/openmpi-5.0.6-gnu-14.2.0/bin/mpicc'
  compiler version: 'gcc (GCC) 14.2.0'
  CFLAGS: '-O3 -mcpu=native -DDO_MPI -O3 -mcpu=neoverse-v2 -larmpl -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -fopenmp -funroll-loops'
  LDFLAGS: ' '
  using MPI: true
  Threading: OpenMP (4 threads) 
  Double Precision: true
Run Date/Time: 2025-03-04, 14:32:21

Command Line Parameters:
  doeam: 0
  potDir: pots
  potName: Cu_u6.eam
  potType: funcfl
  nx: 100
  ny: 100
  nz: 100
  xproc: 1
  yproc: 1
  zproc: 1
  Lattice constant: -1 Angstroms
  nSteps: 100
  printRate: 10
  Time step: 1 fs
  Initial Temperature: 600 K
  Initial Delta: 0 Angstroms

Simulation data: 
  Total atoms        : 4000000
  Min global bounds  : [   0.0000000000,   0.0000000000,   0.0000000000 ]
  Max global bounds  : [ 361.5000000000, 361.5000000000, 361.5000000000 ]

Decomposition data: 
  Processors         :      1,     1,     1
  Local boxes        :     62,    62,    62 =   238328
  Box size           : [   5.8306451613,   5.8306451613,   5.8306451613 ]
  Box factor         : [   1.0074548875,   1.0074548875,   1.0074548875 ] 
  Max Link Cell Occupancy: 32 of 64

Potential data: 
  Potential type   : Lennard-Jones
  Species name     : Cu
  Atomic number    : 29
  Mass             : 63.55 amu
  Lattice Type     : FCC
  Lattice spacing  : 3.615 Angstroms
  Cutoff           : 5.7875 Angstroms
  Epsilon          : 0.167 eV
  Sigma            : 2.315 Angstroms


Initial energy : -1.166063304092, atom count : 4000000 

Tue Mar  4 14:32:23 2025: Initialization Finished

Tue Mar  4 14:32:23 2025: Starting simulation

#                                                                                         Performance
#  Loop   Time(fs)       Total Energy   Potential Energy     Kinetic Energy  Temperature   (us/atom)     # Atoms
      0       0.00    -1.166063304092    -1.243619295692     0.077555991600     600.0000     0.0000      4000000
     10      10.00    -1.166059650499    -1.233157709949     0.067098059449     519.0938     0.2804      4000000
     20      20.00    -1.166048438416    -1.208183014318     0.042134575902     325.9677     0.3311      4000000
     30      30.00    -1.166037590737    -1.186586197151     0.020548606414     158.9711     0.3447      4000000
     40      40.00    -1.166042093134    -1.183625399859     0.017583306724     136.0305     0.3486      4000000
     50      50.00    -1.166051684893    -1.193713710258     0.027662025365     214.0030     0.3497      4000000
     60      60.00    -1.166054646931    -1.202662201513     0.036607554582     283.2087     0.3495      4000000
     70      70.00    -1.166052143011    -1.204911990844     0.038859847833     300.6332     0.3489      4000000
     80      80.00    -1.166048803912    -1.203635015020     0.037586211108     290.7799     0.3481      4000000
     90      90.00    -1.166048006780    -1.203820491599     0.037772484818     292.2210     0.3473      4000000
    100     100.00    -1.166049793504    -1.206862845060     0.040813051556     315.7439     0.3463      4000000
Tue Mar  4 14:34:39 2025: Ending simulation



Simulation Validation:
  Initial energy  : -1.166063304092
  Final energy    : -1.166049793504
  eFinal/eInitial : 0.999988
  Final atom count : 4000000, no atoms lost


Timings for Rank 0
        Timer        # Calls    Avg/Call (s)   Total (s)    % Loop
___________________________________________________________________
total                      1     137.3230      137.3230      101.13
loop                       1     135.7886      135.7886      100.00
timestep                  10      13.5787      135.7874      100.00
  position               100       0.0042        0.4194        0.31
  velocity               200       0.0036        0.7261        0.53
  redistribute           101       0.0494        4.9893        3.67
    atomHalo             101       0.0176        1.7776        1.31
  force                  101       1.2936      130.6541       96.22
commHalo                 303       0.0014        0.4104        0.30
commReduce                39       0.0000        0.0001        0.00

Timing Statistics Across 1 Ranks:
        Timer        Rank: Min(s)       Rank: Max(s)      Avg(s)    Stdev(s)
_____________________________________________________________________________
total                0:  137.3230       0:  137.3230    137.3230      0.0000
loop                 0:  135.7886       0:  135.7886    135.7886      0.0000
timestep             0:  135.7874       0:  135.7874    135.7874      0.0000
  position           0:    0.4194       0:    0.4194      0.4194      0.0000
  velocity           0:    0.7261       0:    0.7261      0.7261      0.0000
  redistribute       0:    4.9893       0:    4.9893      4.9893      0.0000
    atomHalo         0:    1.7776       0:    1.7776      1.7776      0.0000
  force              0:  130.6541       0:  130.6541    130.6541      0.0000
commHalo             0:    0.4104       0:    0.4104      0.4104      0.0000
commReduce           0:    0.0001       0:    0.0001      0.0001      0.0000

---------------------------------------------------
 Average atom update rate:       0.34 us/atom/task
---------------------------------------------------


---------------------------------------------------
 Average all atom update rate:   0.34 us/atom
---------------------------------------------------


---------------------------------------------------
 Average atom rate:              2.95 atoms/us
---------------------------------------------------

Tue Mar  4 14:34:39 2025: CoMD Ending



Your experiment path is /home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_2

To display your profiling results:
####################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                              COMMAND                                                                              #
####################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_2  #
####################################################################################################################################################################################################


* Info: Detected 1 Lprof instances in ip-172-31-47-249.ec2.internal. 
If this is incorrect, rerun with number-processes-per-node=X
Tue Mar  4 14:34:44 2025: Starting Initialization


Mini-Application Name    : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
  hostname: ip-172-31-47-249.ec2.internal
  kernel name: 'Linux'
  kernel release: '6.1.109-118.189.amzn2023.aarch64'
  processor: 'aarch64'
Build:
  CC: '/home/hbollore/soft/openmpi-5.0.6-gnu-14.2.0/bin/mpicc'
  compiler version: 'gcc (GCC) 14.2.0'
  CFLAGS: '-O3 -mcpu=native -DDO_MPI -O3 -mcpu=neoverse-v2 -larmpl -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -fopenmp -funroll-loops'
  LDFLAGS: ' '
  using MPI: true
  Threading: OpenMP (8 threads) 
  Double Precision: true
Run Date/Time: 2025-03-04, 14:34:44

Command Line Parameters:
  doeam: 0
  potDir: pots
  potName: Cu_u6.eam
  potType: funcfl
  nx: 100
  ny: 100
  nz: 100
  xproc: 1
  yproc: 1
  zproc: 1
  Lattice constant: -1 Angstroms
  nSteps: 100
  printRate: 10
  Time step: 1 fs
  Initial Temperature: 600 K
  Initial Delta: 0 Angstroms

Simulation data: 
  Total atoms        : 4000000
  Min global bounds  : [   0.0000000000,   0.0000000000,   0.0000000000 ]
  Max global bounds  : [ 361.5000000000, 361.5000000000, 361.5000000000 ]

Decomposition data: 
  Processors         :      1,     1,     1
  Local boxes        :     62,    62,    62 =   238328
  Box size           : [   5.8306451613,   5.8306451613,   5.8306451613 ]
  Box factor         : [   1.0074548875,   1.0074548875,   1.0074548875 ] 
  Max Link Cell Occupancy: 32 of 64

Potential data: 
  Potential type   : Lennard-Jones
  Species name     : Cu
  Atomic number    : 29
  Mass             : 63.55 amu
  Lattice Type     : FCC
  Lattice spacing  : 3.615 Angstroms
  Cutoff           : 5.7875 Angstroms
  Epsilon          : 0.167 eV
  Sigma            : 2.315 Angstroms


Initial energy : -1.166063303067, atom count : 4000000 

Tue Mar  4 14:34:45 2025: Initialization Finished

Tue Mar  4 14:34:45 2025: Starting simulation

#                                                                                         Performance
#  Loop   Time(fs)       Total Energy   Potential Energy     Kinetic Energy  Temperature   (us/atom)     # Atoms
      0       0.00    -1.166063303067    -1.243619294667     0.077555991600     600.0000     0.0000      4000000
     10      10.00    -1.166059650499    -1.233157709949     0.067098059449     519.0938     0.1464      4000000
     20      20.00    -1.166048438416    -1.208183014318     0.042134575902     325.9677     0.1711      4000000
     30      30.00    -1.166037590737    -1.186586197151     0.020548606414     158.9711     0.1781      4000000
     40      40.00    -1.166042093134    -1.183625399859     0.017583306724     136.0305     0.1805      4000000
     50      50.00    -1.166051684893    -1.193713710257     0.027662025365     214.0030     0.1813      4000000
     60      60.00    -1.166054646930    -1.202662201512     0.036607554582     283.2087     0.1813      4000000
     70      70.00    -1.166052143011    -1.204911990845     0.038859847833     300.6332     0.1809      4000000
     80      80.00    -1.166048803912    -1.203635015020     0.037586211108     290.7799     0.1804      4000000
     90      90.00    -1.166048006780    -1.203820491599     0.037772484818     292.2210     0.1799      4000000
    100     100.00    -1.166049793504    -1.206862845060     0.040813051556     315.7439     0.1793      4000000
Tue Mar  4 14:35:56 2025: Ending simulation



Simulation Validation:
  Initial energy  : -1.166063303067
  Final energy    : -1.166049793504
  eFinal/eInitial : 0.999988
  Final atom count : 4000000, no atoms lost


Timings for Rank 0
        Timer        # Calls    Avg/Call (s)   Total (s)    % Loop
___________________________________________________________________
total                      1      71.3542       71.3542      101.39
loop                       1      70.3747       70.3747      100.00
timestep                  10       7.0374       70.3735      100.00
  position               100       0.0025        0.2459        0.35
  velocity               200       0.0022        0.4361        0.62
  redistribute           101       0.0423        4.2674        6.06
    atomHalo             101       0.0177        1.7829        2.53
  force                  101       0.6529       65.9462       93.71
commHalo                 303       0.0013        0.4075        0.58
commReduce                39       0.0000        0.0001        0.00

Timing Statistics Across 1 Ranks:
        Timer        Rank: Min(s)       Rank: Max(s)      Avg(s)    Stdev(s)
_____________________________________________________________________________
total                0:   71.3542       0:   71.3542     71.3542      0.0000
loop                 0:   70.3747       0:   70.3747     70.3747      0.0000
timestep             0:   70.3735       0:   70.3735     70.3735      0.0000
  position           0:    0.2459       0:    0.2459      0.2459      0.0000
  velocity           0:    0.4361       0:    0.4361      0.4361      0.0000
  redistribute       0:    4.2674       0:    4.2674      4.2674      0.0000
    atomHalo         0:    1.7829       0:    1.7829      1.7829      0.0000
  force              0:   65.9462       0:   65.9462     65.9462      0.0000
commHalo             0:    0.4075       0:    0.4075      0.4075      0.0000
commReduce           0:    0.0001       0:    0.0001      0.0001      0.0000

---------------------------------------------------
 Average atom update rate:       0.18 us/atom/task
---------------------------------------------------


---------------------------------------------------
 Average all atom update rate:   0.18 us/atom
---------------------------------------------------


---------------------------------------------------
 Average atom rate:              5.68 atoms/us
---------------------------------------------------

Tue Mar  4 14:35:56 2025: CoMD Ending



Your experiment path is /home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_3

To display your profiling results:
####################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                              COMMAND                                                                              #
####################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_3  #
####################################################################################################################################################################################################


* Info: Detected 1 Lprof instances in ip-172-31-47-249.ec2.internal. 
If this is incorrect, rerun with number-processes-per-node=X
Tue Mar  4 14:36:01 2025: Starting Initialization


Mini-Application Name    : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
  hostname: ip-172-31-47-249.ec2.internal
  kernel name: 'Linux'
  kernel release: '6.1.109-118.189.amzn2023.aarch64'
  processor: 'aarch64'
Build:
  CC: '/home/hbollore/soft/openmpi-5.0.6-gnu-14.2.0/bin/mpicc'
  compiler version: 'gcc (GCC) 14.2.0'
  CFLAGS: '-O3 -mcpu=native -DDO_MPI -O3 -mcpu=neoverse-v2 -larmpl -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -fopenmp -funroll-loops'
  LDFLAGS: ' '
  using MPI: true
  Threading: OpenMP (16 threads) 
  Double Precision: true
Run Date/Time: 2025-03-04, 14:36:01

Command Line Parameters:
  doeam: 0
  potDir: pots
  potName: Cu_u6.eam
  potType: funcfl
  nx: 100
  ny: 100
  nz: 100
  xproc: 1
  yproc: 1
  zproc: 1
  Lattice constant: -1 Angstroms
  nSteps: 100
  printRate: 10
  Time step: 1 fs
  Initial Temperature: 600 K
  Initial Delta: 0 Angstroms

Simulation data: 
  Total atoms        : 4000000
  Min global bounds  : [   0.0000000000,   0.0000000000,   0.0000000000 ]
  Max global bounds  : [ 361.5000000000, 361.5000000000, 361.5000000000 ]

Decomposition data: 
  Processors         :      1,     1,     1
  Local boxes        :     62,    62,    62 =   238328
  Box size           : [   5.8306451613,   5.8306451613,   5.8306451613 ]
  Box factor         : [   1.0074548875,   1.0074548875,   1.0074548875 ] 
  Max Link Cell Occupancy: 32 of 64

Potential data: 
  Potential type   : Lennard-Jones
  Species name     : Cu
  Atomic number    : 29
  Mass             : 63.55 amu
  Lattice Type     : FCC
  Lattice spacing  : 3.615 Angstroms
  Cutoff           : 5.7875 Angstroms
  Epsilon          : 0.167 eV
  Sigma            : 2.315 Angstroms


Initial energy : -1.166063303663, atom count : 4000000 

Tue Mar  4 14:36:02 2025: Initialization Finished

Tue Mar  4 14:36:02 2025: Starting simulation

#                                                                                         Performance
#  Loop   Time(fs)       Total Energy   Potential Energy     Kinetic Energy  Temperature   (us/atom)     # Atoms
      0       0.00    -1.166063303663    -1.243619295263     0.077555991600     600.0000     0.0000      4000000
     10      10.00    -1.166059650500    -1.233157709949     0.067098059449     519.0938     0.0778      4000000
     20      20.00    -1.166048438416    -1.208183014318     0.042134575902     325.9677     0.0909      4000000
     30      30.00    -1.166037590737    -1.186586197151     0.020548606414     158.9711     0.0943      4000000
     40      40.00    -1.166042093134    -1.183625399859     0.017583306724     136.0305     0.0954      4000000
     50      50.00    -1.166051684893    -1.193713710258     0.027662025365     214.0030     0.0958      4000000
     60      60.00    -1.166054646931    -1.202662201513     0.036607554582     283.2087     0.0958      4000000
     70      70.00    -1.166052143011    -1.204911990845     0.038859847833     300.6332     0.0957      4000000
     80      80.00    -1.166048803912    -1.203635015020     0.037586211108     290.7799     0.0954      4000000
     90      90.00    -1.166048006780    -1.203820491599     0.037772484818     292.2210     0.0951      4000000
    100     100.00    -1.166049793504    -1.206862845060     0.040813051556     315.7439     0.0949      4000000
Tue Mar  4 14:36:39 2025: Ending simulation



Simulation Validation:
  Initial energy  : -1.166063303663
  Final energy    : -1.166049793504
  eFinal/eInitial : 0.999988
  Final atom count : 4000000, no atoms lost


Timings for Rank 0
        Timer        # Calls    Avg/Call (s)   Total (s)    % Loop
___________________________________________________________________
total                      1      37.9329       37.9329      101.86
loop                       1      37.2402       37.2402      100.00
timestep                  10       3.7239       37.2391      100.00
  position               100       0.0019        0.1855        0.50
  velocity               200       0.0014        0.2884        0.77
  redistribute           101       0.0364        3.6758        9.87
    atomHalo             101       0.0174        1.7566        4.72
  force                  101       0.3303       33.3645       89.59
commHalo                 303       0.0013        0.4049        1.09
commReduce                39       0.0000        0.0001        0.00

Timing Statistics Across 1 Ranks:
        Timer        Rank: Min(s)       Rank: Max(s)      Avg(s)    Stdev(s)
_____________________________________________________________________________
total                0:   37.9329       0:   37.9329     37.9329      0.0000
loop                 0:   37.2402       0:   37.2402     37.2402      0.0000
timestep             0:   37.2391       0:   37.2391     37.2391      0.0000
  position           0:    0.1855       0:    0.1855      0.1855      0.0000
  velocity           0:    0.2884       0:    0.2884      0.2884      0.0000
  redistribute       0:    3.6758       0:    3.6758      3.6758      0.0000
    atomHalo         0:    1.7566       0:    1.7566      1.7566      0.0000
  force              0:   33.3645       0:   33.3645     33.3645      0.0000
commHalo             0:    0.4049       0:    0.4049      0.4049      0.0000
commReduce           0:    0.0001       0:    0.0001      0.0001      0.0000

---------------------------------------------------
 Average atom update rate:       0.09 us/atom/task
---------------------------------------------------


---------------------------------------------------
 Average all atom update rate:   0.09 us/atom
---------------------------------------------------


---------------------------------------------------
 Average atom rate:             10.74 atoms/us
---------------------------------------------------

Tue Mar  4 14:36:39 2025: CoMD Ending



Your experiment path is /home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_4

To display your profiling results:
####################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                              COMMAND                                                                              #
####################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_4  #
####################################################################################################################################################################################################


* Info: Detected 1 Lprof instances in ip-172-31-47-249.ec2.internal. 
If this is incorrect, rerun with number-processes-per-node=X
Tue Mar  4 14:36:45 2025: Starting Initialization


Mini-Application Name    : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
  hostname: ip-172-31-47-249.ec2.internal
  kernel name: 'Linux'
  kernel release: '6.1.109-118.189.amzn2023.aarch64'
  processor: 'aarch64'
Build:
  CC: '/home/hbollore/soft/openmpi-5.0.6-gnu-14.2.0/bin/mpicc'
  compiler version: 'gcc (GCC) 14.2.0'
  CFLAGS: '-O3 -mcpu=native -DDO_MPI -O3 -mcpu=neoverse-v2 -larmpl -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -fopenmp -funroll-loops'
  LDFLAGS: ' '
  using MPI: true
  Threading: OpenMP (24 threads) 
  Double Precision: true
Run Date/Time: 2025-03-04, 14:36:45

Command Line Parameters:
  doeam: 0
  potDir: pots
  potName: Cu_u6.eam
  potType: funcfl
  nx: 100
  ny: 100
  nz: 100
  xproc: 1
  yproc: 1
  zproc: 1
  Lattice constant: -1 Angstroms
  nSteps: 100
  printRate: 10
  Time step: 1 fs
  Initial Temperature: 600 K
  Initial Delta: 0 Angstroms

Simulation data: 
  Total atoms        : 4000000
  Min global bounds  : [   0.0000000000,   0.0000000000,   0.0000000000 ]
  Max global bounds  : [ 361.5000000000, 361.5000000000, 361.5000000000 ]

Decomposition data: 
  Processors         :      1,     1,     1
  Local boxes        :     62,    62,    62 =   238328
  Box size           : [   5.8306451613,   5.8306451613,   5.8306451613 ]
  Box factor         : [   1.0074548875,   1.0074548875,   1.0074548875 ] 
  Max Link Cell Occupancy: 32 of 64

Potential data: 
  Potential type   : Lennard-Jones
  Species name     : Cu
  Atomic number    : 29
  Mass             : 63.55 amu
  Lattice Type     : FCC
  Lattice spacing  : 3.615 Angstroms
  Cutoff           : 5.7875 Angstroms
  Epsilon          : 0.167 eV
  Sigma            : 2.315 Angstroms


Initial energy : -1.166063303545, atom count : 4000000 

Tue Mar  4 14:36:45 2025: Initialization Finished

Tue Mar  4 14:36:45 2025: Starting simulation

#                                                                                         Performance
#  Loop   Time(fs)       Total Energy   Potential Energy     Kinetic Energy  Temperature   (us/atom)     # Atoms
      0       0.00    -1.166063303545    -1.243619295145     0.077555991600     600.0000     0.0000      4000000
     10      10.00    -1.166059650500    -1.233157709949     0.067098059449     519.0938     0.0599      4000000
     20      20.00    -1.166048438416    -1.208183014318     0.042134575902     325.9677     0.0698      4000000
     30      30.00    -1.166037590737    -1.186586197151     0.020548606414     158.9711     0.0704      4000000
     40      40.00    -1.166042093134    -1.183625399859     0.017583306724     136.0305     0.0702      4000000
     50      50.00    -1.166051684893    -1.193713710257     0.027662025365     214.0030     0.0703      4000000
     60      60.00    -1.166054646931    -1.202662201513     0.036607554582     283.2087     0.0704      4000000
     70      70.00    -1.166052143011    -1.204911990844     0.038859847833     300.6332     0.0705      4000000
     80      80.00    -1.166048803912    -1.203635015020     0.037586211108     290.7799     0.0703      4000000
     90      90.00    -1.166048006780    -1.203820491599     0.037772484818     292.2210     0.0703      4000000
    100     100.00    -1.166049793504    -1.206862845060     0.040813051556     315.7439     0.0702      4000000
Tue Mar  4 14:37:13 2025: Ending simulation



Simulation Validation:
  Initial energy  : -1.166063303545
  Final energy    : -1.166049793504
  eFinal/eInitial : 0.999988
  Final atom count : 4000000, no atoms lost


Timings for Rank 0
        Timer        # Calls    Avg/Call (s)   Total (s)    % Loop
___________________________________________________________________
total                      1      28.2978       28.2978      102.20
loop                       1      27.6891       27.6891      100.00
timestep                  10       2.7688       27.6880      100.00
  position               100       0.0016        0.1592        0.57
  velocity               200       0.0012        0.2322        0.84
  redistribute           101       0.0365        3.6845       13.31
    atomHalo             101       0.0172        1.7362        6.27
  force                  101       0.2358       23.8150       86.01
commHalo                 303       0.0013        0.3976        1.44
commReduce                39       0.0000        0.0001        0.00

Timing Statistics Across 1 Ranks:
        Timer        Rank: Min(s)       Rank: Max(s)      Avg(s)    Stdev(s)
_____________________________________________________________________________
total                0:   28.2978       0:   28.2978     28.2978      0.0000
loop                 0:   27.6891       0:   27.6891     27.6891      0.0000
timestep             0:   27.6880       0:   27.6880     27.6880      0.0000
  position           0:    0.1592       0:    0.1592      0.1592      0.0000
  velocity           0:    0.2322       0:    0.2322      0.2322      0.0000
  redistribute       0:    3.6845       0:    3.6845      3.6845      0.0000
    atomHalo         0:    1.7362       0:    1.7362      1.7362      0.0000
  force              0:   23.8150       0:   23.8150     23.8150      0.0000
commHalo             0:    0.3976       0:    0.3976      0.3976      0.0000
commReduce           0:    0.0001       0:    0.0001      0.0001      0.0000

---------------------------------------------------
 Average atom update rate:       0.07 us/atom/task
---------------------------------------------------


---------------------------------------------------
 Average all atom update rate:   0.07 us/atom
---------------------------------------------------


---------------------------------------------------
 Average atom rate:             14.45 atoms/us
---------------------------------------------------

Tue Mar  4 14:37:13 2025: CoMD Ending



Your experiment path is /home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_5

To display your profiling results:
####################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                              COMMAND                                                                              #
####################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_5  #
####################################################################################################################################################################################################


* Info: Detected 1 Lprof instances in ip-172-31-47-249.ec2.internal. 
If this is incorrect, rerun with number-processes-per-node=X
Tue Mar  4 14:37:19 2025: Starting Initialization


Mini-Application Name    : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
  hostname: ip-172-31-47-249.ec2.internal
  kernel name: 'Linux'
  kernel release: '6.1.109-118.189.amzn2023.aarch64'
  processor: 'aarch64'
Build:
  CC: '/home/hbollore/soft/openmpi-5.0.6-gnu-14.2.0/bin/mpicc'
  compiler version: 'gcc (GCC) 14.2.0'
  CFLAGS: '-O3 -mcpu=native -DDO_MPI -O3 -mcpu=neoverse-v2 -larmpl -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -fopenmp -funroll-loops'
  LDFLAGS: ' '
  using MPI: true
  Threading: OpenMP (32 threads) 
  Double Precision: true
Run Date/Time: 2025-03-04, 14:37:19

Command Line Parameters:
  doeam: 0
  potDir: pots
  potName: Cu_u6.eam
  potType: funcfl
  nx: 100
  ny: 100
  nz: 100
  xproc: 1
  yproc: 1
  zproc: 1
  Lattice constant: -1 Angstroms
  nSteps: 100
  printRate: 10
  Time step: 1 fs
  Initial Temperature: 600 K
  Initial Delta: 0 Angstroms

Simulation data: 
  Total atoms        : 4000000
  Min global bounds  : [   0.0000000000,   0.0000000000,   0.0000000000 ]
  Max global bounds  : [ 361.5000000000, 361.5000000000, 361.5000000000 ]

Decomposition data: 
  Processors         :      1,     1,     1
  Local boxes        :     62,    62,    62 =   238328
  Box size           : [   5.8306451613,   5.8306451613,   5.8306451613 ]
  Box factor         : [   1.0074548875,   1.0074548875,   1.0074548875 ] 
  Max Link Cell Occupancy: 32 of 64

Potential data: 
  Potential type   : Lennard-Jones
  Species name     : Cu
  Atomic number    : 29
  Mass             : 63.55 amu
  Lattice Type     : FCC
  Lattice spacing  : 3.615 Angstroms
  Cutoff           : 5.7875 Angstroms
  Epsilon          : 0.167 eV
  Sigma            : 2.315 Angstroms


Initial energy : -1.166063303487, atom count : 4000000 

Tue Mar  4 14:37:19 2025: Initialization Finished

Tue Mar  4 14:37:19 2025: Starting simulation

#                                                                                         Performance
#  Loop   Time(fs)       Total Energy   Potential Energy     Kinetic Energy  Temperature   (us/atom)     # Atoms
      0       0.00    -1.166063303487    -1.243619295087     0.077555991600     600.0000     0.0000      4000000
     10      10.00    -1.166059650500    -1.233157709949     0.067098059449     519.0938     0.0473      4000000
     20      20.00    -1.166048438416    -1.208183014318     0.042134575902     325.9677     0.0549      4000000
     30      30.00    -1.166037590737    -1.186586197151     0.020548606414     158.9711     0.0556      4000000
     40      40.00    -1.166042093134    -1.183625399859     0.017583306724     136.0305     0.0557      4000000
     50      50.00    -1.166051684893    -1.193713710257     0.027662025365     214.0030     0.0559      4000000
     60      60.00    -1.166054646931    -1.202662201513     0.036607554582     283.2087     0.0559      4000000
     70      70.00    -1.166052143011    -1.204911990844     0.038859847833     300.6332     0.0557      4000000
     80      80.00    -1.166048803912    -1.203635015020     0.037586211108     290.7799     0.0558      4000000
     90      90.00    -1.166048006780    -1.203820491599     0.037772484818     292.2210     0.0557      4000000
    100     100.00    -1.166049793504    -1.206862845060     0.040813051556     315.7439     0.0556      4000000
Tue Mar  4 14:37:41 2025: Ending simulation



Simulation Validation:
  Initial energy  : -1.166063303487
  Final energy    : -1.166049793504
  eFinal/eInitial : 0.999988
  Final atom count : 4000000, no atoms lost


Timings for Rank 0
        Timer        # Calls    Avg/Call (s)   Total (s)    % Loop
___________________________________________________________________
total                      1      22.4875       22.4875      102.58
loop                       1      21.9227       21.9227      100.00
timestep                  10       2.1922       21.9216       99.99
  position               100       0.0014        0.1420        0.65
  velocity               200       0.0012        0.2346        1.07
  redistribute           101       0.0345        3.4840       15.89
    atomHalo             101       0.0167        1.6840        7.68
  force                  101       0.1804       18.2252       83.13
commHalo                 303       0.0013        0.3838        1.75
commReduce                39       0.0000        0.0001        0.00

Timing Statistics Across 1 Ranks:
        Timer        Rank: Min(s)       Rank: Max(s)      Avg(s)    Stdev(s)
_____________________________________________________________________________
total                0:   22.4875       0:   22.4875     22.4875      0.0000
loop                 0:   21.9227       0:   21.9227     21.9227      0.0000
timestep             0:   21.9216       0:   21.9216     21.9216      0.0000
  position           0:    0.1420       0:    0.1420      0.1420      0.0000
  velocity           0:    0.2346       0:    0.2346      0.2346      0.0000
  redistribute       0:    3.4840       0:    3.4840      3.4840      0.0000
    atomHalo         0:    1.6840       0:    1.6840      1.6840      0.0000
  force              0:   18.2252       0:   18.2252     18.2252      0.0000
commHalo             0:    0.3838       0:    0.3838      0.3838      0.0000
commReduce           0:    0.0001       0:    0.0001      0.0001      0.0000

---------------------------------------------------
 Average atom update rate:       0.05 us/atom/task
---------------------------------------------------


---------------------------------------------------
 Average all atom update rate:   0.05 us/atom
---------------------------------------------------


---------------------------------------------------
 Average atom rate:             18.25 atoms/us
---------------------------------------------------

Tue Mar  4 14:37:41 2025: CoMD Ending



Your experiment path is /home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_6

To display your profiling results:
####################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                              COMMAND                                                                              #
####################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_6      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_6  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_6  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_6  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_6      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_6  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_6  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_6  #
####################################################################################################################################################################################################


* Info: Detected 1 Lprof instances in ip-172-31-47-249.ec2.internal. 
If this is incorrect, rerun with number-processes-per-node=X
Tue Mar  4 14:37:47 2025: Starting Initialization


Mini-Application Name    : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
  hostname: ip-172-31-47-249.ec2.internal
  kernel name: 'Linux'
  kernel release: '6.1.109-118.189.amzn2023.aarch64'
  processor: 'aarch64'
Build:
  CC: '/home/hbollore/soft/openmpi-5.0.6-gnu-14.2.0/bin/mpicc'
  compiler version: 'gcc (GCC) 14.2.0'
  CFLAGS: '-O3 -mcpu=native -DDO_MPI -O3 -mcpu=neoverse-v2 -larmpl -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -fopenmp -funroll-loops'
  LDFLAGS: ' '
  using MPI: true
  Threading: OpenMP (40 threads) 
  Double Precision: true
Run Date/Time: 2025-03-04, 14:37:47

Command Line Parameters:
  doeam: 0
  potDir: pots
  potName: Cu_u6.eam
  potType: funcfl
  nx: 100
  ny: 100
  nz: 100
  xproc: 1
  yproc: 1
  zproc: 1
  Lattice constant: -1 Angstroms
  nSteps: 100
  printRate: 10
  Time step: 1 fs
  Initial Temperature: 600 K
  Initial Delta: 0 Angstroms

Simulation data: 
  Total atoms        : 4000000
  Min global bounds  : [   0.0000000000,   0.0000000000,   0.0000000000 ]
  Max global bounds  : [ 361.5000000000, 361.5000000000, 361.5000000000 ]

Decomposition data: 
  Processors         :      1,     1,     1
  Local boxes        :     62,    62,    62 =   238328
  Box size           : [   5.8306451613,   5.8306451613,   5.8306451613 ]
  Box factor         : [   1.0074548875,   1.0074548875,   1.0074548875 ] 
  Max Link Cell Occupancy: 32 of 64

Potential data: 
  Potential type   : Lennard-Jones
  Species name     : Cu
  Atomic number    : 29
  Mass             : 63.55 amu
  Lattice Type     : FCC
  Lattice spacing  : 3.615 Angstroms
  Cutoff           : 5.7875 Angstroms
  Epsilon          : 0.167 eV
  Sigma            : 2.315 Angstroms


Initial energy : -1.166063303502, atom count : 4000000 

Tue Mar  4 14:37:47 2025: Initialization Finished

Tue Mar  4 14:37:47 2025: Starting simulation

#                                                                                         Performance
#  Loop   Time(fs)       Total Energy   Potential Energy     Kinetic Energy  Temperature   (us/atom)     # Atoms
      0       0.00    -1.166063303502    -1.243619295102     0.077555991600     600.0000     0.0000      4000000
     10      10.00    -1.166059650500    -1.233157709949     0.067098059449     519.0938     0.0418      4000000
     20      20.00    -1.166048438416    -1.208183014318     0.042134575902     325.9677     0.0474      4000000
     30      30.00    -1.166037590737    -1.186586197151     0.020548606414     158.9711     0.0478      4000000
     40      40.00    -1.166042093134    -1.183625399859     0.017583306724     136.0305     0.0481      4000000
     50      50.00    -1.166051684893    -1.193713710258     0.027662025365     214.0030     0.0483      4000000
     60      60.00    -1.166054646931    -1.202662201513     0.036607554582     283.2087     0.0483      4000000
     70      70.00    -1.166052143011    -1.204911990844     0.038859847833     300.6332     0.0482      4000000
     80      80.00    -1.166048803912    -1.203635015020     0.037586211108     290.7799     0.0481      4000000
     90      90.00    -1.166048006780    -1.203820491599     0.037772484818     292.2210     0.0480      4000000
    100     100.00    -1.166049793504    -1.206862845060     0.040813051556     315.7439     0.0480      4000000
Tue Mar  4 14:38:06 2025: Ending simulation



Simulation Validation:
  Initial energy  : -1.166063303502
  Final energy    : -1.166049793504
  eFinal/eInitial : 0.999988
  Final atom count : 4000000, no atoms lost


Timings for Rank 0
        Timer        # Calls    Avg/Call (s)   Total (s)    % Loop
___________________________________________________________________
total                      1      19.4981       19.4981      102.85
loop                       1      18.9586       18.9586      100.00
timestep                  10       1.8957       18.9575       99.99
  position               100       0.0013        0.1307        0.69
  velocity               200       0.0011        0.2291        1.21
  redistribute           101       0.0348        3.5129       18.53
    atomHalo             101       0.0171        1.7248        9.10
  force                  101       0.1508       15.2261       80.31
commHalo                 303       0.0013        0.3936        2.08
commReduce                39       0.0000        0.0001        0.00

Timing Statistics Across 1 Ranks:
        Timer        Rank: Min(s)       Rank: Max(s)      Avg(s)    Stdev(s)
_____________________________________________________________________________
total                0:   19.4981       0:   19.4981     19.4981      0.0000
loop                 0:   18.9586       0:   18.9586     18.9586      0.0000
timestep             0:   18.9575       0:   18.9575     18.9575      0.0000
  position           0:    0.1307       0:    0.1307      0.1307      0.0000
  velocity           0:    0.2291       0:    0.2291      0.2291      0.0000
  redistribute       0:    3.5129       0:    3.5129      3.5129      0.0000
    atomHalo         0:    1.7248       0:    1.7248      1.7248      0.0000
  force              0:   15.2261       0:   15.2261     15.2261      0.0000
commHalo             0:    0.3936       0:    0.3936      0.3936      0.0000
commReduce           0:    0.0001       0:    0.0001      0.0001      0.0000

---------------------------------------------------
 Average atom update rate:       0.05 us/atom/task
---------------------------------------------------


---------------------------------------------------
 Average all atom update rate:   0.05 us/atom
---------------------------------------------------


---------------------------------------------------
 Average atom rate:             21.10 atoms/us
---------------------------------------------------

Tue Mar  4 14:38:06 2025: CoMD Ending



Your experiment path is /home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_7

To display your profiling results:
####################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                              COMMAND                                                                              #
####################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_7      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_7  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_7  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_7  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_7      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_7  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_7  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_7  #
####################################################################################################################################################################################################


* Info: Detected 1 Lprof instances in ip-172-31-47-249.ec2.internal. 
If this is incorrect, rerun with number-processes-per-node=X
Tue Mar  4 14:38:12 2025: Starting Initialization


Mini-Application Name    : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
  hostname: ip-172-31-47-249.ec2.internal
  kernel name: 'Linux'
  kernel release: '6.1.109-118.189.amzn2023.aarch64'
  processor: 'aarch64'
Build:
  CC: '/home/hbollore/soft/openmpi-5.0.6-gnu-14.2.0/bin/mpicc'
  compiler version: 'gcc (GCC) 14.2.0'
  CFLAGS: '-O3 -mcpu=native -DDO_MPI -O3 -mcpu=neoverse-v2 -larmpl -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -fopenmp -funroll-loops'
  LDFLAGS: ' '
  using MPI: true
  Threading: OpenMP (48 threads) 
  Double Precision: true
Run Date/Time: 2025-03-04, 14:38:12

Command Line Parameters:
  doeam: 0
  potDir: pots
  potName: Cu_u6.eam
  potType: funcfl
  nx: 100
  ny: 100
  nz: 100
  xproc: 1
  yproc: 1
  zproc: 1
  Lattice constant: -1 Angstroms
  nSteps: 100
  printRate: 10
  Time step: 1 fs
  Initial Temperature: 600 K
  Initial Delta: 0 Angstroms

Simulation data: 
  Total atoms        : 4000000
  Min global bounds  : [   0.0000000000,   0.0000000000,   0.0000000000 ]
  Max global bounds  : [ 361.5000000000, 361.5000000000, 361.5000000000 ]

Decomposition data: 
  Processors         :      1,     1,     1
  Local boxes        :     62,    62,    62 =   238328
  Box size           : [   5.8306451613,   5.8306451613,   5.8306451613 ]
  Box factor         : [   1.0074548875,   1.0074548875,   1.0074548875 ] 
  Max Link Cell Occupancy: 32 of 64

Potential data: 
  Potential type   : Lennard-Jones
  Species name     : Cu
  Atomic number    : 29
  Mass             : 63.55 amu
  Lattice Type     : FCC
  Lattice spacing  : 3.615 Angstroms
  Cutoff           : 5.7875 Angstroms
  Epsilon          : 0.167 eV
  Sigma            : 2.315 Angstroms


Initial energy : -1.166063303516, atom count : 4000000 

Tue Mar  4 14:38:12 2025: Initialization Finished

Tue Mar  4 14:38:12 2025: Starting simulation

#                                                                                         Performance
#  Loop   Time(fs)       Total Energy   Potential Energy     Kinetic Energy  Temperature   (us/atom)     # Atoms
      0       0.00    -1.166063303516    -1.243619295116     0.077555991600     600.0000     0.0000      4000000
     10      10.00    -1.166059650500    -1.233157709949     0.067098059449     519.0938     0.0356      4000000
     20      20.00    -1.166048438416    -1.208183014318     0.042134575902     325.9677     0.0413      4000000
     30      30.00    -1.166037590737    -1.186586197151     0.020548606414     158.9711     0.0417      4000000
     40      40.00    -1.166042093134    -1.183625399859     0.017583306724     136.0305     0.0417      4000000
     50      50.00    -1.166051684893    -1.193713710258     0.027662025365     214.0030     0.0417      4000000
     60      60.00    -1.166054646931    -1.202662201513     0.036607554582     283.2087     0.0418      4000000
     70      70.00    -1.166052143011    -1.204911990844     0.038859847833     300.6332     0.0418      4000000
     80      80.00    -1.166048803912    -1.203635015020     0.037586211108     290.7799     0.0418      4000000
     90      90.00    -1.166048006780    -1.203820491599     0.037772484818     292.2210     0.0417      4000000
    100     100.00    -1.166049793504    -1.206862845060     0.040813051556     315.7439     0.0416      4000000
Tue Mar  4 14:38:29 2025: Ending simulation



Simulation Validation:
  Initial energy  : -1.166063303516
  Final energy    : -1.166049793504
  eFinal/eInitial : 0.999988
  Final atom count : 4000000, no atoms lost


Timings for Rank 0
        Timer        # Calls    Avg/Call (s)   Total (s)    % Loop
___________________________________________________________________
total                      1      16.9424       16.9424      103.11
loop                       1      16.4308       16.4308      100.00
timestep                  10       1.6430       16.4297       99.99
  position               100       0.0012        0.1229        0.75
  velocity               200       0.0010        0.2062        1.25
  redistribute           101       0.0352        3.5593       21.66
    atomHalo             101       0.0167        1.6847       10.25
  force                  101       0.1254       12.6604       77.05
commHalo                 303       0.0012        0.3780        2.30
commReduce                39       0.0000        0.0001        0.00

Timing Statistics Across 1 Ranks:
        Timer        Rank: Min(s)       Rank: Max(s)      Avg(s)    Stdev(s)
_____________________________________________________________________________
total                0:   16.9424       0:   16.9424     16.9424      0.0000
loop                 0:   16.4308       0:   16.4308     16.4308      0.0000
timestep             0:   16.4297       0:   16.4297     16.4297      0.0000
  position           0:    0.1229       0:    0.1229      0.1229      0.0000
  velocity           0:    0.2062       0:    0.2062      0.2062      0.0000
  redistribute       0:    3.5593       0:    3.5593      3.5593      0.0000
    atomHalo         0:    1.6847       0:    1.6847      1.6847      0.0000
  force              0:   12.6604       0:   12.6604     12.6604      0.0000
commHalo             0:    0.3780       0:    0.3780      0.3780      0.0000
commReduce           0:    0.0001       0:    0.0001      0.0001      0.0000

---------------------------------------------------
 Average atom update rate:       0.04 us/atom/task
---------------------------------------------------


---------------------------------------------------
 Average all atom update rate:   0.04 us/atom
---------------------------------------------------


---------------------------------------------------
 Average atom rate:             24.35 atoms/us
---------------------------------------------------

Tue Mar  4 14:38:29 2025: CoMD Ending



Your experiment path is /home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_8

To display your profiling results:
####################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                              COMMAND                                                                              #
####################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_8      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_8  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_8  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_8  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_8      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_8  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_8  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_8  #
####################################################################################################################################################################################################


* Info: Detected 1 Lprof instances in ip-172-31-47-249.ec2.internal. 
If this is incorrect, rerun with number-processes-per-node=X
Tue Mar  4 14:38:34 2025: Starting Initialization


Mini-Application Name    : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
  hostname: ip-172-31-47-249.ec2.internal
  kernel name: 'Linux'
  kernel release: '6.1.109-118.189.amzn2023.aarch64'
  processor: 'aarch64'
Build:
  CC: '/home/hbollore/soft/openmpi-5.0.6-gnu-14.2.0/bin/mpicc'
  compiler version: 'gcc (GCC) 14.2.0'
  CFLAGS: '-O3 -mcpu=native -DDO_MPI -O3 -mcpu=neoverse-v2 -larmpl -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -fopenmp -funroll-loops'
  LDFLAGS: ' '
  using MPI: true
  Threading: OpenMP (56 threads) 
  Double Precision: true
Run Date/Time: 2025-03-04, 14:38:34

Command Line Parameters:
  doeam: 0
  potDir: pots
  potName: Cu_u6.eam
  potType: funcfl
  nx: 100
  ny: 100
  nz: 100
  xproc: 1
  yproc: 1
  zproc: 1
  Lattice constant: -1 Angstroms
  nSteps: 100
  printRate: 10
  Time step: 1 fs
  Initial Temperature: 600 K
  Initial Delta: 0 Angstroms

Simulation data: 
  Total atoms        : 4000000
  Min global bounds  : [   0.0000000000,   0.0000000000,   0.0000000000 ]
  Max global bounds  : [ 361.5000000000, 361.5000000000, 361.5000000000 ]

Decomposition data: 
  Processors         :      1,     1,     1
  Local boxes        :     62,    62,    62 =   238328
  Box size           : [   5.8306451613,   5.8306451613,   5.8306451613 ]
  Box factor         : [   1.0074548875,   1.0074548875,   1.0074548875 ] 
  Max Link Cell Occupancy: 32 of 64

Potential data: 
  Potential type   : Lennard-Jones
  Species name     : Cu
  Atomic number    : 29
  Mass             : 63.55 amu
  Lattice Type     : FCC
  Lattice spacing  : 3.615 Angstroms
  Cutoff           : 5.7875 Angstroms
  Epsilon          : 0.167 eV
  Sigma            : 2.315 Angstroms


Initial energy : -1.166063303525, atom count : 4000000 

Tue Mar  4 14:38:35 2025: Initialization Finished

Tue Mar  4 14:38:35 2025: Starting simulation

#                                                                                         Performance
#  Loop   Time(fs)       Total Energy   Potential Energy     Kinetic Energy  Temperature   (us/atom)     # Atoms
      0       0.00    -1.166063303525    -1.243619295125     0.077555991600     600.0000     0.0000      4000000
     10      10.00    -1.166059650500    -1.233157709949     0.067098059449     519.0938     0.0339      4000000
     20      20.00    -1.166048438416    -1.208183014318     0.042134575902     325.9677     0.0378      4000000
     30      30.00    -1.166037590737    -1.186586197151     0.020548606414     158.9711     0.0377      4000000
     40      40.00    -1.166042093134    -1.183625399859     0.017583306724     136.0305     0.0375      4000000
     50      50.00    -1.166051684893    -1.193713710258     0.027662025365     214.0030     0.0375      4000000
     60      60.00    -1.166054646931    -1.202662201513     0.036607554582     283.2087     0.0375      4000000
     70      70.00    -1.166052143011    -1.204911990844     0.038859847833     300.6332     0.0375      4000000
     80      80.00    -1.166048803912    -1.203635015020     0.037586211108     290.7799     0.0375      4000000
     90      90.00    -1.166048006780    -1.203820491599     0.037772484818     292.2210     0.0375      4000000
    100     100.00    -1.166049793504    -1.206862845060     0.040813051556     315.7439     0.0375      4000000
Tue Mar  4 14:38:50 2025: Ending simulation



Simulation Validation:
  Initial energy  : -1.166063303525
  Final energy    : -1.166049793504
  eFinal/eInitial : 0.999988
  Final atom count : 4000000, no atoms lost


Timings for Rank 0
        Timer        # Calls    Avg/Call (s)   Total (s)    % Loop
___________________________________________________________________
total                      1      15.3843       15.3843      103.40
loop                       1      14.8782       14.8782      100.00
timestep                  10       1.4877       14.8770       99.99
  position               100       0.0012        0.1164        0.78
  velocity               200       0.0010        0.2020        1.36
  redistribute           101       0.0357        3.6073       24.25
    atomHalo             101       0.0172        1.7398       11.69
  force                  101       0.1096       11.0655       74.37
commHalo                 303       0.0014        0.4112        2.76
commReduce                39       0.0000        0.0001        0.00

Timing Statistics Across 1 Ranks:
        Timer        Rank: Min(s)       Rank: Max(s)      Avg(s)    Stdev(s)
_____________________________________________________________________________
total                0:   15.3843       0:   15.3843     15.3843      0.0000
loop                 0:   14.8782       0:   14.8782     14.8782      0.0000
timestep             0:   14.8770       0:   14.8770     14.8770      0.0000
  position           0:    0.1164       0:    0.1164      0.1164      0.0000
  velocity           0:    0.2020       0:    0.2020      0.2020      0.0000
  redistribute       0:    3.6073       0:    3.6073      3.6073      0.0000
    atomHalo         0:    1.7398       0:    1.7398      1.7398      0.0000
  force              0:   11.0655       0:   11.0655     11.0655      0.0000
commHalo             0:    0.4112       0:    0.4112      0.4112      0.0000
commReduce           0:    0.0001       0:    0.0001      0.0001      0.0000

---------------------------------------------------
 Average atom update rate:       0.04 us/atom/task
---------------------------------------------------


---------------------------------------------------
 Average all atom update rate:   0.04 us/atom
---------------------------------------------------


---------------------------------------------------
 Average atom rate:             26.89 atoms/us
---------------------------------------------------

Tue Mar  4 14:38:50 2025: CoMD Ending



Your experiment path is /home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_9

To display your profiling results:
####################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                              COMMAND                                                                              #
####################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_9      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_9  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_9  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_9  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_9      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_9  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_9  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_9  #
####################################################################################################################################################################################################


* Info: Detected 1 Lprof instances in ip-172-31-47-249.ec2.internal. 
If this is incorrect, rerun with number-processes-per-node=X
Tue Mar  4 14:38:56 2025: Starting Initialization


Mini-Application Name    : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
  hostname: ip-172-31-47-249.ec2.internal
  kernel name: 'Linux'
  kernel release: '6.1.109-118.189.amzn2023.aarch64'
  processor: 'aarch64'
Build:
  CC: '/home/hbollore/soft/openmpi-5.0.6-gnu-14.2.0/bin/mpicc'
  compiler version: 'gcc (GCC) 14.2.0'
  CFLAGS: '-O3 -mcpu=native -DDO_MPI -O3 -mcpu=neoverse-v2 -larmpl -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -fopenmp -funroll-loops'
  LDFLAGS: ' '
  using MPI: true
  Threading: OpenMP (64 threads) 
  Double Precision: true
Run Date/Time: 2025-03-04, 14:38:56

Command Line Parameters:
  doeam: 0
  potDir: pots
  potName: Cu_u6.eam
  potType: funcfl
  nx: 100
  ny: 100
  nz: 100
  xproc: 1
  yproc: 1
  zproc: 1
  Lattice constant: -1 Angstroms
  nSteps: 100
  printRate: 10
  Time step: 1 fs
  Initial Temperature: 600 K
  Initial Delta: 0 Angstroms

Simulation data: 
  Total atoms        : 4000000
  Min global bounds  : [   0.0000000000,   0.0000000000,   0.0000000000 ]
  Max global bounds  : [ 361.5000000000, 361.5000000000, 361.5000000000 ]

Decomposition data: 
  Processors         :      1,     1,     1
  Local boxes        :     62,    62,    62 =   238328
  Box size           : [   5.8306451613,   5.8306451613,   5.8306451613 ]
  Box factor         : [   1.0074548875,   1.0074548875,   1.0074548875 ] 
  Max Link Cell Occupancy: 32 of 64

Potential data: 
  Potential type   : Lennard-Jones
  Species name     : Cu
  Atomic number    : 29
  Mass             : 63.55 amu
  Lattice Type     : FCC
  Lattice spacing  : 3.615 Angstroms
  Cutoff           : 5.7875 Angstroms
  Epsilon          : 0.167 eV
  Sigma            : 2.315 Angstroms


Initial energy : -1.166063303520, atom count : 4000000 

Tue Mar  4 14:38:56 2025: Initialization Finished

Tue Mar  4 14:38:56 2025: Starting simulation

#                                                                                         Performance
#  Loop   Time(fs)       Total Energy   Potential Energy     Kinetic Energy  Temperature   (us/atom)     # Atoms
      0       0.00    -1.166063303520    -1.243619295120     0.077555991600     600.0000     0.0000      4000000
     10      10.00    -1.166059650500    -1.233157709949     0.067098059449     519.0938     0.0305      4000000
     20      20.00    -1.166048438416    -1.208183014318     0.042134575902     325.9677     0.0341      4000000
     30      30.00    -1.166037590737    -1.186586197151     0.020548606414     158.9711     0.0350      4000000
     40      40.00    -1.166042093134    -1.183625399859     0.017583306724     136.0305     0.0351      4000000
     50      50.00    -1.166051684893    -1.193713710258     0.027662025365     214.0030     0.0351      4000000
     60      60.00    -1.166054646931    -1.202662201513     0.036607554582     283.2087     0.0352      4000000
     70      70.00    -1.166052143011    -1.204911990844     0.038859847833     300.6332     0.0352      4000000
     80      80.00    -1.166048803912    -1.203635015020     0.037586211108     290.7799     0.0352      4000000
     90      90.00    -1.166048006780    -1.203820491599     0.037772484818     292.2210     0.0351      4000000
    100     100.00    -1.166049793504    -1.206862845060     0.040813051556     315.7439     0.0350      4000000
Tue Mar  4 14:39:10 2025: Ending simulation



Simulation Validation:
  Initial energy  : -1.166063303520
  Final energy    : -1.166049793504
  eFinal/eInitial : 0.999988
  Final atom count : 4000000, no atoms lost


Timings for Rank 0
        Timer        # Calls    Avg/Call (s)   Total (s)    % Loop
___________________________________________________________________
total                      1      14.3155       14.3155      103.58
loop                       1      13.8207       13.8207      100.00
timestep                  10       1.3820       13.8196       99.99
  position               100       0.0011        0.1145        0.83
  velocity               200       0.0010        0.1948        1.41
  redistribute           101       0.0339        3.4255       24.79
    atomHalo             101       0.0167        1.6864       12.20
  force                  101       0.1009       10.1904       73.73
commHalo                 303       0.0013        0.3830        2.77
commReduce                39       0.0000        0.0001        0.00

Timing Statistics Across 1 Ranks:
        Timer        Rank: Min(s)       Rank: Max(s)      Avg(s)    Stdev(s)
_____________________________________________________________________________
total                0:   14.3155       0:   14.3155     14.3155      0.0000
loop                 0:   13.8207       0:   13.8207     13.8207      0.0000
timestep             0:   13.8196       0:   13.8196     13.8196      0.0000
  position           0:    0.1145       0:    0.1145      0.1145      0.0000
  velocity           0:    0.1948       0:    0.1948      0.1948      0.0000
  redistribute       0:    3.4255       0:    3.4255      3.4255      0.0000
    atomHalo         0:    1.6864       0:    1.6864      1.6864      0.0000
  force              0:   10.1904       0:   10.1904     10.1904      0.0000
commHalo             0:    0.3830       0:    0.3830      0.3830      0.0000
commReduce           0:    0.0001       0:    0.0001      0.0001      0.0000

---------------------------------------------------
 Average atom update rate:       0.03 us/atom/task
---------------------------------------------------


---------------------------------------------------
 Average all atom update rate:   0.03 us/atom
---------------------------------------------------


---------------------------------------------------
 Average atom rate:             28.94 atoms/us
---------------------------------------------------

Tue Mar  4 14:39:10 2025: CoMD Ending



Your experiment path is /home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_10

To display your profiling results:
#####################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                              COMMAND                                                                               #
#####################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_10      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_10  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_10  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_10  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_10      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_10  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_10  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_10  #
#####################################################################################################################################################################################################


* Info: Detected 1 Lprof instances in ip-172-31-47-249.ec2.internal. 
If this is incorrect, rerun with number-processes-per-node=X
Tue Mar  4 14:39:16 2025: Starting Initialization


Mini-Application Name    : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
  hostname: ip-172-31-47-249.ec2.internal
  kernel name: 'Linux'
  kernel release: '6.1.109-118.189.amzn2023.aarch64'
  processor: 'aarch64'
Build:
  CC: '/home/hbollore/soft/openmpi-5.0.6-gnu-14.2.0/bin/mpicc'
  compiler version: 'gcc (GCC) 14.2.0'
  CFLAGS: '-O3 -mcpu=native -DDO_MPI -O3 -mcpu=neoverse-v2 -larmpl -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -fopenmp -funroll-loops'
  LDFLAGS: ' '
  using MPI: true
  Threading: OpenMP (72 threads) 
  Double Precision: true
Run Date/Time: 2025-03-04, 14:39:16

Command Line Parameters:
  doeam: 0
  potDir: pots
  potName: Cu_u6.eam
  potType: funcfl
  nx: 100
  ny: 100
  nz: 100
  xproc: 1
  yproc: 1
  zproc: 1
  Lattice constant: -1 Angstroms
  nSteps: 100
  printRate: 10
  Time step: 1 fs
  Initial Temperature: 600 K
  Initial Delta: 0 Angstroms

Simulation data: 
  Total atoms        : 4000000
  Min global bounds  : [   0.0000000000,   0.0000000000,   0.0000000000 ]
  Max global bounds  : [ 361.5000000000, 361.5000000000, 361.5000000000 ]

Decomposition data: 
  Processors         :      1,     1,     1
  Local boxes        :     62,    62,    62 =   238328
  Box size           : [   5.8306451613,   5.8306451613,   5.8306451613 ]
  Box factor         : [   1.0074548875,   1.0074548875,   1.0074548875 ] 
  Max Link Cell Occupancy: 32 of 64

Potential data: 
  Potential type   : Lennard-Jones
  Species name     : Cu
  Atomic number    : 29
  Mass             : 63.55 amu
  Lattice Type     : FCC
  Lattice spacing  : 3.615 Angstroms
  Cutoff           : 5.7875 Angstroms
  Epsilon          : 0.167 eV
  Sigma            : 2.315 Angstroms


Initial energy : -1.166063303512, atom count : 4000000 

Tue Mar  4 14:39:16 2025: Initialization Finished

Tue Mar  4 14:39:16 2025: Starting simulation

#                                                                                         Performance
#  Loop   Time(fs)       Total Energy   Potential Energy     Kinetic Energy  Temperature   (us/atom)     # Atoms
      0       0.00    -1.166063303512    -1.243619295112     0.077555991600     600.0000     0.0000      4000000
     10      10.00    -1.166059650500    -1.233157709949     0.067098059449     519.0938     0.0289      4000000
     20      20.00    -1.166048438416    -1.208183014318     0.042134575902     325.9677     0.0320      4000000
     30      30.00    -1.166037590737    -1.186586197151     0.020548606414     158.9711     0.0323      4000000
     40      40.00    -1.166042093134    -1.183625399859     0.017583306724     136.0305     0.0326      4000000
     50      50.00    -1.166051684893    -1.193713710258     0.027662025365     214.0030     0.0327      4000000
     60      60.00    -1.166054646931    -1.202662201513     0.036607554582     283.2087     0.0327      4000000
     70      70.00    -1.166052143011    -1.204911990844     0.038859847833     300.6332     0.0327      4000000
     80      80.00    -1.166048803912    -1.203635015020     0.037586211108     290.7799     0.0326      4000000
     90      90.00    -1.166048006780    -1.203820491599     0.037772484818     292.2210     0.0326      4000000
    100     100.00    -1.166049793504    -1.206862845060     0.040813051556     315.7439     0.0325      4000000
Tue Mar  4 14:39:29 2025: Ending simulation



Simulation Validation:
  Initial energy  : -1.166063303512
  Final energy    : -1.166049793504
  eFinal/eInitial : 0.999988
  Final atom count : 4000000, no atoms lost


Timings for Rank 0
        Timer        # Calls    Avg/Call (s)   Total (s)    % Loop
___________________________________________________________________
total                      1      13.3546       13.3546      103.79
loop                       1      12.8665       12.8665      100.00
timestep                  10       1.2865       12.8654       99.99
  position               100       0.0011        0.1071        0.83
  velocity               200       0.0010        0.1941        1.51
  redistribute           101       0.0341        3.4404       26.74
    atomHalo             101       0.0171        1.7222       13.38
  force                  101       0.0913        9.2217       71.67
commHalo                 303       0.0013        0.3919        3.05
commReduce                39       0.0000        0.0001        0.00

Timing Statistics Across 1 Ranks:
        Timer        Rank: Min(s)       Rank: Max(s)      Avg(s)    Stdev(s)
_____________________________________________________________________________
total                0:   13.3546       0:   13.3546     13.3546      0.0000
loop                 0:   12.8665       0:   12.8665     12.8665      0.0000
timestep             0:   12.8654       0:   12.8654     12.8654      0.0000
  position           0:    0.1071       0:    0.1071      0.1071      0.0000
  velocity           0:    0.1941       0:    0.1941      0.1941      0.0000
  redistribute       0:    3.4404       0:    3.4404      3.4404      0.0000
    atomHalo         0:    1.7222       0:    1.7222      1.7222      0.0000
  force              0:    9.2217       0:    9.2217      9.2217      0.0000
commHalo             0:    0.3919       0:    0.3919      0.3919      0.0000
commReduce           0:    0.0001       0:    0.0001      0.0001      0.0000

---------------------------------------------------
 Average atom update rate:       0.03 us/atom/task
---------------------------------------------------


---------------------------------------------------
 Average all atom update rate:   0.03 us/atom
---------------------------------------------------


---------------------------------------------------
 Average atom rate:             31.09 atoms/us
---------------------------------------------------

Tue Mar  4 14:39:29 2025: CoMD Ending



Your experiment path is /home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_11

To display your profiling results:
#####################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                              COMMAND                                                                               #
#####################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_11      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_11  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_11  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_11  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_11      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_11  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_11  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_11  #
#####################################################################################################################################################################################################


* Info: Detected 1 Lprof instances in ip-172-31-47-249.ec2.internal. 
If this is incorrect, rerun with number-processes-per-node=X
Tue Mar  4 14:39:35 2025: Starting Initialization


Mini-Application Name    : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
  hostname: ip-172-31-47-249.ec2.internal
  kernel name: 'Linux'
  kernel release: '6.1.109-118.189.amzn2023.aarch64'
  processor: 'aarch64'
Build:
  CC: '/home/hbollore/soft/openmpi-5.0.6-gnu-14.2.0/bin/mpicc'
  compiler version: 'gcc (GCC) 14.2.0'
  CFLAGS: '-O3 -mcpu=native -DDO_MPI -O3 -mcpu=neoverse-v2 -larmpl -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -fopenmp -funroll-loops'
  LDFLAGS: ' '
  using MPI: true
  Threading: OpenMP (80 threads) 
  Double Precision: true
Run Date/Time: 2025-03-04, 14:39:35

Command Line Parameters:
  doeam: 0
  potDir: pots
  potName: Cu_u6.eam
  potType: funcfl
  nx: 100
  ny: 100
  nz: 100
  xproc: 1
  yproc: 1
  zproc: 1
  Lattice constant: -1 Angstroms
  nSteps: 100
  printRate: 10
  Time step: 1 fs
  Initial Temperature: 600 K
  Initial Delta: 0 Angstroms

Simulation data: 
  Total atoms        : 4000000
  Min global bounds  : [   0.0000000000,   0.0000000000,   0.0000000000 ]
  Max global bounds  : [ 361.5000000000, 361.5000000000, 361.5000000000 ]

Decomposition data: 
  Processors         :      1,     1,     1
  Local boxes        :     62,    62,    62 =   238328
  Box size           : [   5.8306451613,   5.8306451613,   5.8306451613 ]
  Box factor         : [   1.0074548875,   1.0074548875,   1.0074548875 ] 
  Max Link Cell Occupancy: 32 of 64

Potential data: 
  Potential type   : Lennard-Jones
  Species name     : Cu
  Atomic number    : 29
  Mass             : 63.55 amu
  Lattice Type     : FCC
  Lattice spacing  : 3.615 Angstroms
  Cutoff           : 5.7875 Angstroms
  Epsilon          : 0.167 eV
  Sigma            : 2.315 Angstroms


Initial energy : -1.166063303501, atom count : 4000000 

Tue Mar  4 14:39:35 2025: Initialization Finished

Tue Mar  4 14:39:35 2025: Starting simulation

#                                                                                         Performance
#  Loop   Time(fs)       Total Energy   Potential Energy     Kinetic Energy  Temperature   (us/atom)     # Atoms
      0       0.00    -1.166063303501    -1.243619295101     0.077555991600     600.0000     0.0000      4000000
     10      10.00    -1.166059650500    -1.233157709949     0.067098059449     519.0938     0.0267      4000000
     20      20.00    -1.166048438416    -1.208183014318     0.042134575902     325.9677     0.0296      4000000
     30      30.00    -1.166037590737    -1.186586197151     0.020548606414     158.9711     0.0302      4000000
     40      40.00    -1.166042093134    -1.183625399859     0.017583306724     136.0305     0.0305      4000000
     50      50.00    -1.166051684893    -1.193713710258     0.027662025365     214.0030     0.0305      4000000
     60      60.00    -1.166054646931    -1.202662201513     0.036607554582     283.2087     0.0306      4000000
     70      70.00    -1.166052143011    -1.204911990844     0.038859847833     300.6332     0.0305      4000000
     80      80.00    -1.166048803912    -1.203635015020     0.037586211108     290.7799     0.0305      4000000
     90      90.00    -1.166048006780    -1.203820491599     0.037772484818     292.2210     0.0304      4000000
    100     100.00    -1.166049793504    -1.206862845060     0.040813051556     315.7439     0.0304      4000000
Tue Mar  4 14:39:47 2025: Ending simulation



Simulation Validation:
  Initial energy  : -1.166063303501
  Final energy    : -1.166049793504
  eFinal/eInitial : 0.999988
  Final atom count : 4000000, no atoms lost


Timings for Rank 0
        Timer        # Calls    Avg/Call (s)   Total (s)    % Loop
___________________________________________________________________
total                      1      12.4718       12.4718      104.01
loop                       1      11.9914       11.9914      100.00
timestep                  10       1.1990       11.9903       99.99
  position               100       0.0011        0.1063        0.89
  velocity               200       0.0009        0.1825        1.52
  redistribute           101       0.0348        3.5180       29.34
    atomHalo             101       0.0167        1.6823       14.03
  force                  101       0.0819        8.2750       69.01
commHalo                 303       0.0013        0.3799        3.17
commReduce                39       0.0000        0.0001        0.00

Timing Statistics Across 1 Ranks:
        Timer        Rank: Min(s)       Rank: Max(s)      Avg(s)    Stdev(s)
_____________________________________________________________________________
total                0:   12.4718       0:   12.4718     12.4718      0.0000
loop                 0:   11.9914       0:   11.9914     11.9914      0.0000
timestep             0:   11.9903       0:   11.9903     11.9903      0.0000
  position           0:    0.1063       0:    0.1063      0.1063      0.0000
  velocity           0:    0.1825       0:    0.1825      0.1825      0.0000
  redistribute       0:    3.5180       0:    3.5180      3.5180      0.0000
    atomHalo         0:    1.6823       0:    1.6823      1.6823      0.0000
  force              0:    8.2750       0:    8.2750      8.2750      0.0000
commHalo             0:    0.3799       0:    0.3799      0.3799      0.0000
commReduce           0:    0.0001       0:    0.0001      0.0001      0.0000

---------------------------------------------------
 Average atom update rate:       0.03 us/atom/task
---------------------------------------------------


---------------------------------------------------
 Average all atom update rate:   0.03 us/atom
---------------------------------------------------


---------------------------------------------------
 Average atom rate:             33.36 atoms/us
---------------------------------------------------

Tue Mar  4 14:39:47 2025: CoMD Ending



Your experiment path is /home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_12

To display your profiling results:
#####################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                              COMMAND                                                                               #
#####################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_12      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_12  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_12  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_12  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_12      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_12  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_12  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_12  #
#####################################################################################################################################################################################################


* Info: Detected 1 Lprof instances in ip-172-31-47-249.ec2.internal. 
If this is incorrect, rerun with number-processes-per-node=X
Tue Mar  4 14:39:53 2025: Starting Initialization


Mini-Application Name    : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
  hostname: ip-172-31-47-249.ec2.internal
  kernel name: 'Linux'
  kernel release: '6.1.109-118.189.amzn2023.aarch64'
  processor: 'aarch64'
Build:
  CC: '/home/hbollore/soft/openmpi-5.0.6-gnu-14.2.0/bin/mpicc'
  compiler version: 'gcc (GCC) 14.2.0'
  CFLAGS: '-O3 -mcpu=native -DDO_MPI -O3 -mcpu=neoverse-v2 -larmpl -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -fopenmp -funroll-loops'
  LDFLAGS: ' '
  using MPI: true
  Threading: OpenMP (88 threads) 
  Double Precision: true
Run Date/Time: 2025-03-04, 14:39:53

Command Line Parameters:
  doeam: 0
  potDir: pots
  potName: Cu_u6.eam
  potType: funcfl
  nx: 100
  ny: 100
  nz: 100
  xproc: 1
  yproc: 1
  zproc: 1
  Lattice constant: -1 Angstroms
  nSteps: 100
  printRate: 10
  Time step: 1 fs
  Initial Temperature: 600 K
  Initial Delta: 0 Angstroms

Simulation data: 
  Total atoms        : 4000000
  Min global bounds  : [   0.0000000000,   0.0000000000,   0.0000000000 ]
  Max global bounds  : [ 361.5000000000, 361.5000000000, 361.5000000000 ]

Decomposition data: 
  Processors         :      1,     1,     1
  Local boxes        :     62,    62,    62 =   238328
  Box size           : [   5.8306451613,   5.8306451613,   5.8306451613 ]
  Box factor         : [   1.0074548875,   1.0074548875,   1.0074548875 ] 
  Max Link Cell Occupancy: 32 of 64

Potential data: 
  Potential type   : Lennard-Jones
  Species name     : Cu
  Atomic number    : 29
  Mass             : 63.55 amu
  Lattice Type     : FCC
  Lattice spacing  : 3.615 Angstroms
  Cutoff           : 5.7875 Angstroms
  Epsilon          : 0.167 eV
  Sigma            : 2.315 Angstroms


Initial energy : -1.166063303491, atom count : 4000000 

Tue Mar  4 14:39:53 2025: Initialization Finished

Tue Mar  4 14:39:53 2025: Starting simulation

#                                                                                         Performance
#  Loop   Time(fs)       Total Energy   Potential Energy     Kinetic Energy  Temperature   (us/atom)     # Atoms
      0       0.00    -1.166063303491    -1.243619295091     0.077555991600     600.0000     0.0000      4000000
     10      10.00    -1.166059650500    -1.233157709949     0.067098059449     519.0938     0.0256      4000000
     20      20.00    -1.166048438416    -1.208183014318     0.042134575902     325.9677     0.0283      4000000
     30      30.00    -1.166037590737    -1.186586197151     0.020548606414     158.9711     0.0284      4000000
     40      40.00    -1.166042093134    -1.183625399859     0.017583306724     136.0305     0.0285      4000000
     50      50.00    -1.166051684893    -1.193713710258     0.027662025365     214.0030     0.0285      4000000
     60      60.00    -1.166054646931    -1.202662201513     0.036607554582     283.2087     0.0286      4000000
     70      70.00    -1.166052143011    -1.204911990844     0.038859847833     300.6332     0.0286      4000000
     80      80.00    -1.166048803912    -1.203635015020     0.037586211108     290.7799     0.0286      4000000
     90      90.00    -1.166048006780    -1.203820491599     0.037772484818     292.2210     0.0284      4000000
    100     100.00    -1.166049793504    -1.206862845060     0.040813051556     315.7439     0.0284      4000000
Tue Mar  4 14:40:05 2025: Ending simulation



Simulation Validation:
  Initial energy  : -1.166063303491
  Final energy    : -1.166049793504
  eFinal/eInitial : 0.999988
  Final atom count : 4000000, no atoms lost


Timings for Rank 0
        Timer        # Calls    Avg/Call (s)   Total (s)    % Loop
___________________________________________________________________
total                      1      11.7470       11.7470      104.20
loop                       1      11.2737       11.2737      100.00
timestep                  10       1.1273       11.2726       99.99
  position               100       0.0010        0.1023        0.91
  velocity               200       0.0009        0.1809        1.60
  redistribute           101       0.0350        3.5307       31.32
    atomHalo             101       0.0170        1.7203       15.26
  force                  101       0.0747        7.5445       66.92
commHalo                 303       0.0013        0.3916        3.47
commReduce                39       0.0000        0.0001        0.00

Timing Statistics Across 1 Ranks:
        Timer        Rank: Min(s)       Rank: Max(s)      Avg(s)    Stdev(s)
_____________________________________________________________________________
total                0:   11.7470       0:   11.7470     11.7470      0.0000
loop                 0:   11.2737       0:   11.2737     11.2737      0.0000
timestep             0:   11.2726       0:   11.2726     11.2726      0.0000
  position           0:    0.1023       0:    0.1023      0.1023      0.0000
  velocity           0:    0.1809       0:    0.1809      0.1809      0.0000
  redistribute       0:    3.5307       0:    3.5307      3.5307      0.0000
    atomHalo         0:    1.7203       0:    1.7203      1.7203      0.0000
  force              0:    7.5445       0:    7.5445      7.5445      0.0000
commHalo             0:    0.3916       0:    0.3916      0.3916      0.0000
commReduce           0:    0.0001       0:    0.0001      0.0001      0.0000

---------------------------------------------------
 Average atom update rate:       0.03 us/atom/task
---------------------------------------------------


---------------------------------------------------
 Average all atom update rate:   0.03 us/atom
---------------------------------------------------


---------------------------------------------------
 Average atom rate:             35.48 atoms/us
---------------------------------------------------

Tue Mar  4 14:40:05 2025: CoMD Ending



Your experiment path is /home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_13

To display your profiling results:
#####################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                              COMMAND                                                                               #
#####################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_13      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_13  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_13  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_13  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_13      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_13  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_13  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_13  #
#####################################################################################################################################################################################################


* Info: Detected 1 Lprof instances in ip-172-31-47-249.ec2.internal. 
If this is incorrect, rerun with number-processes-per-node=X
Tue Mar  4 14:40:11 2025: Starting Initialization


Mini-Application Name    : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
  hostname: ip-172-31-47-249.ec2.internal
  kernel name: 'Linux'
  kernel release: '6.1.109-118.189.amzn2023.aarch64'
  processor: 'aarch64'
Build:
  CC: '/home/hbollore/soft/openmpi-5.0.6-gnu-14.2.0/bin/mpicc'
  compiler version: 'gcc (GCC) 14.2.0'
  CFLAGS: '-O3 -mcpu=native -DDO_MPI -O3 -mcpu=neoverse-v2 -larmpl -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -fopenmp -funroll-loops'
  LDFLAGS: ' '
  using MPI: true
  Threading: OpenMP (96 threads) 
  Double Precision: true
Run Date/Time: 2025-03-04, 14:40:11

Command Line Parameters:
  doeam: 0
  potDir: pots
  potName: Cu_u6.eam
  potType: funcfl
  nx: 100
  ny: 100
  nz: 100
  xproc: 1
  yproc: 1
  zproc: 1
  Lattice constant: -1 Angstroms
  nSteps: 100
  printRate: 10
  Time step: 1 fs
  Initial Temperature: 600 K
  Initial Delta: 0 Angstroms

Simulation data: 
  Total atoms        : 4000000
  Min global bounds  : [   0.0000000000,   0.0000000000,   0.0000000000 ]
  Max global bounds  : [ 361.5000000000, 361.5000000000, 361.5000000000 ]

Decomposition data: 
  Processors         :      1,     1,     1
  Local boxes        :     62,    62,    62 =   238328
  Box size           : [   5.8306451613,   5.8306451613,   5.8306451613 ]
  Box factor         : [   1.0074548875,   1.0074548875,   1.0074548875 ] 
  Max Link Cell Occupancy: 32 of 64

Potential data: 
  Potential type   : Lennard-Jones
  Species name     : Cu
  Atomic number    : 29
  Mass             : 63.55 amu
  Lattice Type     : FCC
  Lattice spacing  : 3.615 Angstroms
  Cutoff           : 5.7875 Angstroms
  Epsilon          : 0.167 eV
  Sigma            : 2.315 Angstroms


Initial energy : -1.166063303481, atom count : 4000000 

Tue Mar  4 14:40:11 2025: Initialization Finished

Tue Mar  4 14:40:11 2025: Starting simulation

#                                                                                         Performance
#  Loop   Time(fs)       Total Energy   Potential Energy     Kinetic Energy  Temperature   (us/atom)     # Atoms
      0       0.00    -1.166063303481    -1.243619295081     0.077555991600     600.0000     0.0000      4000000
     10      10.00    -1.166059650500    -1.233157709949     0.067098059449     519.0938     0.0240      4000000
     20      20.00    -1.166048438416    -1.208183014318     0.042134575902     325.9677     0.0265      4000000
     30      30.00    -1.166037590737    -1.186586197151     0.020548606414     158.9711     0.0261      4000000
     40      40.00    -1.166042093134    -1.183625399859     0.017583306724     136.0305     0.0261      4000000
     50      50.00    -1.166051684893    -1.193713710258     0.027662025365     214.0030     0.0262      4000000
     60      60.00    -1.166054646931    -1.202662201513     0.036607554582     283.2087     0.0262      4000000
     70      70.00    -1.166052143011    -1.204911990844     0.038859847833     300.6332     0.0262      4000000
     80      80.00    -1.166048803912    -1.203635015020     0.037586211108     290.7799     0.0262      4000000
     90      90.00    -1.166048006780    -1.203820491599     0.037772484818     292.2210     0.0261      4000000
    100     100.00    -1.166049793504    -1.206862845060     0.040813051556     315.7439     0.0261      4000000
Tue Mar  4 14:40:21 2025: Ending simulation



Simulation Validation:
  Initial energy  : -1.166063303481
  Final energy    : -1.166049793504
  eFinal/eInitial : 0.999988
  Final atom count : 4000000, no atoms lost


Timings for Rank 0
        Timer        # Calls    Avg/Call (s)   Total (s)    % Loop
___________________________________________________________________
total                      1      10.8593       10.8593      104.49
loop                       1      10.3927       10.3927      100.00
timestep                  10       1.0391       10.3915       99.99
  position               100       0.0010        0.1031        0.99
  velocity               200       0.0008        0.1629        1.57
  redistribute           101       0.0333        3.3607       32.34
    atomHalo             101       0.0166        1.6755       16.12
  force                  101       0.0678        6.8458       65.87
commHalo                 303       0.0012        0.3784        3.64
commReduce                39       0.0000        0.0001        0.00

Timing Statistics Across 1 Ranks:
        Timer        Rank: Min(s)       Rank: Max(s)      Avg(s)    Stdev(s)
_____________________________________________________________________________
total                0:   10.8593       0:   10.8593     10.8593      0.0000
loop                 0:   10.3927       0:   10.3927     10.3927      0.0000
timestep             0:   10.3915       0:   10.3915     10.3915      0.0000
  position           0:    0.1031       0:    0.1031      0.1031      0.0000
  velocity           0:    0.1629       0:    0.1629      0.1629      0.0000
  redistribute       0:    3.3607       0:    3.3607      3.3607      0.0000
    atomHalo         0:    1.6755       0:    1.6755      1.6755      0.0000
  force              0:    6.8458       0:    6.8458      6.8458      0.0000
commHalo             0:    0.3784       0:    0.3784      0.3784      0.0000
commReduce           0:    0.0001       0:    0.0001      0.0001      0.0000

---------------------------------------------------
 Average atom update rate:       0.03 us/atom/task
---------------------------------------------------


---------------------------------------------------
 Average all atom update rate:   0.03 us/atom
---------------------------------------------------


---------------------------------------------------
 Average atom rate:             38.49 atoms/us
---------------------------------------------------

Tue Mar  4 14:40:22 2025: CoMD Ending



Your experiment path is /home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_14

To display your profiling results:
#####################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                              COMMAND                                                                               #
#####################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_14      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_14  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_14  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_14  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_14      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_14  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_14  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/qaas/qaas-runs/174-108-7908/intel/CoMD/run/oneview_runs/multicore/gcc_1/oneview_results_1741097920/tools/lprof_npsu_run_14  #
#####################################################################################################################################################################################################

×