Executable Output


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 115663)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 115668)
   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.4

LLNL-CODE-775068

Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
  Compiler Flags:         "-O3 -march=native -O2 -xSAPPHIRERAPIDS -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=icpx     -Wall -Wextra  "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 1 threads on rank 0
    0->  0

Input Parameters
================

  Problem Size:
    Zones:                 16 x 16 x 16  (4096 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       2
    Spatial decomp:        2 x 1 x 1 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             2           1 / 2
  (Rx,Ry,Rz) R in XYZ:   2x1x1       1x1x1 / 2x1x1
  (PQR) TOTAL:           2           16 / 32

  Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  16        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       25165824      192.000
  k_plane                       25165824      192.000
  mixelem_to_fraction               4352        0.033
  phi                          104857600      800.000
  phi_out                      104857600      800.000
  psi                          402653184     3072.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          402653184     3072.000
  sigt_zonal                     4194304       32.000
  volume                            4096        0.031
  --------                  ------------    ---------
  TOTAL                       1110455664     8472.104

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.197998e+09, change=1.000000e+00
  iter 1: particle count=1.801368e+09, change=3.349511e-01
  iter 2: particle count=2.102278e+09, change=1.431351e-01
  iter 3: particle count=2.251810e+09, change=6.640521e-02
  iter 4: particle count=2.325888e+09, change=3.184924e-02
  iter 5: particle count=2.362467e+09, change=1.548355e-02
  iter 6: particle count=2.380471e+09, change=7.563193e-03
  iter 7: particle count=2.389305e+09, change=3.697158e-03
  iter 8: particle count=2.393627e+09, change=1.805479e-03
  iter 9: particle count=2.395735e+09, change=8.801810e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.04195
  LPlusTimes                  10      22.15931
  LTimes                      10      23.37315
  Population                  10       8.55787
  Scattering                  10    1160.24743
  Solve                        1    1237.90936
  Source                      10       0.11748
  SweepSolver                 10      19.66133
  SweepSubdomain             160      18.26552

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.041948,22.159314,23.373153,8.557865,1160.247426,1237.909362,0.117478,19.661333,18.265518

Figures of Merit
================

  Throughput:         3.252687e+06 [unknowns/(second/iteration)]
  Grind time :        3.074381e-07 [(seconds/iteration)/unknowns]
  Sweep efficiency :  92.90071 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 402653184

END

* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 115668)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 115663)
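The figures of merit printed for this run can be cross-checked against its TIMER_NAMES / TIMER_DATA lines. The sketch below is not part of Kripke or MAQAO; the helper name `parse_timers` is hypothetical. The sweep-efficiency formula is the one printed in the log. The throughput formula (unknowns times iterations divided by the Solve timer) is an assumption that happens to reproduce the printed value for this run.

```python
# Sketch: recompute the figures of merit of the 1-thread run from its TIMER_* lines.
# parse_timers is a hypothetical helper, not a Kripke or MAQAO function.

NAMES_LINE = ("TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,"
              "Solve,Source,SweepSolver,SweepSubdomain")
DATA_LINE = ("TIMER_DATA:0.041948,22.159314,23.373153,8.557865,1160.247426,"
             "1237.909362,0.117478,19.661333,18.265518")


def parse_timers(names_line, data_line):
    """Pair each timer name with its value in seconds."""
    names = names_line.split(":", 1)[1].split(",")
    values = [float(v) for v in data_line.split(":", 1)[1].split(",")]
    if len(names) != len(values):
        raise ValueError("TIMER_NAMES and TIMER_DATA have different lengths")
    return dict(zip(names, values))


timers = parse_timers(NAMES_LINE, DATA_LINE)

unknowns = 402653184   # "Number of unknowns" from the log
iterations = 10        # "Number iterations" from the log

# Assumed definition: unknowns processed per second of Solve time, per iteration.
throughput = unknowns * iterations / timers["Solve"]
grind_time = 1.0 / throughput

# Definition printed in the log: 100.0 * SweepSubdomain time / SweepSolver time.
sweep_efficiency = 100.0 * timers["SweepSubdomain"] / timers["SweepSolver"]

# Fraction of the Solve time spent in the Scattering kernel.
scattering_share = timers["Scattering"] / timers["Solve"]

print(f"Throughput:       {throughput:.6e}")
print(f"Grind time:       {grind_time:.6e}")
print(f"Sweep efficiency: {sweep_efficiency:.5f}")
print(f"Scattering share: {scattering_share:.3f}")
```

The same lines from the other runs can be pasted in to repeat the check; in this run the Scattering timer accounts for most of the Solve time.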

Your experiment path is /home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0

To display your profiling results:
#################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                            COMMAND                                                                             #
#################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0  #
#################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 115805)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 115810)

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
  Compiler Flags:         "-O3 -march=native -O2 -xSAPPHIRERAPIDS -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=icpx     -Wall -Wextra  "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 2 threads on rank 0
    0->  0    1-> 24

Input Parameters
================

  Problem Size:
    Zones:                 16 x 16 x 16  (4096 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       2
    Spatial decomp:        2 x 1 x 1 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             2           1 / 2
  (Rx,Ry,Rz) R in XYZ:   2x1x1       1x1x1 / 2x1x1
  (PQR) TOTAL:           2           16 / 32

  Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  16        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       25165824      192.000
  k_plane                       25165824      192.000
  mixelem_to_fraction               4352        0.033
  phi                          104857600      800.000
  phi_out                      104857600      800.000
  psi                          402653184     3072.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          402653184     3072.000
  sigt_zonal                     4194304       32.000
  volume                            4096        0.031
  --------                  ------------    ---------
  TOTAL                       1110455664     8472.104

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.197998e+09, change=1.000000e+00
  iter 1: particle count=1.801368e+09, change=3.349511e-01
  iter 2: particle count=2.102278e+09, change=1.431351e-01
  iter 3: particle count=2.251810e+09, change=6.640521e-02
  iter 4: particle count=2.325888e+09, change=3.184924e-02
  iter 5: particle count=2.362467e+09, change=1.548355e-02
  iter 6: particle count=2.380471e+09, change=7.563193e-03
  iter 7: particle count=2.389305e+09, change=3.697158e-03
  iter 8: particle count=2.393627e+09, change=1.805479e-03
  iter 9: particle count=2.395735e+09, change=8.801810e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.04259
  LPlusTimes                  10      11.17036
  LTimes                      10      12.52751
  Population                  10       5.06895
  Scattering                  10     579.54534
  Solve                        1     625.56195
  Source                      10       0.06132
  SweepSolver                 10      13.16952
  SweepSubdomain             160      11.24114

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.042591,11.170359,12.527508,5.068955,579.545337,625.561952,0.061316,13.169523,11.241136

Figures of Merit
================

  Throughput:         6.436664e+06 [unknowns/(second/iteration)]
  Grind time :        1.553600e-07 [(seconds/iteration)/unknowns]
  Sweep efficiency :  85.35720 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 402653184

END

* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 115810)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 115805)

Your experiment path is /home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1

To display your profiling results:
#################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                            COMMAND                                                                             #
#################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1  #
#################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 115916)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 115921)

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
  Compiler Flags:         "-O3 -march=native -O2 -xSAPPHIRERAPIDS -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=icpx     -Wall -Wextra  "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 4 threads on rank 0
    0->  0    1-> 12    2-> 24    3-> 36

Input Parameters
================

  Problem Size:
    Zones:                 16 x 16 x 16  (4096 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       2
    Spatial decomp:        2 x 1 x 1 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             2           1 / 2
  (Rx,Ry,Rz) R in XYZ:   2x1x1       1x1x1 / 2x1x1
  (PQR) TOTAL:           2           16 / 32

  Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  16        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       25165824      192.000
  k_plane                       25165824      192.000
  mixelem_to_fraction               4352        0.033
  phi                          104857600      800.000
  phi_out                      104857600      800.000
  psi                          402653184     3072.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          402653184     3072.000
  sigt_zonal                     4194304       32.000
  volume                            4096        0.031
  --------                  ------------    ---------
  TOTAL                       1110455664     8472.104

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.197998e+09, change=1.000000e+00
  iter 1: particle count=1.801368e+09, change=3.349511e-01
  iter 2: particle count=2.102278e+09, change=1.431351e-01
  iter 3: particle count=2.251810e+09, change=6.640521e-02
  iter 4: particle count=2.325888e+09, change=3.184924e-02
  iter 5: particle count=2.362467e+09, change=1.548355e-02
  iter 6: particle count=2.380471e+09, change=7.563193e-03
  iter 7: particle count=2.389305e+09, change=3.697158e-03
  iter 8: particle count=2.393627e+09, change=1.805479e-03
  iter 9: particle count=2.395735e+09, change=8.801810e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.04238
  LPlusTimes                  10       5.96350
  LTimes                      10       6.35309
  Population                  10       2.41175
  Scattering                  10     290.57646
  Solve                        1     317.08123
  Source                      10       0.03232
  SweepSolver                 10       7.54883
  SweepSubdomain             160       5.90563

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.042382,5.963496,6.353088,2.411746,290.576460,317.081233,0.032320,7.548835,5.905629

Figures of Merit
================

  Throughput:         1.269874e+07 [unknowns/(second/iteration)]
  Grind time :        7.874798e-08 [(seconds/iteration)/unknowns]
  Sweep efficiency :  78.23233 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 402653184

END

* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 115921)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 115916)

Your experiment path is /home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2

To display your profiling results:
#################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                            COMMAND                                                                             #
#################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2  #
#################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 116018)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 116024)

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
  Compiler Flags:         "-O3 -march=native -O2 -xSAPPHIRERAPIDS -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=icpx     -Wall -Wextra  "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 8 threads on rank 0
    0->  0    1->  6    2-> 12    3-> 18    4-> 24    5-> 30    6-> 36    7-> 42

Input Parameters
================

  Problem Size:
    Zones:                 16 x 16 x 16  (4096 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       2
    Spatial decomp:        2 x 1 x 1 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             2           1 / 2
  (Rx,Ry,Rz) R in XYZ:   2x1x1       1x1x1 / 2x1x1
  (PQR) TOTAL:           2           16 / 32

  Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  16        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       25165824      192.000
  k_plane                       25165824      192.000
  mixelem_to_fraction               4352        0.033
  phi                          104857600      800.000
  phi_out                      104857600      800.000
  psi                          402653184     3072.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          402653184     3072.000
  sigt_zonal                     4194304       32.000
  volume                            4096        0.031
  --------                  ------------    ---------
  TOTAL                       1110455664     8472.104

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.197998e+09, change=1.000000e+00
  iter 1: particle count=1.801368e+09, change=3.349511e-01
  iter 2: particle count=2.102278e+09, change=1.431351e-01
  iter 3: particle count=2.251810e+09, change=6.640521e-02
  iter 4: particle count=2.325888e+09, change=3.184924e-02
  iter 5: particle count=2.362467e+09, change=1.548355e-02
  iter 6: particle count=2.380471e+09, change=7.563193e-03
  iter 7: particle count=2.389305e+09, change=3.697158e-03
  iter 8: particle count=2.393627e+09, change=1.805479e-03
  iter 9: particle count=2.395735e+09, change=8.801810e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.04076
  LPlusTimes                  10       3.44268
  LTimes                      10       3.51172
  Population                  10       1.25713
  Scattering                  10     147.00756
  Solve                        1     164.11541
  Source                      10       0.01930
  SweepSolver                 10       4.79181
  SweepSubdomain             160       3.21864

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.040759,3.442679,3.511724,1.257130,147.007564,164.115412,0.019302,4.791809,3.218641

Figures of Merit
================

  Throughput:         2.453476e+07 [unknowns/(second/iteration)]
  Grind time :        4.075850e-08 [(seconds/iteration)/unknowns]
  Sweep efficiency :  67.16964 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 402653184

END

* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 116024)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 116018)

Your experiment path is /home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3

To display your profiling results:
#################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                            COMMAND                                                                             #
#################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3  #
#################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 116138)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 116143)
   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.4

LLNL-CODE-775068

Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
  Compiler Flags:         "-O3 -march=native -O2 -xSAPPHIRERAPIDS -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=icpx     -Wall -Wextra  "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 16 threads on rank 0
    0->  0    1->  3    2->  6    3->  9    4-> 12    5-> 15    6-> 18    7-> 21
    8-> 24    9-> 27   10-> 30   11-> 33   12-> 36   13-> 39   14-> 42   15-> 45

Input Parameters
================

  Problem Size:
    Zones:                 16 x 16 x 16  (4096 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       2
    Spatial decomp:        2 x 1 x 1 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             2           1 / 2
  (Rx,Ry,Rz) R in XYZ:   2x1x1       1x1x1 / 2x1x1
  (PQR) TOTAL:           2           16 / 32

  Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  16        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       25165824      192.000
  k_plane                       25165824      192.000
  mixelem_to_fraction               4352        0.033
  phi                          104857600      800.000
  phi_out                      104857600      800.000
  psi                          402653184     3072.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          402653184     3072.000
  sigt_zonal                     4194304       32.000
  volume                            4096        0.031
  --------                  ------------    ---------
  TOTAL                       1110455664     8472.104

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.197998e+09, change=1.000000e+00
  iter 1: particle count=1.801368e+09, change=3.349511e-01
  iter 2: particle count=2.102278e+09, change=1.431351e-01
  iter 3: particle count=2.251810e+09, change=6.640521e-02
  iter 4: particle count=2.325888e+09, change=3.184924e-02
  iter 5: particle count=2.362467e+09, change=1.548355e-02
  iter 6: particle count=2.380471e+09, change=7.563193e-03
  iter 7: particle count=2.389305e+09, change=3.697158e-03
  iter 8: particle count=2.393627e+09, change=1.805479e-03
  iter 9: particle count=2.395735e+09, change=8.801810e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.03792
  LPlusTimes                  10       1.94862
  LTimes                      10       2.19973
  Population                  10       0.25538
  Scattering                  10      77.30844
  Solve                        1      91.72583
  Source                      10       0.00859
  SweepSolver                 10       5.91798
  SweepSubdomain             160       1.66461

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.037920,1.948616,2.199732,0.255380,77.308443,91.725827,0.008591,5.917984,1.664609

Figures of Merit
================

  Throughput:         4.389747e+07 [unknowns/(second/iteration)]
  Grind time :        2.278036e-08 [(seconds/iteration)/unknowns]
  Sweep efficiency :  28.12797 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 402653184

END

* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 116143)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 116138)

Your experiment path is /home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4

To display your profiling results:
#################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                            COMMAND                                                                             #
#################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4  #
#################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 116268)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 116273)
   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.4

LLNL-CODE-775068

Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
  Compiler Flags:         "-O3 -march=native -O2 -xSAPPHIRERAPIDS -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=icpx     -Wall -Wextra  "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 32 threads on rank 0
    0->  0    1-> 97    2->  3    3->100    4->  6    5->103    6->  9    7->106
    8-> 12    9->109   10-> 15   11->112   12-> 18   13->115   14-> 21   15->118
   16-> 24   17->121   18-> 27   19->124   20-> 30   21->127   22-> 33   23->130
   24-> 36   25->133   26-> 39   27->136   28-> 42   29->139   30-> 45   31->142

Input Parameters
================

  Problem Size:
    Zones:                 16 x 16 x 16  (4096 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       2
    Spatial decomp:        2 x 1 x 1 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             2           1 / 2
  (Rx,Ry,Rz) R in XYZ:   2x1x1       1x1x1 / 2x1x1
  (PQR) TOTAL:           2           16 / 32

  Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  16        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       25165824      192.000
  k_plane                       25165824      192.000
  mixelem_to_fraction               4352        0.033
  phi                          104857600      800.000
  phi_out                      104857600      800.000
  psi                          402653184     3072.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          402653184     3072.000
  sigt_zonal                     4194304       32.000
  volume                            4096        0.031
  --------                  ------------    ---------
  TOTAL                       1110455664     8472.104

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.197998e+09, change=1.000000e+00
  iter 1: particle count=1.801368e+09, change=3.349511e-01
  iter 2: particle count=2.102278e+09, change=1.431351e-01
  iter 3: particle count=2.251810e+09, change=6.640521e-02
  iter 4: particle count=2.325888e+09, change=3.184924e-02
  iter 5: particle count=2.362467e+09, change=1.548355e-02
  iter 6: particle count=2.380471e+09, change=7.563193e-03
  iter 7: particle count=2.389305e+09, change=3.697158e-03
  iter 8: particle count=2.393627e+09, change=1.805479e-03
  iter 9: particle count=2.395735e+09, change=8.801810e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.04093
  LPlusTimes                  10       1.59042
  LTimes                      10       1.66630
  Population                  10       0.13222
  Scattering                  10      47.18606
  Solve                        1      60.46603
  Source                      10       0.00631
  SweepSolver                 10       5.62719
  SweepSubdomain             160       0.98740

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.040928,1.590425,1.666296,0.132224,47.186056,60.466029,0.006309,5.627190,0.987396

Figures of Merit
================

  Throughput:         6.659164e+07 [unknowns/(second/iteration)]
  Grind time :        1.501690e-08 [(seconds/iteration)/unknowns]
  Sweep efficiency :  17.54687 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 402653184

END

* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 116273)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 116268)

Your experiment path is /home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5

To display your profiling results:
#################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                            COMMAND                                                                             #
#################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5  #
#################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com

* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 116464)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 116469)
   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.4

LLNL-CODE-775068

Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
  Compiler Flags:         "-O3 -march=native -O2 -xSAPPHIRERAPIDS -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=icpx     -Wall -Wextra  "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 48 threads on rank 0
    0->  0    1->  1    2->  2    3->  3    4->  4    5->  5    6->  6    7->  7
    8->  8    9->  9   10-> 10   11-> 11   12-> 12   13-> 13   14-> 14   15-> 15
   16-> 16   17-> 17   18-> 18   19-> 19   20-> 20   21-> 21   22-> 22   23-> 23
   24-> 24   25-> 25   26-> 26   27-> 27   28-> 28   29-> 29   30-> 30   31-> 31
   32-> 32   33-> 33   34-> 34   35-> 35   36-> 36   37-> 37   38-> 38   39-> 39
   40-> 40   41-> 41   42-> 42   43-> 43   44-> 44   45-> 45   46-> 46   47-> 47

Input Parameters
================

  Problem Size:
    Zones:                 16 x 16 x 16  (4096 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       2
    Spatial decomp:        2 x 1 x 1 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             2           1 / 2
  (Rx,Ry,Rz) R in XYZ:   2x1x1       1x1x1 / 2x1x1
  (PQR) TOTAL:           2           16 / 32

  Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  16        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       25165824      192.000
  k_plane                       25165824      192.000
  mixelem_to_fraction               4352        0.033
  phi                          104857600      800.000
  phi_out                      104857600      800.000
  psi                          402653184     3072.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          402653184     3072.000
  sigt_zonal                     4194304       32.000
  volume                            4096        0.031
  --------                  ------------    ---------
  TOTAL                       1110455664     8472.104

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.197998e+09, change=1.000000e+00
  iter 1: particle count=1.801368e+09, change=3.349511e-01
  iter 2: particle count=2.102278e+09, change=1.431351e-01
  iter 3: particle count=2.251810e+09, change=6.640521e-02
  iter 4: particle count=2.325888e+09, change=3.184924e-02
  iter 5: particle count=2.362467e+09, change=1.548355e-02
  iter 6: particle count=2.380471e+09, change=7.563193e-03
  iter 7: particle count=2.389305e+09, change=3.697158e-03
  iter 8: particle count=2.393627e+09, change=1.805479e-03
  iter 9: particle count=2.395735e+09, change=8.801810e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.04013
  LPlusTimes                  10       2.89191
  LTimes                      10       1.69540
  Population                  10       0.12031
  Scattering                  10      38.12629
  Solve                        1      61.28889
  Source                      10       0.00472
  SweepSolver                 10      13.83383
  SweepSubdomain             160       0.79122

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.040127,2.891908,1.695405,0.120311,38.126292,61.288892,0.004720,13.833830,0.791221

Figures of Merit
================

  Throughput:         6.569758e+07 [unknowns/(second/iteration)]
  Grind time :        1.522126e-08 [(seconds/iteration)/unknowns]
  Sweep efficiency :  5.71946 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 402653184

END

* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 116469)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 116464)

Your experiment path is /home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6

To display your profiling results:
#################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                            COMMAND                                                                             #
#################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6  #
#################################################################################################################################################################################################
