
Executable Output


* Info: Selecting the 'perf-high-ppn' engine for node o401

* Info: Process launched (host o401, process 570752)
* Info: Process launched (host o401, process 570754)
   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.4

LLNL-CODE-775068

Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /opt/intel/oneapi/mpi/2021.12/bin/mpiicpc
  Compiler Flags:         "-O3 -march=native -O2 -march=haswell -fno-tree-vectorize -fno-openmp-simd -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=g++     -Wall -Wextra  "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 1 threads on rank 0
    0->  0

Input Parameters
================

  Problem Size:
    Zones:                 16 x 16 x 16  (4096 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       2
    Spatial decomp:        2 x 1 x 1 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             2           1 / 2
  (Rx,Ry,Rz) R in XYZ:   2x1x1       1x1x1 / 2x1x1
  (PQR) TOTAL:           2           16 / 32

  Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  16        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       25165824      192.000
  k_plane                       25165824      192.000
  mixelem_to_fraction               4352        0.033
  phi                          104857600      800.000
  phi_out                      104857600      800.000
  psi                          402653184     3072.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          402653184     3072.000
  sigt_zonal                     4194304       32.000
  volume                            4096        0.031
  --------                  ------------    ---------
  TOTAL                       1110455664     8472.104

  Generation Complete!
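
Note on the memory breakdown above: the Megabytes column appears to be Num Elements times 8 bytes, divided by 1024 * 1024. The 8-byte element size (double precision) and the binary megabyte are assumptions inferred from the printed numbers, not taken from the Kripke source. A minimal sketch that reproduces a few rows under those assumptions:

```python
# Assumptions (inferred from the table, not from the Kripke source):
BYTES_PER_ELEMENT = 8            # double-precision field values
BYTES_PER_MEGABYTE = 1024 ** 2   # binary megabytes

# Element counts copied from the "Memory breakdown of Field variables" table.
fields = {
    "data/sigs": 15728640,
    "ell": 2400,
    "phi": 104857600,
    "psi": 402653184,
    "TOTAL": 1110455664,
}


def megabytes(num_elements):
    """Convert an element count to megabytes under the assumptions above."""
    return num_elements * BYTES_PER_ELEMENT / BYTES_PER_MEGABYTE


for name, count in fields.items():
    print(f"{name:<12}{count:>14}{megabytes(count):>12.3f}")

# The psi element count also equals zones * groups * quadrature points
# from the "Problem Size" block (16 x 16 x 16 zones, 1024 groups, 96 points).
zones, groups, directions = 16 ** 3, 1024, 96
psi_elements = zones * groups * directions
```

The same product, 402653184, is what the log later reports as "Number of unknowns".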

Steady State Solve
==================

  iter 0: particle count=1.197998e+09, change=1.000000e+00
  iter 1: particle count=1.801368e+09, change=3.349511e-01
  iter 2: particle count=2.102278e+09, change=1.431351e-01
  iter 3: particle count=2.251810e+09, change=6.640521e-02
  iter 4: particle count=2.325888e+09, change=3.184924e-02
  iter 5: particle count=2.362467e+09, change=1.548355e-02
  iter 6: particle count=2.380471e+09, change=7.563193e-03
  iter 7: particle count=2.389305e+09, change=3.697158e-03
  iter 8: particle count=2.393627e+09, change=1.805479e-03
  iter 9: particle count=2.395735e+09, change=8.801810e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.09923
  LPlusTimes                  10      54.17725
  LTimes                      10      55.50005
  Population                  10       4.30307
  Scattering                  10    1633.91200
  Solve                        1    1799.61574
  Source                      10       0.01678
  SweepSolver                 10      46.31607
  SweepSubdomain             160      34.49251

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.099231,54.177254,55.500052,4.303067,1633.912002,1799.615744,0.016775,46.316066,34.492513

Figures of Merit
================

  Throughput:         2.237440e+06 [unknowns/(second/iteration)]
  Grind time :        4.469394e-07 [(seconds/iteration)/unknowns]
  Sweep efficiency :  74.47203 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 402653184

END
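
Note on the Figures of Merit above: they can be recomputed from the TIMER_NAMES / TIMER_DATA lines. The sweep efficiency formula is printed in the log itself. The throughput formula used here (unknowns times iterations, divided by the Solve time) is an inference from the printed units that happens to match the reported value, so treat it as an assumption rather than Kripke's definition. A minimal sketch:

```python
# Strings copied from the TIMER_NAMES / TIMER_DATA lines of the 1-thread run.
TIMER_NAMES = ("Generate,LPlusTimes,LTimes,Population,Scattering,"
               "Solve,Source,SweepSolver,SweepSubdomain")
TIMER_DATA = ("0.099231,54.177254,55.500052,4.303067,1633.912002,"
              "1799.615744,0.016775,46.316066,34.492513")

# Pair each timer name with its value in seconds.
timers = dict(zip(TIMER_NAMES.split(","),
                  (float(v) for v in TIMER_DATA.split(","))))

UNKNOWNS = 402653184   # "Number of unknowns" from the log
ITERATIONS = 10        # "Number iterations" from the log


def figures_of_merit(timers, unknowns, iterations):
    """Return (throughput, grind_time, sweep_efficiency).

    Throughput is assumed to be unknowns * iterations / Solve seconds;
    grind time is its reciprocal; sweep efficiency follows the formula
    printed in the log.
    """
    throughput = unknowns * iterations / timers["Solve"]
    grind_time = 1.0 / throughput
    sweep_efficiency = (100.0 * timers["SweepSubdomain"]
                        / timers["SweepSolver"])
    return throughput, grind_time, sweep_efficiency


throughput, grind_time, sweep_efficiency = figures_of_merit(
    timers, UNKNOWNS, ITERATIONS)
print(f"Throughput:       {throughput:.6e}")
print(f"Grind time:       {grind_time:.6e}")
print(f"Sweep efficiency: {sweep_efficiency:.5f}")
```

The timer table also shows that Scattering accounts for most of the Solve time in this run (1633.9 of 1799.6 seconds).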

* Info: Process finished (host o401, process 570752)
* Info: Process finished (host o401, process 570754)

Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0

To display your profiling results:
########################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                                COMMAND                                                                                #
########################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0  #
########################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node o401

* Info: Process launched (host o401, process 570963)
* Info: Process launched (host o401, process 570965)

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /opt/intel/oneapi/mpi/2021.12/bin/mpiicpc
  Compiler Flags:         "-O3 -march=native -O2 -march=haswell -fno-tree-vectorize -fno-openmp-simd -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=g++     -Wall -Wextra  "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 2 threads on rank 0
    0->  0    1-> 28

Input Parameters
================

  Problem Size:
    Zones:                 16 x 16 x 16  (4096 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       2
    Spatial decomp:        2 x 1 x 1 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             2           1 / 2
  (Rx,Ry,Rz) R in XYZ:   2x1x1       1x1x1 / 2x1x1
  (PQR) TOTAL:           2           16 / 32

  Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  16        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       25165824      192.000
  k_plane                       25165824      192.000
  mixelem_to_fraction               4352        0.033
  phi                          104857600      800.000
  phi_out                      104857600      800.000
  psi                          402653184     3072.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          402653184     3072.000
  sigt_zonal                     4194304       32.000
  volume                            4096        0.031
  --------                  ------------    ---------
  TOTAL                       1110455664     8472.104

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.197998e+09, change=1.000000e+00
  iter 1: particle count=1.801368e+09, change=3.349511e-01
  iter 2: particle count=2.102278e+09, change=1.431351e-01
  iter 3: particle count=2.251810e+09, change=6.640521e-02
  iter 4: particle count=2.325888e+09, change=3.184924e-02
  iter 5: particle count=2.362467e+09, change=1.548355e-02
  iter 6: particle count=2.380471e+09, change=7.563193e-03
  iter 7: particle count=2.389305e+09, change=3.697158e-03
  iter 8: particle count=2.393627e+09, change=1.805479e-03
  iter 9: particle count=2.395735e+09, change=8.801810e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.09432
  LPlusTimes                  10      28.11304
  LTimes                      10      30.68680
  Population                  10       2.32574
  Scattering                  10     819.95254
  Solve                        1     916.32075
  Source                      10       0.00921
  SweepSolver                 10      29.87011
  SweepSubdomain             160      21.30041

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.094320,28.113039,30.686803,2.325737,819.952537,916.320746,0.009213,29.870110,21.300414

Figures of Merit
================

  Throughput:         4.394238e+06 [unknowns/(second/iteration)]
  Grind time :        2.275707e-07 [(seconds/iteration)/unknowns]
  Sweep efficiency :  71.31013 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 402653184

END

* Info: Process finished (host o401, process 570963)
* Info: Process finished (host o401, process 570965)

Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1

To display your profiling results:
########################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                                COMMAND                                                                                #
########################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1  #
########################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node o401

* Info: Process launched (host o401, process 571123)
* Info: Process launched (host o401, process 571125)

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /opt/intel/oneapi/mpi/2021.12/bin/mpiicpc
  Compiler Flags:         "-O3 -march=native -O2 -march=haswell -fno-tree-vectorize -fno-openmp-simd -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=g++     -Wall -Wextra  "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 4 threads on rank 0
    0->  0    1-> 14    2-> 28    3-> 42

Input Parameters
================

  Problem Size:
    Zones:                 16 x 16 x 16  (4096 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       2
    Spatial decomp:        2 x 1 x 1 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             2           1 / 2
  (Rx,Ry,Rz) R in XYZ:   2x1x1       1x1x1 / 2x1x1
  (PQR) TOTAL:           2           16 / 32

  Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  16        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       25165824      192.000
  k_plane                       25165824      192.000
  mixelem_to_fraction               4352        0.033
  phi                          104857600      800.000
  phi_out                      104857600      800.000
  psi                          402653184     3072.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          402653184     3072.000
  sigt_zonal                     4194304       32.000
  volume                            4096        0.031
  --------                  ------------    ---------
  TOTAL                       1110455664     8472.104

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.197998e+09, change=1.000000e+00
  iter 1: particle count=1.801368e+09, change=3.349511e-01
  iter 2: particle count=2.102278e+09, change=1.431351e-01
  iter 3: particle count=2.251810e+09, change=6.640521e-02
  iter 4: particle count=2.325888e+09, change=3.184924e-02
  iter 5: particle count=2.362467e+09, change=1.548355e-02
  iter 6: particle count=2.380471e+09, change=7.563193e-03
  iter 7: particle count=2.389305e+09, change=3.697158e-03
  iter 8: particle count=2.393627e+09, change=1.805479e-03
  iter 9: particle count=2.395735e+09, change=8.801810e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.10382
  LPlusTimes                  10      13.96254
  LTimes                      10      16.94341
  Population                  10       1.11011
  Scattering                  10     415.74811
  Solve                        1     471.36520
  Source                      10       0.00542
  SweepSolver                 10      18.23974
  SweepSubdomain             160      11.02036

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.103824,13.962536,16.943412,1.110111,415.748112,471.365196,0.005420,18.239741,11.020357

Figures of Merit
================

  Throughput:         8.542277e+06 [unknowns/(second/iteration)]
  Grind time :        1.170648e-07 [(seconds/iteration)/unknowns]
  Sweep efficiency :  60.41948 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 402653184

END

* Info: Process finished (host o401, process 571125)
* Info: Process finished (host o401, process 571123)

Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2

To display your profiling results:
########################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                                COMMAND                                                                                #
########################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2  #
########################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node o401

* Info: Process launched (host o401, process 571266)
* Info: Process launched (host o401, process 571268)

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /opt/intel/oneapi/mpi/2021.12/bin/mpiicpc
  Compiler Flags:         "-O3 -march=native -O2 -march=haswell -fno-tree-vectorize -fno-openmp-simd -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=g++     -Wall -Wextra  "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 8 threads on rank 0
    0->  0    1->  7    2-> 14    3-> 21    4-> 28    5-> 35    6-> 42    7-> 49

Input Parameters
================

  Problem Size:
    Zones:                 16 x 16 x 16  (4096 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       2
    Spatial decomp:        2 x 1 x 1 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             2           1 / 2
  (Rx,Ry,Rz) R in XYZ:   2x1x1       1x1x1 / 2x1x1
  (PQR) TOTAL:           2           16 / 32

  Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  16        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       25165824      192.000
  k_plane                       25165824      192.000
  mixelem_to_fraction               4352        0.033
  phi                          104857600      800.000
  phi_out                      104857600      800.000
  psi                          402653184     3072.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          402653184     3072.000
  sigt_zonal                     4194304       32.000
  volume                            4096        0.031
  --------                  ------------    ---------
  TOTAL                       1110455664     8472.104

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.197998e+09, change=1.000000e+00
  iter 1: particle count=1.801368e+09, change=3.349511e-01
  iter 2: particle count=2.102278e+09, change=1.431351e-01
  iter 3: particle count=2.251810e+09, change=6.640521e-02
  iter 4: particle count=2.325888e+09, change=3.184924e-02
  iter 5: particle count=2.362467e+09, change=1.548355e-02
  iter 6: particle count=2.380471e+09, change=7.563193e-03
  iter 7: particle count=2.389305e+09, change=3.697158e-03
  iter 8: particle count=2.393627e+09, change=1.805479e-03
  iter 9: particle count=2.395735e+09, change=8.801810e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.09190
  LPlusTimes                  10       8.19369
  LTimes                      10       9.75829
  Population                  10       1.13048
  Scattering                  10     218.09247
  Solve                        1     250.24649
  Source                      10       0.00354
  SweepSolver                 10       7.73954
  SweepSubdomain             160       5.53984

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.091896,8.193685,9.758293,1.130478,218.092475,250.246486,0.003537,7.739544,5.539837

Figures of Merit
================

  Throughput:         1.609026e+07 [unknowns/(second/iteration)]
  Grind time :        6.214939e-08 [(seconds/iteration)/unknowns]
  Sweep efficiency :  71.57834 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 402653184

END
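
Note on scaling: the four runs above use 1, 2, 4, and 8 OpenMP threads per rank with the same 2 MPI tasks and the same problem size, so their Solve timers can be compared directly. The sketch below computes speedup and parallel efficiency relative to the 1-thread run. The choice of the Solve timer as the metric and of the 1-thread run as the baseline is an assumption made for illustration.

```python
# Solve timer (seconds) from each run's Timers table,
# keyed by OpenMP threads per rank.
solve_seconds = {
    1: 1799.61574,
    2: 916.32075,
    4: 471.36520,
    8: 250.24649,
}

baseline = solve_seconds[1]  # assumption: 1-thread run is the reference

# Map thread count -> (speedup, parallel efficiency in percent).
scaling = {}
for threads, seconds in sorted(solve_seconds.items()):
    speedup = baseline / seconds
    efficiency = 100.0 * speedup / threads
    scaling[threads] = (speedup, efficiency)
    print(f"{threads:>2} threads: speedup {speedup:5.2f}, "
          f"efficiency {efficiency:5.1f}%")
```

Under these assumptions the efficiency falls gradually as threads are added, from about 98% at 2 threads to about 90% at 8 threads.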

* Info: Process finished (host o401, process 571266)
* Info: Process finished (host o401, process 571268)

Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3

To display your profiling results:
########################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                                COMMAND                                                                                #
########################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3  #
########################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node o401

* Info: Process launched (host o401, process 571426)
* Info: Process launched (host o401, process 571428)
   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.4

LLNL-CODE-775068

Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /opt/intel/oneapi/mpi/2021.12/bin/mpiicpc
  Compiler Flags:         "-O3 -march=native -O2 -march=haswell -fno-tree-vectorize -fno-openmp-simd -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=g++     -Wall -Wextra  "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 16 threads on rank 0
    0->  0    1->115    2->  7    3->122    4-> 14    5->129    6-> 21    7->136
    8-> 28    9->143   10-> 35   11->150   12-> 42   13->157   14-> 49   15->164

Input Parameters
================

  Problem Size:
    Zones:                 16 x 16 x 16  (4096 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       2
    Spatial decomp:        2 x 1 x 1 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             2           1 / 2
  (Rx,Ry,Rz) R in XYZ:   2x1x1       1x1x1 / 2x1x1
  (PQR) TOTAL:           2           16 / 32

  Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  16        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       25165824      192.000
  k_plane                       25165824      192.000
  mixelem_to_fraction               4352        0.033
  phi                          104857600      800.000
  phi_out                      104857600      800.000
  psi                          402653184     3072.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          402653184     3072.000
  sigt_zonal                     4194304       32.000
  volume                            4096        0.031
  --------                  ------------    ---------
  TOTAL                       1110455664     8472.104

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.197998e+09, change=1.000000e+00
  iter 1: particle count=1.801368e+09, change=3.349511e-01
  iter 2: particle count=2.102278e+09, change=1.431351e-01
  iter 3: particle count=2.251810e+09, change=6.640521e-02
  iter 4: particle count=2.325888e+09, change=3.184924e-02
  iter 5: particle count=2.362467e+09, change=1.548355e-02
  iter 6: particle count=2.380471e+09, change=7.563193e-03
  iter 7: particle count=2.389305e+09, change=3.697158e-03
  iter 8: particle count=2.393627e+09, change=1.805479e-03
  iter 9: particle count=2.395735e+09, change=8.801810e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.09165
  LPlusTimes                  10       4.72703
  LTimes                      10       6.18573
  Population                  10       0.40650
  Scattering                  10     115.15252
  Solve                        1     136.49909
  Source                      10       0.00280
  SweepSolver                 10       4.68193
  SweepSubdomain             160       2.85956

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.091647,4.727033,6.185730,0.406503,115.152518,136.499087,0.002797,4.681925,2.859563

Figures of Merit
================

  Throughput:         2.949860e+07 [unknowns/(second/iteration)]
  Grind time :        3.389991e-08 [(seconds/iteration)/unknowns]
  Sweep efficiency :  61.07665 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 402653184

END

* Info: Process finished (host o401, process 571428)
* Info: Process finished (host o401, process 571426)

Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4

To display your profiling results:
########################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                                COMMAND                                                                                #
########################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4  #
########################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node o401

* Info: Process launched (host o401, process 571647)
* Info: Process launched (host o401, process 571649)
   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.4

LLNL-CODE-775068

Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /opt/intel/oneapi/mpi/2021.12/bin/mpiicpc
  Compiler Flags:         "-O3 -march=native -O2 -march=haswell -fno-tree-vectorize -fno-openmp-simd -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=g++     -Wall -Wextra  "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 32 threads on rank 0
    0->  0    1->  2    2->  4    3->  6    4->  8    5-> 10    6-> 12    7-> 14
    8-> 16    9-> 18   10-> 20   11-> 22   12-> 24   13-> 26   14-> 28   15-> 30
   16-> 32   17->145   18-> 35   19->148   20-> 38   21->151   22-> 41   23->154
   24-> 44   25->157   26-> 47   27->160   28-> 50   29->163   30-> 53   31->166

Input Parameters
================

  Problem Size:
    Zones:                 16 x 16 x 16  (4096 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       2
    Spatial decomp:        2 x 1 x 1 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             2           1 / 2
  (Rx,Ry,Rz) R in XYZ:   2x1x1       1x1x1 / 2x1x1
  (PQR) TOTAL:           2           16 / 32

  Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  16        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       25165824      192.000
  k_plane                       25165824      192.000
  mixelem_to_fraction               4352        0.033
  phi                          104857600      800.000
  phi_out                      104857600      800.000
  psi                          402653184     3072.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          402653184     3072.000
  sigt_zonal                     4194304       32.000
  volume                            4096        0.031
  --------                  ------------    ---------
  TOTAL                       1110455664     8472.104

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.197998e+09, change=1.000000e+00
  iter 1: particle count=1.801368e+09, change=3.349511e-01
  iter 2: particle count=2.102278e+09, change=1.431351e-01
  iter 3: particle count=2.251810e+09, change=6.640521e-02
  iter 4: particle count=2.325888e+09, change=3.184924e-02
  iter 5: particle count=2.362467e+09, change=1.548355e-02
  iter 6: particle count=2.380471e+09, change=7.563193e-03
  iter 7: particle count=2.389305e+09, change=3.697158e-03
  iter 8: particle count=2.393627e+09, change=1.805479e-03
  iter 9: particle count=2.395735e+09, change=8.801810e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.09175
  LPlusTimes                  10       3.91778
  LTimes                      10       4.82476
  Population                  10       0.38046
  Scattering                  10      61.00209
  Solve                        1      78.60612
  Source                      10       0.00243
  SweepSolver                 10       3.13457
  SweepSubdomain             160       1.61753

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.091754,3.917781,4.824755,0.380461,61.002094,78.606120,0.002432,3.134565,1.617531

Figures of Merit
================

  Throughput:         5.122415e+07 [unknowns/(second/iteration)]
  Grind time :        1.952204e-08 [(seconds/iteration)/unknowns]
  Sweep efficiency :  51.60304 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 402653184

END

* Info: Process finished (host o401, process 571649)
* Info: Process finished (host o401, process 571647)

Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5

To display your profiling results:
########################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                                COMMAND                                                                                #
########################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5  #
########################################################################################################################################################################################################


* Info: Selecting the 'perf-high-ppn' engine for node o401

* Info: Process launched (host o401, process 571998)
* Info: Process launched (host o401, process 572000)
   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.4

LLNL-CODE-775068

Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /opt/intel/oneapi/mpi/2021.12/bin/mpiicpc
  Compiler Flags:         "-O3 -march=native -O2 -march=haswell -fno-tree-vectorize -fno-openmp-simd -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=g++     -Wall -Wextra  "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 56 threads on rank 0
    0->  0    1->  1    2->  2    3->  3    4->  4    5->  5    6->  6    7->  7
    8->  8    9->  9   10-> 10   11-> 11   12-> 12   13-> 13   14-> 14   15-> 15
   16-> 16   17-> 17   18-> 18   19-> 19   20-> 20   21-> 21   22-> 22   23-> 23
   24-> 24   25-> 25   26-> 26   27-> 27   28-> 28   29-> 29   30-> 30   31-> 31
   32-> 32   33-> 33   34-> 34   35-> 35   36-> 36   37-> 37   38-> 38   39-> 39
   40-> 40   41-> 41   42-> 42   43-> 43   44-> 44   45-> 45   46-> 46   47-> 47
   48-> 48   49-> 49   50-> 50   51-> 51   52-> 52   53-> 53   54-> 54   55-> 55

Input Parameters
================

  Problem Size:
    Zones:                 16 x 16 x 16  (4096 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       2
    Spatial decomp:        2 x 1 x 1 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             2           1 / 2
  (Rx,Ry,Rz) R in XYZ:   2x1x1       1x1x1 / 2x1x1
  (PQR) TOTAL:           2           16 / 32

  Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  16        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       25165824      192.000
  k_plane                       25165824      192.000
  mixelem_to_fraction               4352        0.033
  phi                          104857600      800.000
  phi_out                      104857600      800.000
  psi                          402653184     3072.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          402653184     3072.000
  sigt_zonal                     4194304       32.000
  volume                            4096        0.031
  --------                  ------------    ---------
  TOTAL                       1110455664     8472.104

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.197998e+09, change=1.000000e+00
  iter 1: particle count=1.801368e+09, change=3.349511e-01
  iter 2: particle count=2.102278e+09, change=1.431351e-01
  iter 3: particle count=2.251810e+09, change=6.640521e-02
  iter 4: particle count=2.325888e+09, change=3.184924e-02
  iter 5: particle count=2.362467e+09, change=1.548355e-02
  iter 6: particle count=2.380471e+09, change=7.563193e-03
  iter 7: particle count=2.389305e+09, change=3.697158e-03
  iter 8: particle count=2.393627e+09, change=1.805479e-03
  iter 9: particle count=2.395735e+09, change=8.801810e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.09159
  LPlusTimes                  10       5.14745
  LTimes                      10       4.77127
  Population                  10       0.78831
  Scattering                  10      49.91892
  Solve                        1      69.28979
  Source                      10       0.00218
  SweepSolver                 10       3.32337
  SweepSubdomain             160       1.69590

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.091585,5.147448,4.771266,0.788306,49.918920,69.289795,0.002179,3.323375,1.695904

Figures of Merit
================

  Throughput:         5.811147e+07 [unknowns/(second/iteration)]
  Grind time :        1.720831e-08 [(seconds/iteration)/unknowns]
  Sweep efficiency :  51.02958 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 402653184

END

* Info: Process finished (host o401, process 572000)
* Info: Process finished (host o401, process 571998)

Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6

To display your profiling results:
########################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                                COMMAND                                                                                #
########################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6  #
########################################################################################################################################################################################################
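
The figures of merit printed by each run can be cross-checked against the machine-readable TIMER_NAMES / TIMER_DATA lines. The following is a minimal Python sketch, not part of Kripke or MAQAO: the function names `parse_timers` and `figures_of_merit` are hypothetical, and the throughput formula (unknowns divided by Solve seconds per iteration) is an assumption inferred from the printed units; it agrees with the values shown above. The sample values are copied from the 56-thread run.

```python
def parse_timers(log_text):
    """Return {timer name: seconds} from the TIMER_NAMES / TIMER_DATA lines."""
    names, values = None, None
    for line in log_text.splitlines():
        line = line.strip()
        if line.startswith("TIMER_NAMES:"):
            names = line.split(":", 1)[1].split(",")
        elif line.startswith("TIMER_DATA:"):
            values = [float(v) for v in line.split(":", 1)[1].split(",")]
    if names is None or values is None or len(names) != len(values):
        raise ValueError("TIMER_NAMES/TIMER_DATA lines missing or mismatched")
    return dict(zip(names, values))


def figures_of_merit(timers, unknowns, iterations):
    """Recompute the three figures of merit from the timer values.

    Assumption: throughput = unknowns / (Solve seconds / iterations),
    inferred from the units printed in the log.
    """
    seconds_per_iteration = timers["Solve"] / iterations
    throughput = unknowns / seconds_per_iteration
    return {
        "throughput": throughput,
        "grind_time": 1.0 / throughput,
        # Formula as printed in the log:
        # 100.0 * SweepSubdomain time / SweepSolver time
        "sweep_efficiency": 100.0 * timers["SweepSubdomain"] / timers["SweepSolver"],
    }


# Timer lines copied from the 56-thread run in this log.
SAMPLE = """\
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.091585,5.147448,4.771266,0.788306,49.918920,69.289795,0.002179,3.323375,1.695904
"""

timers = parse_timers(SAMPLE)
fom = figures_of_merit(timers, unknowns=402653184, iterations=10)
for key, value in fom.items():
    print(f"{key}: {value:.6e}")
```

The same two functions can be applied to the timer lines of each run to compare Solve time or throughput across thread counts.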
