* [MAQAO] Info: Detected 1 Lprof instances in ip-172-31-38-240.ec2.internal.
If this is incorrect, rerun with number-processes-per-node=X
OMP: pid 80019 tid 80019 thread 0 bound to OS proc set {0}
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.5-dev
LLNL-CODE-775068
Copyright (c) 2014-25, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/tools/mpi/openmpi-armclang-22.1/bin/mpic++
Compiler Flags: "-O3 -mcpu=native -O3 -mcpu=neoverse-v1 `pkg-config armpl-dynamic-lp64-seq --cflags --libs` -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 1 threads on rank 0
0-> 0
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 1
Spatial decomp: 1 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 1 1 / 1
(Rx,Ry,Rz) R in XYZ: 1x1x1 1x1x1 / 1x1x1
(PQR) TOTAL: 1 16 / 16
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.05141
LPlusTimes 10 92.81886
LTimes 10 94.34494
Population 10 5.09369
Scattering 10 2493.57055
Solve 1 2760.82239
Source 10 0.19390
SweepSolver 10 71.20137
SweepSubdomain 160 70.79065
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.051413,92.818857,94.344937,5.093688,2493.570548,2760.822389,0.193898,71.201372,70.790646
Figures of Merit
================
Throughput: 2.187681e+06 [unknowns/(second/iteration)]
Grind time : 4.571051e-07 [(seconds/iteration)/unknowns]
Sweep efficiency : 99.42315 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END
* [MAQAO] Info: Detected 1 Lprof instances in ip-172-31-38-240.ec2.internal.
If this is incorrect, rerun with number-processes-per-node=X
OMP: pid 81341 tid 81341 thread 0 bound to OS proc set {0}
OMP: pid 81341 tid 81412 thread 1 bound to OS proc set {32}
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.5-dev
LLNL-CODE-775068
Copyright (c) 2014-25, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/tools/mpi/openmpi-armclang-22.1/bin/mpic++
Compiler Flags: "-O3 -mcpu=native -O3 -mcpu=neoverse-v1 `pkg-config armpl-dynamic-lp64-seq --cflags --libs` -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 2 threads on rank 0
0-> 0 1-> 32
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 1
Spatial decomp: 1 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 1 1 / 1
(Rx,Ry,Rz) R in XYZ: 1x1x1 1x1x1 / 1x1x1
(PQR) TOTAL: 1 16 / 16
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.05089
LPlusTimes 10 45.65687
LTimes 10 47.72784
Population 10 2.48047
Scattering 10 1281.77897
Solve 1 1417.18129
Source 10 0.10124
SweepSolver 10 35.83204
SweepSubdomain 160 35.42023
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.050891,45.656872,47.727837,2.480469,1281.778971,1417.181291,0.101239,35.832038,35.420227
Figures of Merit
================
Throughput: 4.261838e+06 [unknowns/(second/iteration)]
Grind time : 2.346405e-07 [(seconds/iteration)/unknowns]
Sweep efficiency : 98.85072 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END
* [MAQAO] Info: Detected 1 Lprof instances in ip-172-31-38-240.ec2.internal.
If this is incorrect, rerun with number-processes-per-node=X
OMP: pid 82138 tid 82138 thread 0 bound to OS proc set {0}
OMP: pid 82138 tid 82210 thread 2 bound to OS proc set {32}
OMP: pid 82138 tid 82209 thread 1 bound to OS proc set {16}
OMP: pid 82138 tid 82211 thread 3 bound to OS proc set {48}
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.5-dev
LLNL-CODE-775068
Copyright (c) 2014-25, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/tools/mpi/openmpi-armclang-22.1/bin/mpic++
Compiler Flags: "-O3 -mcpu=native -O3 -mcpu=neoverse-v1 `pkg-config armpl-dynamic-lp64-seq --cflags --libs` -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 4 threads on rank 0
0-> 0 1-> 16 2-> 32 3-> 48
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 1
Spatial decomp: 1 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 1 1 / 1
(Rx,Ry,Rz) R in XYZ: 1x1x1 1x1x1 / 1x1x1
(PQR) TOTAL: 1 16 / 16
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.05170
LPlusTimes 10 22.81982
LTimes 10 24.27408
Population 10 1.24758
Scattering 10 649.14041
Solve 1 719.25205
Source 10 0.05069
SweepSolver 10 18.11448
SweepSubdomain 160 17.70220
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.051700,22.819824,24.274078,1.247582,649.140406,719.252048,0.050692,18.114484,17.702204
Figures of Merit
================
Throughput: 8.397331e+06 [unknowns/(second/iteration)]
Grind time : 1.190855e-07 [(seconds/iteration)/unknowns]
Sweep efficiency : 97.72403 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END
* [MAQAO] Info: Detected 1 Lprof instances in ip-172-31-38-240.ec2.internal.
If this is incorrect, rerun with number-processes-per-node=X
OMP: pid 82592 tid 82592 thread 0 bound to OS proc set {0}
OMP: pid 82592 tid 82667 thread 5 bound to OS proc set {40}
OMP: pid 82592 tid 82664 thread 2 bound to OS proc set {16}
OMP: pid 82592 tid 82663 thread 1 bound to OS proc set {8}
OMP: pid 82592 tid 82666 thread 4 bound to OS proc set {32}
OMP: pid 82592 tid 82665 thread 3 bound to OS proc set {24}
OMP: pid 82592 tid 82668 thread 6 bound to OS proc set {48}
OMP: pid 82592 tid 82669 thread 7 bound to OS proc set {56}
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.5-dev
LLNL-CODE-775068
Copyright (c) 2014-25, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/tools/mpi/openmpi-armclang-22.1/bin/mpic++
Compiler Flags: "-O3 -mcpu=native -O3 -mcpu=neoverse-v1 `pkg-config armpl-dynamic-lp64-seq --cflags --libs` -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 8 threads on rank 0
0-> 0 1-> 8 2-> 16 3-> 24 4-> 32 5-> 40 6-> 48 7-> 56
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 1
Spatial decomp: 1 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 1 1 / 1
(Rx,Ry,Rz) R in XYZ: 1x1x1 1x1x1 / 1x1x1
(PQR) TOTAL: 1 16 / 16
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.05161
LPlusTimes 10 11.53767
LTimes 10 12.66354
Population 10 0.62870
Scattering 10 322.49948
Solve 1 360.20517
Source 10 0.02540
SweepSolver 10 9.26777
SweepSubdomain 160 8.85780
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.051608,11.537667,12.663535,0.628702,322.499480,360.205172,0.025399,9.267768,8.857795
Figures of Merit
================
Throughput: 1.676766e+07 [unknowns/(second/iteration)]
Grind time : 5.963861e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 95.57636 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END
* [MAQAO] Info: Detected 1 Lprof instances in ip-172-31-38-240.ec2.internal.
If this is incorrect, rerun with number-processes-per-node=X
OMP: pid 82843 tid 82843 thread 0 bound to OS proc set {0}
OMP: pid 82843 tid 82925 thread 12 bound to OS proc set {48}
OMP: pid 82843 tid 82916 thread 3 bound to OS proc set {12}
OMP: pid 82843 tid 82922 thread 9 bound to OS proc set {36}
OMP: pid 82843 tid 82924 thread 11 bound to OS proc set {44}
OMP: pid 82843 tid 82923 thread 10 bound to OS proc set {40}
OMP: pid 82843 tid 82915 thread 2 bound to OS proc set {8}
OMP: pid 82843 tid 82921 thread 8 bound to OS proc set {32}
OMP: pid 82843 tid 82920 thread 7 bound to OS proc set {28}
OMP: pid 82843 tid 82917 thread 4 bound to OS proc set {16}
OMP: pid 82843 tid 82914 thread 1 bound to OS proc set {4}
OMP: pid 82843 tid 82926 thread 13 bound to OS proc set {52}
OMP: pid 82843 tid 82927 thread 14 bound to OS proc set {56}
OMP: pid 82843 tid 82918 thread 5 bound to OS proc set {20}
OMP: pid 82843 tid 82919 thread 6 bound to OS proc set {24}
OMP: pid 82843 tid 82928 thread 15 bound to OS proc set {60}
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.5-dev
LLNL-CODE-775068
Copyright (c) 2014-25, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/tools/mpi/openmpi-armclang-22.1/bin/mpic++
Compiler Flags: "-O3 -mcpu=native -O3 -mcpu=neoverse-v1 `pkg-config armpl-dynamic-lp64-seq --cflags --libs` -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 16 threads on rank 0
0-> 0 1-> 4 2-> 8 3-> 12 4-> 16 5-> 20 6-> 24 7-> 28
8-> 32 9-> 36 10-> 40 11-> 44 12-> 48 13-> 52 14-> 56 15-> 60
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 1
Spatial decomp: 1 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 1 1 / 1
(Rx,Ry,Rz) R in XYZ: 1x1x1 1x1x1 / 1x1x1
(PQR) TOTAL: 1 16 / 16
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.05122
LPlusTimes 10 5.91044
LTimes 10 7.13135
Population 10 0.32650
Scattering 10 162.55639
Solve 1 184.36280
Source 10 0.01279
SweepSolver 10 4.84578
SweepSubdomain 160 4.43555
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.051216,5.910438,7.131350,0.326502,162.556395,184.362800,0.012788,4.845781,4.435554
Figures of Merit
================
Throughput: 3.276039e+07 [unknowns/(second/iteration)]
Grind time : 3.052466e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 91.53434 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END
* [MAQAO] Info: Detected 1 Lprof instances in ip-172-31-38-240.ec2.internal.
If this is incorrect, rerun with number-processes-per-node=X
OMP: pid 83049 tid 83049 thread 0 bound to OS proc set {0}
OMP: pid 83049 tid 83135 thread 16 bound to OS proc set {43}
OMP: pid 83049 tid 83134 thread 15 bound to OS proc set {40}
OMP: pid 83049 tid 83133 thread 14 bound to OS proc set {37}
OMP: pid 83049 tid 83122 thread 3 bound to OS proc set {8}
OMP: pid 83049 tid 83130 thread 11 bound to OS proc set {29}
OMP: pid 83049 tid 83127 thread 8 bound to OS proc set {21}
OMP: pid 83049 tid 83136 thread 17 bound to OS proc set {46}
OMP: pid 83049 tid 83121 thread 2 bound to OS proc set {5}
OMP: pid 83049 tid 83120 thread 1 bound to OS proc set {2}
OMP: pid 83049 tid 83129 thread 10 bound to OS proc set {27}
OMP: pid 83049 tid 83123 thread 4 bound to OS proc set {10}
OMP: pid 83049 tid 83137 thread 18 bound to OS proc set {48}
OMP: pid 83049 tid 83131 thread 12 bound to OS proc set {32}
OMP: pid 83049 tid 83132 thread 13 bound to OS proc set {35}
OMP: pid 83049 tid 83139 thread 20 bound to OS proc set {54}
OMP: pid 83049 tid 83138 thread 19 bound to OS proc set {51}
OMP: pid 83049 tid 83126 thread 7 bound to OS proc set {18}
OMP: pid 83049 tid 83128 thread 9 bound to OS proc set {24}
OMP: pid 83049 tid 83140 thread 21 bound to OS proc set {56}
OMP: pid 83049 tid 83141 thread 22 bound to OS proc set {59}
OMP: pid 83049 tid 83124 thread 5 bound to OS proc set {13}
OMP: pid 83049 tid 83125 thread 6 bound to OS proc set {16}
OMP: pid 83049 tid 83142 thread 23 bound to OS proc set {62}
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.5-dev
LLNL-CODE-775068
Copyright (c) 2014-25, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/tools/mpi/openmpi-armclang-22.1/bin/mpic++
Compiler Flags: "-O3 -mcpu=native -O3 -mcpu=neoverse-v1 `pkg-config armpl-dynamic-lp64-seq --cflags --libs` -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 24 threads on rank 0
0-> 0 1-> 2 2-> 5 3-> 8 4-> 10 5-> 13 6-> 16 7-> 18
8-> 21 9-> 24 10-> 27 11-> 29 12-> 32 13-> 35 14-> 37 15-> 40
16-> 43 17-> 46 18-> 48 19-> 51 20-> 54 21-> 56 22-> 59 23-> 62
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 1
Spatial decomp: 1 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 1 1 / 1
(Rx,Ry,Rz) R in XYZ: 1x1x1 1x1x1 / 1x1x1
(PQR) TOTAL: 1 16 / 16
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.05081
LPlusTimes 10 3.89081
LTimes 10 6.35187
Population 10 0.22861
Scattering 10 109.86788
Solve 1 127.31083
Source 10 0.00859
SweepSolver 10 3.37083
SweepSubdomain 160 2.96010
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.050808,3.890808,6.351866,0.228606,109.867875,127.310834,0.008592,3.370826,2.960099
Figures of Merit
================
Throughput: 4.744135e+07 [unknowns/(second/iteration)]
Grind time : 2.107866e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 87.81524 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END
* [MAQAO] Info: Detected 1 Lprof instances in ip-172-31-38-240.ec2.internal.
If this is incorrect, rerun with number-processes-per-node=X
OMP: pid 83214 tid 83214 thread 0 bound to OS proc set {0}
OMP: pid 83214 tid 83312 thread 28 bound to OS proc set {56}
OMP: pid 83214 tid 83300 thread 16 bound to OS proc set {32}
OMP: pid 83214 tid 83299 thread 15 bound to OS proc set {30}
OMP: pid 83214 tid 83302 thread 18 bound to OS proc set {36}
OMP: pid 83214 tid 83292 thread 8 bound to OS proc set {16}
OMP: pid 83214 tid 83303 thread 19 bound to OS proc set {38}
OMP: pid 83214 tid 83287 thread 3 bound to OS proc set {6}
OMP: pid 83214 tid 83288 thread 4 bound to OS proc set {8}
OMP: pid 83214 tid 83286 thread 2 bound to OS proc set {4}
OMP: pid 83214 tid 83285 thread 1 bound to OS proc set {2}
OMP: pid 83214 tid 83298 thread 14 bound to OS proc set {28}
OMP: pid 83214 tid 83295 thread 11 bound to OS proc set {22}
OMP: pid 83214 tid 83304 thread 20 bound to OS proc set {40}
OMP: pid 83214 tid 83301 thread 17 bound to OS proc set {34}
OMP: pid 83214 tid 83309 thread 25 bound to OS proc set {50}
OMP: pid 83214 tid 83291 thread 7 bound to OS proc set {14}
OMP: pid 83214 tid 83311 thread 27 bound to OS proc set {54}
OMP: pid 83214 tid 83294 thread 10 bound to OS proc set {20}
OMP: pid 83214 tid 83310 thread 26 bound to OS proc set {52}
OMP: pid 83214 tid 83308 thread 24 bound to OS proc set {48}
OMP: pid 83214 tid 83306 thread 22 bound to OS proc set {44}
OMP: pid 83214 tid 83297 thread 13 bound to OS proc set {26}
OMP: pid 83214 tid 83290 thread 6 bound to OS proc set {12}
OMP: pid 83214 tid 83296 thread 12 bound to OS proc set {24}
OMP: pid 83214 tid 83293 thread 9 bound to OS proc set {18}
OMP: pid 83214 tid 83307 thread 23 bound to OS proc set {46}
OMP: pid 83214 tid 83314 thread 30 bound to OS proc set {60}
OMP: pid 83214 tid 83289 thread 5 bound to OS proc set {10}
OMP: pid 83214 tid 83313 thread 29 bound to OS proc set {58}
OMP: pid 83214 tid 83305 thread 21 bound to OS proc set {42}
OMP: pid 83214 tid 83315 thread 31 bound to OS proc set {62}
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.5-dev
LLNL-CODE-775068
Copyright (c) 2014-25, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/tools/mpi/openmpi-armclang-22.1/bin/mpic++
Compiler Flags: "-O3 -mcpu=native -O3 -mcpu=neoverse-v1 `pkg-config armpl-dynamic-lp64-seq --cflags --libs` -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 32 threads on rank 0
0-> 0 1-> 2 2-> 4 3-> 6 4-> 8 5-> 10 6-> 12 7-> 14
8-> 16 9-> 18 10-> 20 11-> 22 12-> 24 13-> 26 14-> 28 15-> 30
16-> 32 17-> 34 18-> 36 19-> 38 20-> 40 21-> 42 22-> 44 23-> 46
24-> 48 25-> 50 26-> 52 27-> 54 28-> 56 29-> 58 30-> 60 31-> 62
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 1
Spatial decomp: 1 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 1 1 / 1
(Rx,Ry,Rz) R in XYZ: 1x1x1 1x1x1 / 1x1x1
(PQR) TOTAL: 1 16 / 16
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.05100
LPlusTimes 10 3.13494
LTimes 10 5.95706
Population 10 0.21136
Scattering 10 83.01427
Solve 1 98.53927
Source 10 0.00650
SweepSolver 10 2.63537
SweepSubdomain 160 2.22467
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.050996,3.134935,5.957062,0.211363,83.014270,98.539267,0.006499,2.635369,2.224673
Figures of Merit
================
Throughput: 6.129331e+07 [unknowns/(second/iteration)]
Grind time : 1.631499e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 84.41602 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END
* [MAQAO] Info: Detected 1 Lprof instances in ip-172-31-38-240.ec2.internal.
If this is incorrect, rerun with number-processes-per-node=X
OMP: pid 83385 tid 83385 thread 0 bound to OS proc set {0}
OMP: pid 83385 tid 83487 thread 32 bound to OS proc set {52}
OMP: pid 83385 tid 83491 thread 36 bound to OS proc set {58}
OMP: pid 83385 tid 83470 thread 15 bound to OS proc set {24}
OMP: pid 83385 tid 83468 thread 13 bound to OS proc set {21}
OMP: pid 83385 tid 83458 thread 3 bound to OS proc set {4}
OMP: pid 83385 tid 83474 thread 19 bound to OS proc set {30}
OMP: pid 83385 tid 83479 thread 24 bound to OS proc set {39}
OMP: pid 83385 tid 83469 thread 14 bound to OS proc set {22}
OMP: pid 83385 tid 83483 thread 28 bound to OS proc set {45}
OMP: pid 83385 tid 83463 thread 8 bound to OS proc set {13}
OMP: pid 83385 tid 83467 thread 12 bound to OS proc set {19}
OMP: pid 83385 tid 83473 thread 18 bound to OS proc set {29}
OMP: pid 83385 tid 83461 thread 6 bound to OS proc set {9}
OMP: pid 83385 tid 83475 thread 20 bound to OS proc set {32}
OMP: pid 83385 tid 83471 thread 16 bound to OS proc set {26}
OMP: pid 83385 tid 83457 thread 2 bound to OS proc set {3}
OMP: pid 83385 tid 83466 thread 11 bound to OS proc set {17}
OMP: pid 83385 tid 83459 thread 4 bound to OS proc set {6}
OMP: pid 83385 tid 83465 thread 10 bound to OS proc set {16}
OMP: pid 83385 tid 83485 thread 30 bound to OS proc set {48}
OMP: pid 83385 tid 83484 thread 29 bound to OS proc set {47}
OMP: pid 83385 tid 83460 thread 5 bound to OS proc set {8}
OMP: pid 83385 tid 83464 thread 9 bound to OS proc set {14}
OMP: pid 83385 tid 83478 thread 23 bound to OS proc set {37}
OMP: pid 83385 tid 83490 thread 35 bound to OS proc set {56}
OMP: pid 83385 tid 83482 thread 27 bound to OS proc set {43}
OMP: pid 83385 tid 83472 thread 17 bound to OS proc set {27}
OMP: pid 83385 tid 83489 thread 34 bound to OS proc set {55}
OMP: pid 83385 tid 83488 thread 33 bound to OS proc set {53}
OMP: pid 83385 tid 83486 thread 31 bound to OS proc set {50}
OMP: pid 83385 tid 83481 thread 26 bound to OS proc set {42}
OMP: pid 83385 tid 83456 thread 1 bound to OS proc set {1}
OMP: pid 83385 tid 83477 thread 22 bound to OS proc set {35}
OMP: pid 83385 tid 83476 thread 21 bound to OS proc set {34}
OMP: pid 83385 tid 83462 thread 7 bound to OS proc set {11}
OMP: pid 83385 tid 83493 thread 38 bound to OS proc set {61}
OMP: pid 83385 tid 83492 thread 37 bound to OS proc set {60}
OMP: pid 83385 tid 83480 thread 25 bound to OS proc set {40}
OMP: pid 83385 tid 83494 thread 39 bound to OS proc set {63}
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.5-dev
LLNL-CODE-775068
Copyright (c) 2014-25, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/tools/mpi/openmpi-armclang-22.1/bin/mpic++
Compiler Flags: "-O3 -mcpu=native -O3 -mcpu=neoverse-v1 `pkg-config armpl-dynamic-lp64-seq --cflags --libs` -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 40 threads on rank 0
0-> 0 1-> 1 2-> 3 3-> 4 4-> 6 5-> 8 6-> 9 7-> 11
8-> 13 9-> 14 10-> 16 11-> 17 12-> 19 13-> 21 14-> 22 15-> 24
16-> 26 17-> 27 18-> 29 19-> 30 20-> 32 21-> 34 22-> 35 23-> 37
24-> 39 25-> 40 26-> 42 27-> 43 28-> 45 29-> 47 30-> 48 31-> 50
32-> 52 33-> 53 34-> 55 35-> 56 36-> 58 37-> 60 38-> 61 39-> 63
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 1
Spatial decomp: 1 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 1 1 / 1
(Rx,Ry,Rz) R in XYZ: 1x1x1 1x1x1 / 1x1x1
(PQR) TOTAL: 1 16 / 16
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.05177
LPlusTimes 10 3.71823
LTimes 10 3.05409
Population 10 0.18996
Scattering 10 66.37650
Solve 1 79.12460
Source 10 0.00524
SweepSolver 10 2.19773
SweepSubdomain 160 1.78747
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.051773,3.718226,3.054091,0.189965,66.376499,79.124602,0.005241,2.197730,1.787470
Figures of Merit
================
Throughput: 7.633274e+07 [unknowns/(second/iteration)]
Grind time : 1.310054e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 81.33256 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END
* [MAQAO] Info: Detected 1 Lprof instances in ip-172-31-38-240.ec2.internal.
If this is incorrect, rerun with number-processes-per-node=X
OMP: pid 83563 tid 83563 thread 0 bound to OS proc set {0}
OMP: pid 83563 tid 83645 thread 12 bound to OS proc set {16}
OMP: pid 83563 tid 83665 thread 32 bound to OS proc set {43}
OMP: pid 83563 tid 83634 thread 1 bound to OS proc set {1}
OMP: pid 83563 tid 83657 thread 24 bound to OS proc set {32}
OMP: pid 83563 tid 83646 thread 13 bound to OS proc set {17}
OMP: pid 83563 tid 83640 thread 7 bound to OS proc set {9}
OMP: pid 83563 tid 83636 thread 3 bound to OS proc set {4}
OMP: pid 83563 tid 83635 thread 2 bound to OS proc set {2}
OMP: pid 83563 tid 83666 thread 33 bound to OS proc set {44}
OMP: pid 83563 tid 83641 thread 8 bound to OS proc set {10}
OMP: pid 83563 tid 83637 thread 4 bound to OS proc set {5}
OMP: pid 83563 tid 83649 thread 16 bound to OS proc set {21}
OMP: pid 83563 tid 83651 thread 18 bound to OS proc set {24}
OMP: pid 83563 tid 83643 thread 10 bound to OS proc set {13}
OMP: pid 83563 tid 83644 thread 11 bound to OS proc set {14}
OMP: pid 83563 tid 83668 thread 35 bound to OS proc set {47}
OMP: pid 83563 tid 83661 thread 28 bound to OS proc set {37}
OMP: pid 83563 tid 83642 thread 9 bound to OS proc set {12}
OMP: pid 83563 tid 83677 thread 44 bound to OS proc set {59}
OMP: pid 83563 tid 83659 thread 26 bound to OS proc set {35}
OMP: pid 83563 tid 83653 thread 20 bound to OS proc set {27}
OMP: pid 83563 tid 83638 thread 5 bound to OS proc set {6}
OMP: pid 83563 tid 83660 thread 27 bound to OS proc set {36}
OMP: pid 83563 tid 83648 thread 15 bound to OS proc set {20}
OMP: pid 83563 tid 83652 thread 19 bound to OS proc set {25}
OMP: pid 83563 tid 83664 thread 31 bound to OS proc set {41}
OMP: pid 83563 tid 83678 thread 45 bound to OS proc set {60}
OMP: pid 83563 tid 83670 thread 37 bound to OS proc set {50}
OMP: pid 83563 tid 83647 thread 14 bound to OS proc set {18}
OMP: pid 83563 tid 83667 thread 34 bound to OS proc set {46}
OMP: pid 83563 tid 83663 thread 30 bound to OS proc set {40}
OMP: pid 83563 tid 83669 thread 36 bound to OS proc set {48}
OMP: pid 83563 tid 83639 thread 6 bound to OS proc set {8}
OMP: pid 83563 tid 83674 thread 41 bound to OS proc set {55}
OMP: pid 83563 tid 83673 thread 40 bound to OS proc set {54}
OMP: pid 83563 tid 83680 thread 47 bound to OS proc set {63}
OMP: pid 83563 tid 83676 thread 43 bound to OS proc set {58}
OMP: pid 83563 tid 83671 thread 38 bound to OS proc set {51}
OMP: pid 83563 tid 83662 thread 29 bound to OS proc set {39}
OMP: pid 83563 tid 83672 thread 39 bound to OS proc set {52}
OMP: pid 83563 tid 83654 thread 21 bound to OS proc set {28}
OMP: pid 83563 tid 83655 thread 22 bound to OS proc set {29}
OMP: pid 83563 tid 83675 thread 42 bound to OS proc set {56}
OMP: pid 83563 tid 83656 thread 23 bound to OS proc set {31}
OMP: pid 83563 tid 83679 thread 46 bound to OS proc set {62}
OMP: pid 83563 tid 83650 thread 17 bound to OS proc set {23}
OMP: pid 83563 tid 83658 thread 25 bound to OS proc set {33}
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.5-dev
LLNL-CODE-775068
Copyright (c) 2014-25, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/tools/mpi/openmpi-armclang-22.1/bin/mpic++
Compiler Flags: "-O3 -mcpu=native -O3 -mcpu=neoverse-v1 `pkg-config armpl-dynamic-lp64-seq --cflags --libs` -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 48 threads on rank 0
0-> 0 1-> 1 2-> 2 3-> 4 4-> 5 5-> 6 6-> 8 7-> 9
8-> 10 9-> 12 10-> 13 11-> 14 12-> 16 13-> 17 14-> 18 15-> 20
16-> 21 17-> 23 18-> 24 19-> 25 20-> 27 21-> 28 22-> 29 23-> 31
24-> 32 25-> 33 26-> 35 27-> 36 28-> 37 29-> 39 30-> 40 31-> 41
32-> 43 33-> 44 34-> 46 35-> 47 36-> 48 37-> 50 38-> 51 39-> 52
40-> 54 41-> 55 42-> 56 43-> 58 44-> 59 45-> 60 46-> 62 47-> 63
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 1
Spatial decomp: 1 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 1 1 / 1
(Rx,Ry,Rz) R in XYZ: 1x1x1 1x1x1 / 1x1x1
(PQR) TOTAL: 1 16 / 16
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.05139
LPlusTimes 10 2.22501
LTimes 10 4.53467
Population 10 0.17966
Scattering 10 55.47249
Solve 1 67.90266
Source 10 0.00442
SweepSolver 10 1.89900
SweepSubdomain 160 1.48716
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.051389,2.225009,4.534670,0.179656,55.472495,67.902660,0.004419,1.898998,1.487159
Figures of Merit
================
Throughput: 8.894788e+07 [unknowns/(second/iteration)]
Grind time : 1.124254e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 78.31281 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END
* [MAQAO] Info: Detected 1 Lprof instances in ip-172-31-38-240.ec2.internal.
If this is incorrect, rerun with number-processes-per-node=X
OMP: pid 83753 tid 83753 thread 0 bound to OS proc set {0}
OMP: pid 83753 tid 83871 thread 48 bound to OS proc set {55}
OMP: pid 83753 tid 83839 thread 16 bound to OS proc set {18}
OMP: pid 83753 tid 83837 thread 14 bound to OS proc set {16}
OMP: pid 83753 tid 83838 thread 15 bound to OS proc set {17}
OMP: pid 83753 tid 83841 thread 18 bound to OS proc set {20}
OMP: pid 83753 tid 83835 thread 12 bound to OS proc set {13}
OMP: pid 83753 tid 83826 thread 3 bound to OS proc set {3}
OMP: pid 83753 tid 83831 thread 8 bound to OS proc set {9}
OMP: pid 83753 tid 83876 thread 53 bound to OS proc set {61}
OMP: pid 83753 tid 83825 thread 2 bound to OS proc set {2}
OMP: pid 83753 tid 83830 thread 7 bound to OS proc set {8}
OMP: pid 83753 tid 83842 thread 19 bound to OS proc set {22}
OMP: pid 83753 tid 83863 thread 40 bound to OS proc set {46}
OMP: pid 83753 tid 83874 thread 51 bound to OS proc set {59}
OMP: pid 83753 tid 83870 thread 47 bound to OS proc set {54}
OMP: pid 83753 tid 83836 thread 13 bound to OS proc set {15}
OMP: pid 83753 tid 83840 thread 17 bound to OS proc set {19}
OMP: pid 83753 tid 83856 thread 33 bound to OS proc set {38}
OMP: pid 83753 tid 83827 thread 4 bound to OS proc set {4}
OMP: pid 83753 tid 83855 thread 32 bound to OS proc set {37}
OMP: pid 83753 tid 83832 thread 9 bound to OS proc set {10}
OMP: pid 83753 tid 83858 thread 35 bound to OS proc set {40}
OMP: pid 83753 tid 83834 thread 11 bound to OS proc set {12}
OMP: pid 83753 tid 83850 thread 27 bound to OS proc set {31}
OMP: pid 83753 tid 83847 thread 24 bound to OS proc set {27}
OMP: pid 83753 tid 83857 thread 34 bound to OS proc set {39}
OMP: pid 83753 tid 83872 thread 49 bound to OS proc set {56}
OMP: pid 83753 tid 83873 thread 50 bound to OS proc set {58}
OMP: pid 83753 tid 83824 thread 1 bound to OS proc set {1}
OMP: pid 83753 tid 83833 thread 10 bound to OS proc set {11}
OMP: pid 83753 tid 83849 thread 26 bound to OS proc set {30}
OMP: pid 83753 tid 83851 thread 28 bound to OS proc set {32}
OMP: pid 83753 tid 83878 thread 55 bound to OS proc set {63}
OMP: pid 83753 tid 83854 thread 31 bound to OS proc set {35}
OMP: pid 83753 tid 83868 thread 45 bound to OS proc set {52}
OMP: pid 83753 tid 83859 thread 36 bound to OS proc set {41}
OMP: pid 83753 tid 83877 thread 54 bound to OS proc set {62}
OMP: pid 83753 tid 83861 thread 38 bound to OS proc set {44}
OMP: pid 83753 tid 83852 thread 29 bound to OS proc set {33}
OMP: pid 83753 tid 83869 thread 46 bound to OS proc set {53}
OMP: pid 83753 tid 83843 thread 20 bound to OS proc set {23}
OMP: pid 83753 tid 83864 thread 41 bound to OS proc set {47}
OMP: pid 83753 tid 83875 thread 52 bound to OS proc set {60}
OMP: pid 83753 tid 83829 thread 6 bound to OS proc set {6}
OMP: pid 83753 tid 83845 thread 22 bound to OS proc set {25}
OMP: pid 83753 tid 83862 thread 39 bound to OS proc set {45}
OMP: pid 83753 tid 83853 thread 30 bound to OS proc set {34}
OMP: pid 83753 tid 83867 thread 44 bound to OS proc set {51}
OMP: pid 83753 tid 83828 thread 5 bound to OS proc set {5}
OMP: pid 83753 tid 83860 thread 37 bound to OS proc set {42}
OMP: pid 83753 tid 83848 thread 25 bound to OS proc set {29}
OMP: pid 83753 tid 83865 thread 42 bound to OS proc set {48}
OMP: pid 83753 tid 83866 thread 43 bound to OS proc set {49}
OMP: pid 83753 tid 83846 thread 23 bound to OS proc set {26}
OMP: pid 83753 tid 83844 thread 21 bound to OS proc set {24}
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.5-dev
LLNL-CODE-775068
Copyright (c) 2014-25, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/tools/mpi/openmpi-armclang-22.1/bin/mpic++
Compiler Flags: "-O3 -mcpu=native -O3 -mcpu=neoverse-v1 `pkg-config armpl-dynamic-lp64-seq --cflags --libs` -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 56 threads on rank 0
0-> 0 1-> 1 2-> 2 3-> 3 4-> 4 5-> 5 6-> 6 7-> 8
8-> 9 9-> 10 10-> 11 11-> 12 12-> 13 13-> 15 14-> 16 15-> 17
16-> 18 17-> 19 18-> 20 19-> 22 20-> 23 21-> 24 22-> 25 23-> 26
24-> 27 25-> 29 26-> 30 27-> 31 28-> 32 29-> 33 30-> 34 31-> 35
32-> 37 33-> 38 34-> 39 35-> 40 36-> 41 37-> 42 38-> 44 39-> 45
40-> 46 41-> 47 42-> 48 43-> 49 44-> 51 45-> 52 46-> 53 47-> 54
48-> 55 49-> 56 50-> 58 51-> 59 52-> 60 53-> 61 54-> 62 55-> 63
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 1
Spatial decomp: 1 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 1 1 / 1
(Rx,Ry,Rz) R in XYZ: 1x1x1 1x1x1 / 1x1x1
(PQR) TOTAL: 1 16 / 16
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.05159
LPlusTimes 10 3.69075
LTimes 10 3.67315
Population 10 0.17611
Scattering 10 47.16586
Solve 1 59.98662
Source 10 0.00382
SweepSolver 10 1.69310
SweepSubdomain 160 1.28271
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.051591,3.690753,3.673149,0.176111,47.165862,59.986621,0.003821,1.693103,1.282714
Figures of Merit
================
Throughput: 1.006857e+08 [unknowns/(second/iteration)]
Grind time : 9.931892e-09 [(seconds/iteration)/unknowns]
Sweep efficiency : 75.76112 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END
* [MAQAO] Info: Detected 1 Lprof instances in ip-172-31-38-240.ec2.internal.
If this is incorrect, rerun with number-processes-per-node=X
OMP: pid 83949 tid 83949 thread 0 bound to OS proc set {0}
OMP: pid 83949 tid 84051 thread 32 bound to OS proc set {32}
OMP: pid 83949 tid 84069 thread 50 bound to OS proc set {50}
OMP: pid 83949 tid 84020 thread 1 bound to OS proc set {1}
OMP: pid 83949 tid 84067 thread 48 bound to OS proc set {48}
OMP: pid 83949 tid 84070 thread 51 bound to OS proc set {51}
OMP: pid 83949 tid 84031 thread 12 bound to OS proc set {12}
OMP: pid 83949 tid 84022 thread 3 bound to OS proc set {3}
OMP: pid 83949 tid 84023 thread 4 bound to OS proc set {4}
OMP: pid 83949 tid 84075 thread 56 bound to OS proc set {56}
OMP: pid 83949 tid 84043 thread 24 bound to OS proc set {24}
OMP: pid 83949 tid 84039 thread 20 bound to OS proc set {20}
OMP: pid 83949 tid 84063 thread 44 bound to OS proc set {44}
OMP: pid 83949 tid 84074 thread 55 bound to OS proc set {55}
OMP: pid 83949 tid 84021 thread 2 bound to OS proc set {2}
OMP: pid 83949 tid 84062 thread 43 bound to OS proc set {43}
OMP: pid 83949 tid 84078 thread 59 bound to OS proc set {59}
OMP: pid 83949 tid 84037 thread 18 bound to OS proc set {18}
OMP: pid 83949 tid 84050 thread 31 bound to OS proc set {31}
OMP: pid 83949 tid 84066 thread 47 bound to OS proc set {47}
OMP: pid 83949 tid 84079 thread 60 bound to OS proc set {60}
OMP: pid 83949 tid 84072 thread 53 bound to OS proc set {53}
OMP: pid 83949 tid 84030 thread 11 bound to OS proc set {11}
OMP: pid 83949 tid 84076 thread 57 bound to OS proc set {57}
OMP: pid 83949 tid 84034 thread 15 bound to OS proc set {15}
OMP: pid 83949 tid 84071 thread 52 bound to OS proc set {52}
OMP: pid 83949 tid 84029 thread 10 bound to OS proc set {10}
OMP: pid 83949 tid 84027 thread 8 bound to OS proc set {8}
OMP: pid 83949 tid 84036 thread 17 bound to OS proc set {17}
OMP: pid 83949 tid 84081 thread 62 bound to OS proc set {62}
OMP: pid 83949 tid 84054 thread 35 bound to OS proc set {35}
OMP: pid 83949 tid 84033 thread 14 bound to OS proc set {14}
OMP: pid 83949 tid 84038 thread 19 bound to OS proc set {19}
OMP: pid 83949 tid 84045 thread 26 bound to OS proc set {26}
OMP: pid 83949 tid 84052 thread 33 bound to OS proc set {33}
OMP: pid 83949 tid 84053 thread 34 bound to OS proc set {34}
OMP: pid 83949 tid 84068 thread 49 bound to OS proc set {49}
OMP: pid 83949 tid 84026 thread 7 bound to OS proc set {7}
OMP: pid 83949 tid 84046 thread 27 bound to OS proc set {27}
OMP: pid 83949 tid 84080 thread 61 bound to OS proc set {61}
OMP: pid 83949 tid 84055 thread 36 bound to OS proc set {36}
OMP: pid 83949 tid 84028 thread 9 bound to OS proc set {9}
OMP: pid 83949 tid 84041 thread 22 bound to OS proc set {22}
OMP: pid 83949 tid 84032 thread 13 bound to OS proc set {13}
OMP: pid 83949 tid 84049 thread 30 bound to OS proc set {30}
OMP: pid 83949 tid 84065 thread 46 bound to OS proc set {46}
OMP: pid 83949 tid 84040 thread 21 bound to OS proc set {21}
OMP: pid 83949 tid 84044 thread 25 bound to OS proc set {25}
OMP: pid 83949 tid 84082 thread 63 bound to OS proc set {63}
OMP: pid 83949 tid 84047 thread 28 bound to OS proc set {28}
OMP: pid 83949 tid 84024 thread 5 bound to OS proc set {5}
OMP: pid 83949 tid 84059 thread 40 bound to OS proc set {40}
OMP: pid 83949 tid 84025 thread 6 bound to OS proc set {6}
OMP: pid 83949 tid 84042 thread 23 bound to OS proc set {23}
OMP: pid 83949 tid 84073 thread 54 bound to OS proc set {54}
OMP: pid 83949 tid 84035 thread 16 bound to OS proc set {16}
OMP: pid 83949 tid 84061 thread 42 bound to OS proc set {42}
OMP: pid 83949 tid 84057 thread 38 bound to OS proc set {38}
OMP: pid 83949 tid 84056 thread 37 bound to OS proc set {37}
OMP: pid 83949 tid 84058 thread 39 bound to OS proc set {39}
OMP: pid 83949 tid 84077 thread 58 bound to OS proc set {58}
OMP: pid 83949 tid 84064 thread 45 bound to OS proc set {45}
OMP: pid 83949 tid 84060 thread 41 bound to OS proc set {41}
OMP: pid 83949 tid 84048 thread 29 bound to OS proc set {29}
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.5-dev
LLNL-CODE-775068
Copyright (c) 2014-25, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/tools/mpi/openmpi-armclang-22.1/bin/mpic++
Compiler Flags: "-O3 -mcpu=native -O3 -mcpu=neoverse-v1 `pkg-config armpl-dynamic-lp64-seq --cflags --libs` -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 64 threads on rank 0
0-> 0 1-> 1 2-> 2 3-> 3 4-> 4 5-> 5 6-> 6 7-> 7
8-> 8 9-> 9 10-> 10 11-> 11 12-> 12 13-> 13 14-> 14 15-> 15
16-> 16 17-> 17 18-> 18 19-> 19 20-> 20 21-> 21 22-> 22 23-> 23
24-> 24 25-> 25 26-> 26 27-> 27 28-> 28 29-> 29 30-> 30 31-> 31
32-> 32 33-> 33 34-> 34 35-> 35 36-> 36 37-> 37 38-> 38 39-> 39
40-> 40 41-> 41 42-> 42 43-> 43 44-> 44 45-> 45 46-> 46 47-> 47
48-> 48 49-> 49 50-> 50 51-> 51 52-> 52 53-> 53 54-> 54 55-> 55
56-> 56 57-> 57 58-> 58 59-> 59 60-> 60 61-> 61 62-> 62 63-> 63
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 1
Spatial decomp: 1 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 1 1 / 1
(Rx,Ry,Rz) R in XYZ: 1x1x1 1x1x1 / 1x1x1
(PQR) TOTAL: 1 16 / 16
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.05188
LPlusTimes 10 2.48399
LTimes 10 4.81768
Population 10 0.17607
Scattering 10 41.76738
Solve 1 54.37305
Source 10 0.00340
SweepSolver 10 1.53783
SweepSubdomain 160 1.12742
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.051877,2.483989,4.817677,0.176073,41.767385,54.373048,0.003395,1.537830,1.127420
Figures of Merit
================
Throughput: 1.110807e+08 [unknowns/(second/iteration)]
Grind time : 9.002462e-09 [(seconds/iteration)/unknowns]
Sweep efficiency : 73.31242 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END