* Info: Detected 6 Lprof instances in isix02.benchmarkcenter.megware.com.
If this is incorrect, rerun with number-processes-per-node=X
[0] MPI startup(): Intel(R) MPI Library, Version 2021.14 Build 20240911 (id: b3fc682)
[0] MPI startup(): Copyright (C) 2003-2024 Intel Corporation. All rights reserved.
[0] MPI startup(): library kind: release
[0] MPI startup(): Load tuning file: "/cluster/intel/oneapi/2025.0.0/mpi/2021.14/opt/mpi/etc/tuning_gnr_shm.dat"
[0] MPI startup(): ===== CPU pinning =====
[0] MPI startup(): Rank Pid Node name Pin cpu
[0] MPI startup(): 0 52826 isix02.benchmarkcenter.megware.com {0-42,256-298}
[0] MPI startup(): 1 52821 isix02.benchmarkcenter.megware.com {43-85,299-341}
[0] MPI startup(): 2 52822 isix02.benchmarkcenter.megware.com {86-127,342-383}
[0] MPI startup(): 3 52816 isix02.benchmarkcenter.megware.com {128-170,384-426}
[0] MPI startup(): 4 52839 isix02.benchmarkcenter.megware.com {171-213,427-469}
[0] MPI startup(): 5 52817 isix02.benchmarkcenter.megware.com {214-255,470-511}
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /cluster/intel/oneapi/2025.0.0/mpi/2021.14/bin/mpiicpc
Compiler Flags: "-O3 -march=native -O3 -march=graniterapids -fno-tree-vectorize -fno-openmp-simd -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -cxx=g++ -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 42 threads on rank 0
0-> 0 1->257 2-> 3 3-> 4 4-> 5 5-> 6 6-> 7 7-> 8
8-> 9 9-> 10 10-> 11 11-> 12 12-> 13 13-> 14 14-> 15 15-> 16
16-> 17 17-> 18 18-> 19 19-> 20 20-> 21 21-> 22 22-> 23 23-> 24
24-> 25 25-> 26 26-> 27 27-> 28 28-> 29 29-> 30 30-> 31 31-> 32
32-> 33 33-> 34 34-> 35 35-> 36 36-> 37 37-> 38 38-> 39 39-> 40
40-> 41 41-> 42
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 8 (3072 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 6
Spatial decomp: 3 x 2 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 6 1 / 6
(Rx,Ry,Rz) R in XYZ: 3x2x1 1x1x1 / 3x2x1
(PQR) TOTAL: 6 16 / 96
Material Volumes=[1.125000e+04, 1.425000e+05, 2.726250e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 8 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 12582912 96.000
j_plane 18874368 144.000
k_plane 37748736 288.000
mixelem_to_fraction 3248 0.025
phi 78643200 600.000
phi_out 78643200 600.000
psi 301989888 2304.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 301989888 2304.000
sigt_zonal 3145728 24.000
volume 3072 0.023
-------- ------------ ---------
TOTAL 849358112 6480.088
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.537834e+09, change=1.000000e+00
iter 1: particle count=2.312865e+09, change=3.350955e-01
iter 2: particle count=2.700300e+09, change=1.434784e-01
iter 3: particle count=2.893534e+09, change=6.678156e-02
iter 4: particle count=2.989631e+09, change=3.214330e-02
iter 5: particle count=3.037241e+09, change=1.567557e-02
iter 6: particle count=3.060734e+09, change=7.675566e-03
iter 7: particle count=3.072281e+09, change=3.758278e-03
iter 8: particle count=3.077935e+09, change=1.837029e-03
iter 9: particle count=3.080695e+09, change=8.958400e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.04751
LPlusTimes 10 0.20606
LTimes 10 0.42818
Population 10 0.09196
Scattering 10 6.34054
Solve 1 8.89023
Source 10 0.00184
SweepSolver 10 1.18391
SweepSubdomain 160 0.20450
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.047509,0.206058,0.428185,0.091962,6.340536,8.890227,0.001838,1.183912,0.204497
Figures of Merit
================
Throughput: 3.396875e+08 [unknowns/(second/iteration)]
Grind time : 2.943882e-09 [(seconds/iteration)/unknowns]
Sweep efficiency : 17.27300 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 301989888
END
Info: 1/6 lprof instances finished
Your experiment path is /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0
To display your profiling results:
####################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
####################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0 #
####################################################################################################################################################################################################