options

Executable Output


* Info: Detected 6 Lprof instances in isix02.benchmarkcenter.megware.com. 
If this is incorrect, rerun with number-processes-per-node=X
[0] MPI startup(): Intel(R) MPI Library, Version 2021.14  Build 20240911 (id: b3fc682)
[0] MPI startup(): Copyright (C) 2003-2024 Intel Corporation.  All rights reserved.
[0] MPI startup(): library kind: release
[0] MPI startup(): Load tuning file: "/cluster/intel/oneapi/2025.0.0/mpi/2021.14/opt/mpi/etc/tuning_gnr_shm.dat"
[0] MPI startup(): ===== CPU pinning =====
[0] MPI startup(): Rank    Pid      Node name                           Pin cpu
[0] MPI startup(): 0       52826    isix02.benchmarkcenter.megware.com  {0-42,256-298}
[0] MPI startup(): 1       52821    isix02.benchmarkcenter.megware.com  {43-85,299-341}
[0] MPI startup(): 2       52822    isix02.benchmarkcenter.megware.com  {86-127,342-383}
[0] MPI startup(): 3       52816    isix02.benchmarkcenter.megware.com  {128-170,384-426}
[0] MPI startup(): 4       52839    isix02.benchmarkcenter.megware.com  {171-213,427-469}
[0] MPI startup(): 5       52817    isix02.benchmarkcenter.megware.com  {214-255,470-511}

   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.4

LLNL-CODE-775068

Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /cluster/intel/oneapi/2025.0.0/mpi/2021.14/bin/mpiicpc
  Compiler Flags:         "-O3 -march=native -O3 -march=graniterapids -fno-tree-vectorize -fno-openmp-simd -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -cxx=g++     -Wall -Wextra  "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 42 threads on rank 0
    0->  0    1->257    2->  3    3->  4    4->  5    5->  6    6->  7    7->  8
    8->  9    9-> 10   10-> 11   11-> 12   12-> 13   13-> 14   14-> 15   15-> 16
   16-> 17   17-> 18   18-> 19   19-> 20   20-> 21   21-> 22   22-> 23   23-> 24
   24-> 25   25-> 26   26-> 27   27-> 28   28-> 29   29-> 30   30-> 31   31-> 32
   32-> 33   33-> 34   34-> 35   35-> 36   36-> 37   37-> 38   38-> 39   39-> 40
   40-> 41   41-> 42

Input Parameters
================

  Problem Size:
    Zones:                 24 x 16 x 8  (3072 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       6
    Spatial decomp:        3 x 2 x 1 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             6           1 / 6
  (Rx,Ry,Rz) R in XYZ:   3x2x1       1x1x1 / 3x2x1
  (PQR) TOTAL:           6           16 / 96

  Material Volumes=[1.125000e+04, 1.425000e+05, 2.726250e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  24        0.000
  dy                                  16        0.000
  dz                                   8        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       12582912       96.000
  j_plane                       18874368      144.000
  k_plane                       37748736      288.000
  mixelem_to_fraction               3248        0.025
  phi                           78643200      600.000
  phi_out                       78643200      600.000
  psi                          301989888     2304.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          301989888     2304.000
  sigt_zonal                     3145728       24.000
  volume                            3072        0.023
  --------                  ------------    ---------
  TOTAL                        849358112     6480.088

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.537834e+09, change=1.000000e+00
  iter 1: particle count=2.312865e+09, change=3.350955e-01
  iter 2: particle count=2.700300e+09, change=1.434784e-01
  iter 3: particle count=2.893534e+09, change=6.678156e-02
  iter 4: particle count=2.989631e+09, change=3.214330e-02
  iter 5: particle count=3.037241e+09, change=1.567557e-02
  iter 6: particle count=3.060734e+09, change=7.675566e-03
  iter 7: particle count=3.072281e+09, change=3.758278e-03
  iter 8: particle count=3.077935e+09, change=1.837029e-03
  iter 9: particle count=3.080695e+09, change=8.958400e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.04751
  LPlusTimes                  10       0.20606
  LTimes                      10       0.42818
  Population                  10       0.09196
  Scattering                  10       6.34054
  Solve                        1       8.89023
  Source                      10       0.00184
  SweepSolver                 10       1.18391
  SweepSubdomain             160       0.20450

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.047509,0.206058,0.428185,0.091962,6.340536,8.890227,0.001838,1.183912,0.204497

Figures of Merit
================

  Throughput:         3.396875e+08 [unknowns/(second/iteration)]
  Grind time :        2.943882e-09 [(seconds/iteration)/unknowns]
  Sweep efficiency :  17.27300 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 301989888

END


Info: 1/6 lprof instances finished


Your experiment path is /home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0

To display your profiling results:
####################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                              COMMAND                                                                              #
####################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_GNR/173-926-6874/intel/Kripke/run/oneview_runs/compilers/gcc_5/oneview_results_1739268825/tools/lprof_npsu_run_0  #
####################################################################################################################################################################################################

×