options

Executable Output


* Info: Detected 1 Lprof instances in ip-172-31-68-94: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting

* Warning: perf-events measurements are not allowed on node ip-172-31-68-94: selecting the no-perf engine. Try:
sudo sysctl -w kernel.perf_event_paranoid=1 (*)
To persist across reboots:
 sudo sh -c 'echo kernel.perf_event_paranoid=1 >> /etc/sysctl.d/local.conf' (*)
(*) requires sudo permissions. If missing, contact administrators.
=1 allows both kernel+user-space measurements (=2: only user-space)

* Warning: The 'no-perf' engine is feature-limited and suffers higher overhead than other engines. It should be used only when perf-events are not available on the running Linux kernel - for instance with WSL1 (Windows Subsystem for Linux version 1) - or when the paranoid level (as displayed by 'sysctl kernel.perf_event_paranoid') cannot be lowered to 2 or less.

* Info: Process launched (host ip-172-31-68-94, process 606584)
   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.4

LLNL-CODE-775068

Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /opt/intel/oneapi/mpi/2021.10.0/bin/mpiicpc
  Compiler Flags:         "-O3 -march=native -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=clang++     -Wall -Wextra  "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 1 threads on rank 0
    0->  0

Input Parameters
================

  Problem Size:
    Zones:                 16 x 16 x 16  (4096 total)
    Groups:                64
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       1
    Spatial decomp:        1 x 1 x 1 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 32 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             1           1 / 1
  (Rx,Ry,Rz) R in XYZ:   1x1x1       1x1x1 / 1x1x1
  (PQR) TOTAL:           1           16 / 16

  Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                        61440        0.469
  dx                                  16        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                        1572864       12.000
  j_plane                        1572864       12.000
  k_plane                        1572864       12.000
  mixelem_to_fraction               4352        0.033
  phi                            6553600       50.000
  phi_out                        6553600       50.000
  psi                           25165824      192.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                           25165824      192.000
  sigt_zonal                      262144        2.000
  volume                            4096        0.031
  --------                  ------------    ---------
  TOTAL                         68494704      522.573

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=7.487488e+07, change=1.000000e+00
  iter 1: particle count=1.125855e+08, change=3.349511e-01
  iter 2: particle count=1.313924e+08, change=1.431351e-01
  iter 3: particle count=1.407381e+08, change=6.640521e-02
  iter 4: particle count=1.453680e+08, change=3.184924e-02
  iter 5: particle count=1.476542e+08, change=1.548355e-02
  iter 6: particle count=1.487795e+08, change=7.563193e-03
  iter 7: particle count=1.493316e+08, change=3.697158e-03
  iter 8: particle count=1.496017e+08, change=1.805479e-03
  iter 9: particle count=1.497334e+08, change=8.801810e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.00544
  LPlusTimes                  10       1.46414
  LTimes                      10       2.99898
  Population                  10       0.23423
  Scattering                  10       8.34545
  Solve                        1      16.11811
  Source                      10       0.00619
  SweepSolver                 10       2.79040
  SweepSubdomain             160       2.75493

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.005437,1.464143,2.998977,0.234227,8.345455,16.118114,0.006194,2.790400,2.754931

Figures of Merit
================

  Throughput:         1.561338e+07 [unknowns/(second/iteration)]
  Grind time :        6.404763e-08 [(seconds/iteration)/unknowns]
  Sweep efficiency :  98.72891 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 25165824

END

* Info: Process finished (host ip-172-31-68-94, process 606584)
* Warning: Restricted access to kernel symbols: 
to see kernel functions in profiling results, reprofile as root 
or execute sudo sysctl -w kernel.kptr_restrict=0.
To make kptr_restrict=0 persist across reboots:
 sudo sh -c "echo kernel.kptr_restrict=0 >> /etc/sysctl.d/local.conf"


Your experiment path is /home/kcamus/qaas_runs/169-770-1548/intel/Kripke/run/oneview_runs/orig/oneview_results_1697701791/tools/lprof_npsu_run_0

To display your profiling results:
####################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                      COMMAND                                                                      #
####################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/qaas_runs/169-770-1548/intel/Kripke/run/oneview_runs/orig/oneview_results_1697701791/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/qaas_runs/169-770-1548/intel/Kripke/run/oneview_runs/orig/oneview_results_1697701791/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/qaas_runs/169-770-1548/intel/Kripke/run/oneview_runs/orig/oneview_results_1697701791/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/qaas_runs/169-770-1548/intel/Kripke/run/oneview_runs/orig/oneview_results_1697701791/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/qaas_runs/169-770-1548/intel/Kripke/run/oneview_runs/orig/oneview_results_1697701791/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/qaas_runs/169-770-1548/intel/Kripke/run/oneview_runs/orig/oneview_results_1697701791/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/qaas_runs/169-770-1548/intel/Kripke/run/oneview_runs/orig/oneview_results_1697701791/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/qaas_runs/169-770-1548/intel/Kripke/run/oneview_runs/orig/oneview_results_1697701791/tools/lprof_npsu_run_0  #
####################################################################################################################################################################################

×