options

Executable Output


* Info: Detected 8 Lprof instances in gmz10.benchmarkcenter.megware.com. 
If this is incorrect, rerun with number-processes-per-node=X
[0] MPI startup(): Intel(R) MPI Library, Version 2021.14  Build 20240911 (id: b3fc682)
[0] MPI startup(): Copyright (C) 2003-2024 Intel Corporation.  All rights reserved.
[0] MPI startup(): library kind: release
[0] MPI startup(): Load tuning file: "/cluster/intel/oneapi/2025.0.0/mpi/2021.14/opt/mpi/etc/tuning_generic_shm.dat"
[0] MPI startup(): ===== CPU pinning =====
[0] MPI startup(): Rank    Pid      Node name                          Pin cpu
[0] MPI startup(): 0       138789   gmz10.benchmarkcenter.megware.com  {0}
[0] MPI startup(): 1       138794   gmz10.benchmarkcenter.megware.com  {32}
[0] MPI startup(): 2       138816   gmz10.benchmarkcenter.megware.com  {64}
[0] MPI startup(): 3       138795   gmz10.benchmarkcenter.megware.com  {96}
[0] MPI startup(): 4       138801   gmz10.benchmarkcenter.megware.com  {128}
[0] MPI startup(): 5       138804   gmz10.benchmarkcenter.megware.com  {160}
[0] MPI startup(): 6       138799   gmz10.benchmarkcenter.megware.com  {192}
[0] MPI startup(): 7       138800   gmz10.benchmarkcenter.megware.com  {224}
Running with these driver parameters:
  solver ID    = 1

  Laplacian_27pt:
    (Nx, Ny, Nz) = (400, 3200, 400)
    (Px, Py, Pz) = (1, 8, 1)

=============================================
Generate Matrix:
=============================================
Spatial Operator:
  wall clock time = 2.315648 seconds
  wall MFLOPS     = 0.000000
  cpu clock time  = 63.212615 seconds
  cpu MFLOPS      = 0.000000

  RHS vector has unit components
  Initial guess is 0
=============================================
IJ Vector Setup:
=============================================
RHS and Initial Guess:
  wall clock time = 0.150127 seconds
  wall MFLOPS     = 0.000000
  cpu clock time  = 1.260244 seconds
  cpu MFLOPS      = 0.000000

=============================================
Problem 1: AMG Setup Time:
=============================================
PCG Setup:
  wall clock time = 12.998911 seconds
  wall MFLOPS     = 0.000000
  cpu clock time  = 267.184820 seconds
  cpu MFLOPS      = 0.000000


FOM_Setup: nnz_AP / Setup Phase Time: 1.158174e+09

=============================================
Problem 1: AMG-PCG Solve Time:
=============================================
PCG Solve:
  wall clock time = 39.953309 seconds
  wall MFLOPS     = 0.000000
  cpu clock time  = 1261.361982 seconds
  cpu MFLOPS      = 0.000000


Iterations = 23
Final Relative Residual Norm = 9.722267e-09


FOM_Solve: nnz_AP * Iterations / Solve Phase Time: 8.666741e+09



Figure of Merit (FOM_1): 6.789600e+09



Your experiment path is /home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0

To display your profiling results:
##################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                             COMMAND                                                                             #
##################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0  #
##################################################################################################################################################################################################

×