* Info: Detected 8 Lprof instances in gmz10.benchmarkcenter.megware.com.
If this is incorrect, rerun with number-processes-per-node=X
[0] MPI startup(): Intel(R) MPI Library, Version 2021.14 Build 20240911 (id: b3fc682)
[0] MPI startup(): Copyright (C) 2003-2024 Intel Corporation. All rights reserved.
[0] MPI startup(): library kind: release
[0] MPI startup(): Load tuning file: "/cluster/intel/oneapi/2025.0.0/mpi/2021.14/opt/mpi/etc/tuning_generic_shm.dat"
[0] MPI startup(): ===== CPU pinning =====
[0] MPI startup(): Rank Pid Node name Pin cpu
[0] MPI startup(): 0 138789 gmz10.benchmarkcenter.megware.com {0}
[0] MPI startup(): 1 138794 gmz10.benchmarkcenter.megware.com {32}
[0] MPI startup(): 2 138816 gmz10.benchmarkcenter.megware.com {64}
[0] MPI startup(): 3 138795 gmz10.benchmarkcenter.megware.com {96}
[0] MPI startup(): 4 138801 gmz10.benchmarkcenter.megware.com {128}
[0] MPI startup(): 5 138804 gmz10.benchmarkcenter.megware.com {160}
[0] MPI startup(): 6 138799 gmz10.benchmarkcenter.megware.com {192}
[0] MPI startup(): 7 138800 gmz10.benchmarkcenter.megware.com {224}
Running with these driver parameters:
solver ID = 1
Laplacian_27pt:
(Nx, Ny, Nz) = (400, 3200, 400)
(Px, Py, Pz) = (1, 8, 1)
=============================================
Generate Matrix:
=============================================
Spatial Operator:
wall clock time = 2.315648 seconds
wall MFLOPS = 0.000000
cpu clock time = 63.212615 seconds
cpu MFLOPS = 0.000000
RHS vector has unit components
Initial guess is 0
=============================================
IJ Vector Setup:
=============================================
RHS and Initial Guess:
wall clock time = 0.150127 seconds
wall MFLOPS = 0.000000
cpu clock time = 1.260244 seconds
cpu MFLOPS = 0.000000
=============================================
Problem 1: AMG Setup Time:
=============================================
PCG Setup:
wall clock time = 12.998911 seconds
wall MFLOPS = 0.000000
cpu clock time = 267.184820 seconds
cpu MFLOPS = 0.000000
FOM_Setup: nnz_AP / Setup Phase Time: 1.158174e+09
=============================================
Problem 1: AMG-PCG Solve Time:
=============================================
PCG Solve:
wall clock time = 39.953309 seconds
wall MFLOPS = 0.000000
cpu clock time = 1261.361982 seconds
cpu MFLOPS = 0.000000
Iterations = 23
Final Relative Residual Norm = 9.722267e-09
FOM_Solve: nnz_AP * Iterations / Solve Phase Time: 8.666741e+09
Figure of Merit (FOM_1): 6.789600e+09
Your experiment path is /home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0
To display your profiling results:
##################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
##################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_ZEN5/173-989-6379/intel/AMG/run/oneview_runs/compilers/gcc_3/oneview_results_1739902763/tools/lprof_npsu_run_0 #
##################################################################################################################################################################################################