* Info: Detected 6 Lprof instances in isix02.benchmarkcenter.megware.com.
If this is incorrect, rerun with number-processes-per-node=X
[0] MPI startup(): Intel(R) MPI Library, Version 2021.14 Build 20240911 (id: b3fc682)
[0] MPI startup(): Copyright (C) 2003-2024 Intel Corporation. All rights reserved.
[0] MPI startup(): library kind: release
[0] MPI startup(): Load tuning file: "/cluster/intel/oneapi/2025.0.0/mpi/2021.14/opt/mpi/etc/tuning_gnr_shm.dat"
[0] MPI startup(): ===== CPU pinning =====
[0] MPI startup(): Rank Pid Node name Pin cpu
[0] MPI startup(): 0 113388 isix02.benchmarkcenter.megware.com {0-42,256-298}
[0] MPI startup(): 1 113399 isix02.benchmarkcenter.megware.com {43-85,299-341}
[0] MPI startup(): 2 113385 isix02.benchmarkcenter.megware.com {86-127,342-383}
[0] MPI startup(): 3 113380 isix02.benchmarkcenter.megware.com {128-170,384-426}
[0] MPI startup(): 4 113382 isix02.benchmarkcenter.megware.com {171-213,427-469}
[0] MPI startup(): 5 113386 isix02.benchmarkcenter.megware.com {214-255,470-511}
Running with these driver parameters:
solver ID = 1
Laplacian_27pt:
(Nx, Ny, Nz) = (400, 2400, 400)
(Px, Py, Pz) = (1, 6, 1)
=============================================
Generate Matrix:
=============================================
Spatial Operator:
wall clock time = 2.324441 seconds
wall MFLOPS = 0.000000
cpu clock time = 68.259055 seconds
cpu MFLOPS = 0.000000
RHS vector has unit components
Initial guess is 0
=============================================
IJ Vector Setup:
=============================================
RHS and Initial Guess:
wall clock time = 0.313949 seconds
wall MFLOPS = 0.000000
cpu clock time = 2.412117 seconds
cpu MFLOPS = 0.000000
=============================================
Problem 1: AMG Setup Time:
=============================================
PCG Setup:
wall clock time = 13.210436 seconds
wall MFLOPS = 0.000000
cpu clock time = 285.792040 seconds
cpu MFLOPS = 0.000000
FOM_Setup: nnz_AP / Setup Phase Time: 8.546566e+08
=============================================
Problem 1: AMG-PCG Solve Time:
=============================================
PCG Solve:
wall clock time = 31.202603 seconds
wall MFLOPS = 0.000000
cpu clock time = 1272.640927 seconds
cpu MFLOPS = 0.000000
Iterations = 23
Final Relative Residual Norm = 9.946644e-09
FOM_Solve: nnz_AP * Iterations / Solve Phase Time: 8.322347e+09
Figure of Merit (FOM_1): 6.455424e+09
Your experiment path is /home/eoseret/qaas_runs_GNR/173-927-0874/intel/AMG/run/oneview_runs/compilers/gcc_5/oneview_results_1739275792/tools/lprof_npsu_run_0
To display your profiling results:
#################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_GNR/173-927-0874/intel/AMG/run/oneview_runs/compilers/gcc_5/oneview_results_1739275792/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_GNR/173-927-0874/intel/AMG/run/oneview_runs/compilers/gcc_5/oneview_results_1739275792/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_GNR/173-927-0874/intel/AMG/run/oneview_runs/compilers/gcc_5/oneview_results_1739275792/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_GNR/173-927-0874/intel/AMG/run/oneview_runs/compilers/gcc_5/oneview_results_1739275792/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_GNR/173-927-0874/intel/AMG/run/oneview_runs/compilers/gcc_5/oneview_results_1739275792/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_GNR/173-927-0874/intel/AMG/run/oneview_runs/compilers/gcc_5/oneview_results_1739275792/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_GNR/173-927-0874/intel/AMG/run/oneview_runs/compilers/gcc_5/oneview_results_1739275792/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_GNR/173-927-0874/intel/AMG/run/oneview_runs/compilers/gcc_5/oneview_results_1739275792/tools/lprof_npsu_run_0 #
#################################################################################################################################################################################################