options

Executable Output


* Info: Detected 8 Lprof instances in gpu04sas.benchmarkcenter.megware.com. 
If this is incorrect, rerun with number-processes-per-node=X

* Info: Selecting the 'perf-high-ppn' engine for node gpu04sas.benchmarkcenter.megware.com

* Info: "ref-cycles" not supported on gpu04sas.benchmarkcenter.megware.com: fallback to "cpu-clock"
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 170255)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 170258)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 170257)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 170259)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 170262)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 170261)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 170260)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 170263)Running with these driver parameters:
  solver ID    = 1

  Laplacian_27pt:
    (Nx, Ny, Nz) = (400, 3200, 400)
    (Px, Py, Pz) = (1, 8, 1)

=============================================
Generate Matrix:
=============================================
Spatial Operator:
  wall clock time = 3.339041 seconds
  wall MFLOPS     = 0.000000
  cpu clock time  = 77.951675 seconds
  cpu MFLOPS      = 0.000000

  RHS vector has unit components
  Initial guess is 0
=============================================
IJ Vector Setup:
=============================================
RHS and Initial Guess:
  wall clock time = 0.377322 seconds
  wall MFLOPS     = 0.000000
  cpu clock time  = 5.932266 seconds
  cpu MFLOPS      = 0.000000

=============================================
Problem 1: AMG Setup Time:
=============================================
PCG Setup:
  wall clock time = 17.050122 seconds
  wall MFLOPS     = 0.000000
  cpu clock time  = 291.553719 seconds
  cpu MFLOPS      = 0.000000


FOM_Setup: nnz_AP / Setup Phase Time: 8.829849e+08

=============================================
Problem 1: AMG-PCG Solve Time:
=============================================
PCG Solve:
  wall clock time = 56.698744 seconds
  wall MFLOPS     = 0.000000
  cpu clock time  = 1344.512809 seconds
  cpu MFLOPS      = 0.000000


Iterations = 23
Final Relative Residual Norm = 9.722267e-09


FOM_Solve: nnz_AP * Iterations / Solve Phase Time: 6.107102e+09



Figure of Merit (FOM_1): 4.801073e+09


* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 170259)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 170260)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 170261)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 170258)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 170257)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 170262)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 170263)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 170255)

Your experiment path is /home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0

To display your profiling results:
######################################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                               COMMAND                                                                               #
######################################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0  #
######################################################################################################################################################################################################

×