* Info: Detected 1 Lprof instances in o401: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting
* Info: Selecting the 'perf-high-ppn' engine for node o401
* Info: Process launched (host o401, process 364985)-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 860160000 (elements), Offset = 0 (elements)
Memory per array = 6562.5 MiB (= 6.4 GiB).
Total memory required = 19687.5 MiB (= 19.2 GiB).
Each kernel will be executed 100 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 112
Number of Threads counted = 112
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 8936 microseconds.
(= 8936 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Best Rate MB/s Avg time Min time Max time
Copy: 1513533.5 0.009819 0.009093 0.010185
Scale: 1555794.7 0.009767 0.008846 0.010326
Add: 1491930.3 0.014753 0.013837 0.015408
Triad: 1496038.8 0.014681 0.013799 0.015394
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
* Info: Process finished (host o401, process 364985)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/icx_10/oneview_results_1714180334/tools/lprof_npsu_run_0
To display your profiling results:
##############################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
##############################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/icx_10/oneview_results_1714180334/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/icx_10/oneview_results_1714180334/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/icx_10/oneview_results_1714180334/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/icx_10/oneview_results_1714180334/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/icx_10/oneview_results_1714180334/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/icx_10/oneview_results_1714180334/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/icx_10/oneview_results_1714180334/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/icx_10/oneview_results_1714180334/tools/lprof_npsu_run_0 #
##############################################################################################################################################################################################################