* Info: Detected 1 Lprof instances in o401: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting
* Info: Selecting the 'perf-high-ppn' engine for node o401
* Info: Process launched (host o401, process 365534)-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 860160000 (elements), Offset = 0 (elements)
Memory per array = 6562.5 MiB (= 6.4 GiB).
Total memory required = 19687.5 MiB (= 19.2 GiB).
Each kernel will be executed 100 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 112
Number of Threads counted = 112
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 8872 microseconds.
(= 8872 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Best Rate MB/s Avg time Min time Max time
Copy: 1519862.0 0.009879 0.009055 0.013626
Scale: 1532573.0 0.009990 0.008980 0.020089
Add: 1547957.3 0.014675 0.013336 0.016606
Triad: 1509054.7 0.014772 0.013680 0.016723
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
* Info: Process finished (host o401, process 365534)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714180356/tools/lprof_npsu_run_0
To display your profiling results:
#############################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#############################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714180356/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714180356/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714180356/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714180356/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714180356/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714180356/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714180356/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-417-9108/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714180356/tools/lprof_npsu_run_0 #
#############################################################################################################################################################################################################