* Info: Detected 1 Lprof instances in o404: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting
* Info: Selecting the 'perf-high-ppn' engine for node o404
* Info: Process launched (host o404, process 117267)
-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 860160000 (elements), Offset = 0 (elements)
Memory per array = 6562.5 MiB (= 6.4 GiB).
Total memory required = 19687.5 MiB (= 19.2 GiB).
Each kernel will be executed 100 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
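The memory figures above follow directly from the array size and element width. As a rough cross-check (a minimal sketch, assuming the three arrays that the validation line further down mentions, and MiB = 2^20 bytes; the variable names are illustrative and not taken from STREAM):

```python
# Values copied from the STREAM banner above.
BYTES_PER_ELEMENT = 8          # "This system uses 8 bytes per array element."
ARRAY_ELEMENTS = 860_160_000   # "Array size = 860160000 (elements)"
NUM_ARRAYS = 3                 # assumption: three arrays, as in "all three arrays"

MIB = 2 ** 20                  # bytes per MiB
GIB = 2 ** 30                  # bytes per GiB

per_array_bytes = ARRAY_ELEMENTS * BYTES_PER_ELEMENT
total_bytes = NUM_ARRAYS * per_array_bytes

per_array_mib = per_array_bytes / MIB
total_mib = total_bytes / MIB
per_array_gib = per_array_bytes / GIB
total_gib = total_bytes / GIB

print(f"Memory per array      = {per_array_mib:.1f} MiB (= {per_array_gib:.1f} GiB)")
print(f"Total memory required = {total_mib:.1f} MiB (= {total_gib:.1f} GiB)")
```

Both printed values should agree with the banner, which suggests the run used the array size it reports.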
Number of Threads requested = 112
Number of Threads counted = 112
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 9899 microseconds.
(= 9899 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function    Best Rate MB/s    Avg time    Min time    Max time
Copy:            1058754.6    0.013764    0.012999    0.021361
Scale:           1079928.9    0.013479    0.012744    0.019283
Add:             1183506.8    0.018687    0.017443    0.021126
Triad:           1134029.3    0.019325    0.018204    0.028960
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
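The "Best Rate" column can be recomputed from the "Min time" column. The sketch below assumes STREAM's usual byte counting (Copy and Scale touch two arrays per iteration, Add and Triad touch three) and MB = 10^6 bytes; the function and dictionary names are illustrative only. Because the log prints the minimum time to six decimals, the recomputed rates match the reported ones only approximately.

```python
# Values copied from the STREAM output above.
BYTES_PER_ELEMENT = 8
ARRAY_ELEMENTS = 860_160_000

# Assumption: arrays read or written per kernel iteration (standard STREAM counting).
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

# (reported best rate in MB/s, min time in seconds) from the results table.
REPORTED = {
    "Copy":  (1058754.6, 0.012999),
    "Scale": (1079928.9, 0.012744),
    "Add":   (1183506.8, 0.017443),
    "Triad": (1134029.3, 0.018204),
}

def best_rate_mb_s(kernel: str, min_time_s: float) -> float:
    """Bytes moved by one kernel iteration divided by its best time, in MB/s."""
    bytes_moved = ARRAYS_TOUCHED[kernel] * BYTES_PER_ELEMENT * ARRAY_ELEMENTS
    return 1.0e-6 * bytes_moved / min_time_s

for kernel, (reported, min_time) in REPORTED.items():
    recomputed = best_rate_mb_s(kernel, min_time)
    rel_diff = abs(recomputed - reported) / reported
    print(f"{kernel:<6} reported={reported:>10.1f}  recomputed={recomputed:>10.1f}  "
          f"rel.diff={rel_diff:.1e}")
```

The relative differences stay well below 1e-4, which is consistent with rounding of the printed minimum times.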
* Info: Process finished (host o404, process 117267)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714153549/tools/lprof_npsu_run_0
To display your profiling results:
#############################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#############################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714153549/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714153549/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714153549/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714153549/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714153549/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714153549/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714153549/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/gcc_2/oneview_results_1714153549/tools/lprof_npsu_run_0 #
#############################################################################################################################################################################################################