-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 860160000 (elements), Offset = 0 (elements)
Memory per array = 6562.5 MiB (= 6.4 GiB).
Total memory required = 19687.5 MiB (= 19.2 GiB).
Each kernel will be executed 100 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 6
Number of Threads counted = 6
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 67492 microseconds.
(= 67492 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Best Rate MB/s Avg time Min time Max time
Copy: 97281.9 0.142511 0.141471 0.143252
Scale: 95985.9 0.143705 0.143381 0.145043
Add: 108959.8 0.189864 0.189463 0.190552
Triad: 108820.8 0.190025 0.189705 0.190536
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
Your experiment path is /beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_0
To display your profiling results:
###############################################################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_0 #
###############################################################################################################################################################################################################################################
-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 860160000 (elements), Offset = 0 (elements)
Memory per array = 6562.5 MiB (= 6.4 GiB).
Total memory required = 19687.5 MiB (= 19.2 GiB).
Each kernel will be executed 100 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 96
Number of Threads counted = 96
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 15926 microseconds.
(= 15926 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Best Rate MB/s Avg time Min time Max time
Copy: 654055.9 0.021215 0.021042 0.021442
Scale: 655385.2 0.021119 0.020999 0.021418
Add: 894138.1 0.023224 0.023088 0.023766
Triad: 898582.8 0.023154 0.022974 0.023756
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
Your experiment path is /beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_1
To display your profiling results:
###############################################################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_1 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_1 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_1 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_1 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_1 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_1 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_1 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_1 #
###############################################################################################################################################################################################################################################
-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 860160000 (elements), Offset = 0 (elements)
Memory per array = 6562.5 MiB (= 6.4 GiB).
Total memory required = 19687.5 MiB (= 19.2 GiB).
Each kernel will be executed 100 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 120
Number of Threads counted = 120
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 15870 microseconds.
(= 15870 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Best Rate MB/s Avg time Min time Max time
Copy: 911125.6 0.015260 0.015105 0.015512
Scale: 906931.3 0.015291 0.015175 0.015606
Add: 917911.0 0.022814 0.022490 0.023330
Triad: 918573.1 0.022698 0.022474 0.023226
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
Your experiment path is /beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_2
To display your profiling results:
###############################################################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_2 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_2 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_2 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_2 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_2 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_2 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_2 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_2 #
###############################################################################################################################################################################################################################################
-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 860160000 (elements), Offset = 0 (elements)
Memory per array = 6562.5 MiB (= 6.4 GiB).
Total memory required = 19687.5 MiB (= 19.2 GiB).
Each kernel will be executed 100 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 126
Number of Threads counted = 126
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 14881 microseconds.
(= 14881 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Best Rate MB/s Avg time Min time Max time
Copy: 950195.2 0.014569 0.014484 0.015313
Scale: 942638.6 0.014676 0.014600 0.015141
Add: 960131.1 0.021594 0.021501 0.022318
Triad: 959514.0 0.021563 0.021515 0.021648
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
Your experiment path is /beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_3
To display your profiling results:
###############################################################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_3 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_3 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_3 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_3 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_3 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_3 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_3 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_3 #
###############################################################################################################################################################################################################################################
-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 860160000 (elements), Offset = 0 (elements)
Memory per array = 6562.5 MiB (= 6.4 GiB).
Total memory required = 19687.5 MiB (= 19.2 GiB).
Each kernel will be executed 100 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 144
Number of Threads counted = 144
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 16817 microseconds.
(= 16817 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Best Rate MB/s Avg time Min time Max time
Copy: 839480.5 0.016760 0.016394 0.017207
Scale: 833540.7 0.016898 0.016511 0.017309
Add: 845852.5 0.024782 0.024406 0.025229
Triad: 851994.9 0.024712 0.024230 0.025086
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
Your experiment path is /beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_4
To display your profiling results:
###############################################################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_4 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_4 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_4 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_4 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_4 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_4 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_4 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_4 #
###############################################################################################################################################################################################################################################
-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 860160000 (elements), Offset = 0 (elements)
Memory per array = 6562.5 MiB (= 6.4 GiB).
Total memory required = 19687.5 MiB (= 19.2 GiB).
Each kernel will be executed 100 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 168
Number of Threads counted = 168
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 15465 microseconds.
(= 15465 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Best Rate MB/s Avg time Min time Max time
Copy: 955337.5 0.014508 0.014406 0.014674
Scale: 953081.9 0.014529 0.014440 0.014844
Add: 977550.6 0.021195 0.021118 0.022000
Triad: 980051.2 0.021120 0.021064 0.021291
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
Your experiment path is /beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_5
To display your profiling results:
###############################################################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_5 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_5 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_5 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_5 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_5 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_5 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_5 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_5 #
###############################################################################################################################################################################################################################################
-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 860160000 (elements), Offset = 0 (elements)
Memory per array = 6562.5 MiB (= 6.4 GiB).
Total memory required = 19687.5 MiB (= 19.2 GiB).
Each kernel will be executed 100 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 192
Number of Threads counted = 192
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 19056 microseconds.
(= 19056 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Best Rate MB/s Avg time Min time Max time
Copy: 727749.5 0.019291 0.018911 0.019715
Scale: 721536.5 0.019395 0.019074 0.019723
Add: 750004.7 0.027816 0.027525 0.028109
Triad: 749894.3 0.027784 0.027529 0.028110
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
Your experiment path is /beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_6
To display your profiling results:
###############################################################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_6 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_6 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_6 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_6 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_6 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_6 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_6 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_6 #
###############################################################################################################################################################################################################################################
OMP: pid 40964 tid 0x1468e553c640 thread 196 bound to OS proc se-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 860160000 (elements), Offset = 0 (elements)
Memory per array = 6562.5 MiB (= 6.4 GiB).
Total memory required = 19687.5 MiB (= 19.2 GiB).
Each kernel will be executed 100 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 216
Number of Threads counted = 216
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 17333 microseconds.
(= 17333 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Best Rate MB/s Avg time Min time Max time
Copy: 802764.1 0.017517 0.017144 0.018661
Scale: 798942.0 0.017567 0.017226 0.017944
Add: 827083.5 0.025244 0.024960 0.025947
Triad: 828571.4 0.025204 0.024915 0.026496
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
Your experiment path is /beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_7
To display your profiling results:
###############################################################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_7 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_7 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_7 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_7 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_7 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_7 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_7 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_7 #
###############################################################################################################################################################################################################################################
OMP: pid 41429 tid 0x1527dc91f640 thread 225 bound to OS proc se-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 860160000 (elements), Offset = 0 (elements)
Memory per array = 6562.5 MiB (= 6.4 GiB).
Total memory required = 19687.5 MiB (= 19.2 GiB).
Each kernel will be executed 100 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 240
Number of Threads counted = 240
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 16280 microseconds.
(= 16280 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Best Rate MB/s Avg time Min time Max time
Copy: 905721.7 0.015505 0.015195 0.015760
Scale: 898223.9 0.015595 0.015322 0.018575
Add: 929359.2 0.022674 0.022213 0.036747
Triad: 932050.3 0.022416 0.022149 0.023031
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
Your experiment path is /beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_8
To display your profiling results:
###############################################################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_8 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_8 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_8 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_8 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_8 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_8 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_8 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_8 #
###############################################################################################################################################################################################################################################
-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 860160000 (elements), Offset = 0 (elements)
Memory per array = 6562.5 MiB (= 6.4 GiB).
Total memory required = 19687.5 MiB (= 19.2 GiB).
Each kernel will be executed 100 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 256
Number of Threads counted = 256
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 14931 microseconds.
(= 14931 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Best Rate MB/s Avg time Min time Max time
Copy: 966680.5 0.014549 0.014237 0.026187
Scale: 957397.4 0.014766 0.014375 0.034470
Add: 991055.5 0.021069 0.020830 0.028472
Triad: 994938.8 0.020886 0.020749 0.021585
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
Your experiment path is /beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_9
To display your profiling results:
###############################################################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_9 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_9 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_9 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_9 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_9 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_9 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_9 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/qaas_runs_test/isix02.benchmarkcenter.megware.com/177-373-7005/stream/run/oneview_runs/multicore/gcc_2/oneview_results_1773739244/tools/lprof_run_9 #
###############################################################################################################################################################################################################################################