* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 706528)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.88004 +- 0.000001. Correct Result: 234.880041
Configuration
Number of Threads: 1
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 1093.33
Minimum kernel time: 0.0108221
Maximum kernel time: 0.019305
Arithm. Mean kernel time: 0.0109333
Performance results
Total GFlops/s: 1.32562
Minimum GFlops/s: 0.750764
Maximum GFlops/s: 1.33925
Arithm. Mean GFlops/s: 1.32563
* Info: Process finished (host skylake, process 706528)
* Info: Dumping samples (host skylake, process 706528)
* Info: Dumping source info for callchain nodes (host skylake, process 706528)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 706528)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0
To display your profiling results:
######################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0 #
######################################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 706660)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.43286 +- 0.000001. Correct Result: 234.432862
Configuration
Number of Threads: 2
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 627.395
Minimum kernel time: 0.00603509
Maximum kernel time: 0.0211191
Arithm. Mean kernel time: 0.00627379
Performance results
Total GFlops/s: 2.31011
Minimum GFlops/s: 0.686274
Maximum GFlops/s: 2.40154
Arithm. Mean GFlops/s: 2.31017
* Info: Process finished (host skylake, process 706660)
* Info: Dumping samples (host skylake, process 706660)
* Info: Dumping source info for callchain nodes (host skylake, process 706660)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 706660)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1
To display your profiling results:
######################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1 #
######################################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 706764)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.59205 +- 0.000001. Correct Result: 234.592049
Configuration
Number of Threads: 4
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 304.998
Minimum kernel time: 0.00294995
Maximum kernel time: 0.0110881
Arithm. Mean kernel time: 0.00304989
Performance results
Total GFlops/s: 4.752
Minimum GFlops/s: 1.30712
Maximum GFlops/s: 4.91313
Arithm. Mean GFlops/s: 4.75213
* Info: Process finished (host skylake, process 706764)
* Info: Dumping samples (host skylake, process 706764)
* Info: Dumping source info for callchain nodes (host skylake, process 706764)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 706764)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2
To display your profiling results:
######################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2 #
######################################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 706863)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.21198 +- 0.000001. Correct Result: 234.211983
Configuration
Number of Threads: 8
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 161.584
Minimum kernel time: 0.00153303
Maximum kernel time: 0.00894308
Arithm. Mean kernel time: 0.00161572
Performance results
Total GFlops/s: 8.96961
Minimum GFlops/s: 1.62064
Maximum GFlops/s: 9.45414
Arithm. Mean GFlops/s: 8.97031
* Info: Process finished (host skylake, process 706863)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host skylake, process 706863)
* Info: Dumping source info for callchain nodes (host skylake, process 706863)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 706863)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3
To display your profiling results:
######################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3 #
######################################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 706950)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 232.04590 +- 0.000001. Correct Result: 232.045902
Configuration
Number of Threads: 16
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 79.459
Minimum kernel time: 0.000757933
Maximum kernel time: 0.00465798
Arithm. Mean kernel time: 0.000794509
Performance results
Total GFlops/s: 18.2402
Minimum GFlops/s: 3.11154
Maximum GFlops/s: 19.1224
Arithm. Mean GFlops/s: 18.2421
* Info: Process finished (host skylake, process 706950)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host skylake, process 706950)
* Info: Dumping source info for callchain nodes (host skylake, process 706950)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 706950)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4
To display your profiling results:
######################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4 #
######################################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 707036)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.04515 +- 0.000001. Correct Result: 233.045148
Configuration
Number of Threads: 26
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 53.072
Minimum kernel time: 0.000494003
Maximum kernel time: 0.00719094
Arithm. Mean kernel time: 0.000530647
Performance results
Total GFlops/s: 27.3091
Minimum GFlops/s: 2.01552
Maximum GFlops/s: 29.3389
Arithm. Mean GFlops/s: 27.3129
* Info: Process finished (host skylake, process 707036)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host skylake, process 707036)
* Info: Dumping source info for callchain nodes (host skylake, process 707036)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 707036)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5
To display your profiling results:
######################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5 #
######################################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 707132)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.89933 +- 0.000001. Correct Result: 233.899329
Configuration
Number of Threads: 52
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 29.832
Minimum kernel time: 0.000257015
Maximum kernel time: 0.0319731
Arithm. Mean kernel time: 0.000298199
Performance results
Total GFlops/s: 48.5837
Minimum GFlops/s: 0.453302
Maximum GFlops/s: 56.3916
Arithm. Mean GFlops/s: 48.6034
* Info: Process finished (host skylake, process 707132)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host skylake, process 707132)
* Info: Dumping source info for callchain nodes (host skylake, process 707132)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 707132)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6
To display your profiling results:
######################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6 #
######################################################################################################################################################