* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 712613)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 235.14376 +- 0.000001. Correct Result: 235.143764
Configuration              
Number of Threads:         1
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt
Time measurements          
Total experiment time:     1086.96
Minimum kernel time:       0.0106314
Maximum kernel time:       0.0116609
Arithm. Mean kernel time:  0.0108695
Performance results        
Total GFlops/s:            1.33339
Minimum GFlops/s:          1.24292
Maximum GFlops/s:          1.36327
Arithm. Mean GFlops/s:     1.33341
* Info: Process finished (host skylake, process 712613)
* Info: Dumping samples (host skylake, process 712613)
* Info: Dumping source info for callchain nodes (host skylake, process 712613)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 712613)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0
To display your profiling results:
###############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                    #
###############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0  #
###############################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 712749)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.20245 +- 0.000001. Correct Result: 233.202453
Configuration              
Number of Threads:         2
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt
Time measurements          
Total experiment time:     561.626
Minimum kernel time:       0.00550001
Maximum kernel time:       0.0079923
Arithm. Mean kernel time:  0.00561613
Performance results        
Total GFlops/s:            2.58063
Minimum GFlops/s:          1.81343
Maximum GFlops/s:          2.63517
Arithm. Mean GFlops/s:     2.58069
* Info: Process finished (host skylake, process 712749)
* Info: Dumping samples (host skylake, process 712749)
* Info: Dumping source info for callchain nodes (host skylake, process 712749)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 712749)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1
To display your profiling results:
###############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                    #
###############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1  #
###############################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 712850)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.42015 +- 0.000001. Correct Result: 234.420152
Configuration              
Number of Threads:         4
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt
Time measurements          
Total experiment time:     292.985
Minimum kernel time:       0.00278983
Maximum kernel time:       0.0221535
Arithm. Mean kernel time:  0.00292976
Performance results        
Total GFlops/s:            4.94684
Minimum GFlops/s:          0.654231
Maximum GFlops/s:          5.19512
Arithm. Mean GFlops/s:     4.947
* Info: Process finished (host skylake, process 712850)
* Info: Dumping samples (host skylake, process 712850)
* Info: Dumping source info for callchain nodes (host skylake, process 712850)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 712850)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2
To display your profiling results:
###############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                    #
###############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2  #
###############################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 712938)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.66557 +- 0.000001. Correct Result: 233.665565
Configuration              
Number of Threads:         8
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt
Time measurements          
Total experiment time:     156.983
Minimum kernel time:       0.00147383
Maximum kernel time:       0.00450785
Arithm. Mean kernel time:  0.0015697
Performance results        
Total GFlops/s:            9.23255
Minimum GFlops/s:          3.21517
Maximum GFlops/s:          9.8339
Arithm. Mean GFlops/s:     9.23327
* Info: Process finished (host skylake, process 712938)
* Info: Dumping samples (host skylake, process 712938)
* Info: Dumping source info for callchain nodes (host skylake, process 712938)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 712938)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3
To display your profiling results:
###############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                    #
###############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3  #
###############################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 713023)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.58561 +- 0.000001. Correct Result: 234.585612
Configuration              
Number of Threads:         16
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt
Time measurements          
Total experiment time:     86.2404
Minimum kernel time:       0.00082339
Maximum kernel time:       0.00680713
Arithm. Mean kernel time:  0.000862338
Performance results        
Total GFlops/s:            16.8059
Minimum GFlops/s:          2.12916
Maximum GFlops/s:          17.6022
Arithm. Mean GFlops/s:     16.8072
* Info: Process finished (host skylake, process 713023)
* Info: Dumping samples (host skylake, process 713023)
* Info: Dumping source info for callchain nodes (host skylake, process 713023)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 713023)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4
To display your profiling results:
###############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                    #
###############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4  #
###############################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 713108)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.05586 +- 0.000001. Correct Result: 234.055865
Configuration              
Number of Threads:         26
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt
Time measurements          
Total experiment time:     70.8301
Minimum kernel time:       0.000689473
Maximum kernel time:       0.0041108
Arithm. Mean kernel time:  0.000708186
Performance results        
Total GFlops/s:            20.4623
Minimum GFlops/s:          3.52571
Maximum GFlops/s:          21.0211
Arithm. Mean GFlops/s:     20.4657
* Info: Process finished (host skylake, process 713108)
* Info: Dumping samples (host skylake, process 713108)
* Info: Dumping source info for callchain nodes (host skylake, process 713108)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 713108)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5
To display your profiling results:
###############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                    #
###############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5  #
###############################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 713205)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.71083 +- 0.000001. Correct Result: 233.710830
Configuration              
Number of Threads:         52
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt
Time measurements          
Total experiment time:     33.2772
Minimum kernel time:       0.000286154
Maximum kernel time:       0.0131419
Arithm. Mean kernel time:  0.000332461
Performance results        
Total GFlops/s:            43.5538
Minimum GFlops/s:          1.10285
Maximum GFlops/s:          50.6492
Arithm. Mean GFlops/s:     43.5946
* Info: Process finished (host skylake, process 713205)
* Info: Dumping samples (host skylake, process 713205)
* Info: Dumping source info for callchain nodes (host skylake, process 713205)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 713205)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6
To display your profiling results:
###############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                    #
###############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6  #
###############################################################################################################################################