
Executable Output

* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 11794)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.24691 +- 0.000001. Correct Result: 234.246915

Number of Threads:         1
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     612.455
Minimum kernel time:       0.0060858
Maximum kernel time:       0.00679828
Arithm. Mean kernel time:  0.00612444

Performance results        
Total GFlops/s:            2.36646
Minimum GFlops/s:          2.13193
Maximum GFlops/s:          2.38152
Arithm. Mean GFlops/s:     2.3665

* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 11794)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 11794)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 11794)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 11794)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0

To display your profiling results:
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0  #

* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 12055)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 235.20259 +- 0.000001. Correct Result: 235.202586

Number of Threads:         2
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     334.236
Minimum kernel time:       0.0032774
Maximum kernel time:       0.00420159
Arithm. Mean kernel time:  0.00334226

Performance results        
Total GFlops/s:            4.3363
Minimum GFlops/s:          3.44952
Maximum GFlops/s:          4.42225
Arithm. Mean GFlops/s:     4.33644

* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 12055)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 12055)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 12055)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 12055)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1

To display your profiling results:
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1  #

* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 12293)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.04187 +- 0.000001. Correct Result: 234.041865

Number of Threads:         4
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     212.311
Minimum kernel time:       0.00210197
Maximum kernel time:       0.00284789
Arithm. Mean kernel time:  0.00212304

Performance results        
Total GFlops/s:            6.82655
Minimum GFlops/s:          5.0892
Maximum GFlops/s:          6.89521
Arithm. Mean GFlops/s:     6.82678

* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 12293)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 12293)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 12293)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 12293)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2

To display your profiling results:
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2  #

* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 12530)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 235.03352 +- 0.000001. Correct Result: 235.033523

Number of Threads:         8
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     138.754
Minimum kernel time:       0.00137029
Maximum kernel time:       0.00198869
Arithm. Mean kernel time:  0.00138747

Performance results        
Total GFlops/s:            10.4455
Minimum GFlops/s:          7.28797
Maximum GFlops/s:          10.577
Arithm. Mean GFlops/s:     10.446

* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 12530)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 12530)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 12530)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 12530)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3

To display your profiling results:
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3  #

* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 12767)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 233.96475 +- 0.000001. Correct Result: 233.964750

Number of Threads:         16
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     90.8681
Minimum kernel time:       0.000709572
Maximum kernel time:       0.00492626
Arithm. Mean kernel time:  0.000908622

Performance results        
Total GFlops/s:            15.95
Minimum GFlops/s:          2.94209
Maximum GFlops/s:          20.4257
Arithm. Mean GFlops/s:     15.9511

* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 12767)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 12767)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 12767)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 12767)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4

To display your profiling results:
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4  #

* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 13009)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 235.42291 +- 0.000001. Correct Result: 235.422912

Number of Threads:         32
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     61.4245
Minimum kernel time:       0.000445407
Maximum kernel time:       0.00910249
Arithm. Mean kernel time:  0.000614174

Performance results        
Total GFlops/s:            23.5956
Minimum GFlops/s:          1.59226
Maximum GFlops/s:          32.5399
Arithm. Mean GFlops/s:     23.5984

* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 13009)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 13009)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 13009)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 13009)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5

To display your profiling results:
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5  #

* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 13267)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 233.66143 +- 0.000001. Correct Result: 233.661434

Number of Threads:         52
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     28.2733
Minimum kernel time:       0.00019818
Maximum kernel time:       0.0047793
Arithm. Mean kernel time:  0.000282656

Performance results        
Total GFlops/s:            51.2621
Minimum GFlops/s:          3.03256
Maximum GFlops/s:          73.133
Arithm. Mean GFlops/s:     51.276

* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 13267)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 13267)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 13267)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 13267)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6

To display your profiling results:
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6  #

* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 13544)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.57167 +- 0.000001. Correct Result: 234.571672

Number of Threads:         104
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     8.26536
Minimum kernel time:       7.0994e-05
Maximum kernel time:       0.00580142
Arithm. Mean kernel time:  8.25941e-05

Performance results        
Total GFlops/s:            175.352
Minimum GFlops/s:          2.49826
Maximum GFlops/s:          204.151
Arithm. Mean GFlops/s:     175.479

* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 13544)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 13544)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 13544)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 13544)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7

To display your profiling results:
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7  #

* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 13869)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 235.32236 +- 0.000001. Correct Result: 235.322355

Number of Threads:         208
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     11.1123
Minimum kernel time:       8.5977e-05
Maximum kernel time:       0.00930715
Arithm. Mean kernel time:  0.000111039

Performance results        
Total GFlops/s:            130.427
Minimum GFlops/s:          1.55724
Maximum GFlops/s:          168.574
Arithm. Mean GFlops/s:     130.526

* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 13869)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 13869)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 13869)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 13869)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8

To display your profiling results:
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8  #
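
The runs above differ only in thread count, so the scaling behaviour can be read from the "Number of Threads" and "Arithm. Mean kernel time" lines. The following is a minimal sketch of how that could be tabulated, not part of the benchmark or of MAQAO: the function names `parse_runs` and `scaling` are hypothetical, and the regular expressions assume the field labels appear exactly as printed in this log. Speedup is taken relative to the run with the fewest threads.

```python
import re

# Field labels are assumed to match this log's output verbatim.
THREADS_RE = re.compile(r"Number of Threads:\s+(\d+)")
MEAN_TIME_RE = re.compile(r"Arithm\. Mean kernel time:\s+([0-9.eE+-]+)")


def parse_runs(log_text):
    """Return [(threads, mean_kernel_time), ...] in the order the runs appear."""
    threads = [int(m) for m in THREADS_RE.findall(log_text)]
    times = [float(m) for m in MEAN_TIME_RE.findall(log_text)]
    if len(threads) != len(times):
        raise ValueError("unequal number of thread counts and timings")
    return list(zip(threads, times))


def scaling(runs):
    """Return [(threads, speedup, efficiency), ...] relative to the smallest run."""
    base_threads, base_time = min(runs)  # run with the fewest threads
    rows = []
    for n, t in runs:
        speedup = base_time / t
        efficiency = speedup * base_threads / n
        rows.append((n, speedup, efficiency))
    return rows


# Two-run excerpt with values copied from the log above.
sample = """\
Number of Threads:         1
Arithm. Mean kernel time:  0.00612444
Number of Threads:         2
Arithm. Mean kernel time:  0.00334226
"""

for n, s, e in scaling(parse_runs(sample)):
    print(f"{n:4d} threads  speedup {s:6.2f}  efficiency {e:6.1%}")
```

Passing the full log text instead of `sample` gives one row per run. The run at 208 threads would show a lower speedup than the run at 104 threads, consistent with the drop in reported GFlops/s (130.427 versus 175.352).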
