* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 11794)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.24691 +- 0.000001. Correct Result: 234.246915
Configuration              
Number of Threads:         1
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt
Time measurements          
Total experiment time:     612.455
Minimum kernel time:       0.0060858
Maximum kernel time:       0.00679828
Arithm. Mean kernel time:  0.00612444
Performance results        
Total GFlops/s:            2.36646
Minimum GFlops/s:          2.13193
Maximum GFlops/s:          2.38152
Arithm. Mean GFlops/s:     2.3665
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 11794)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 11794)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 11794)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 11794)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0
To display your profiling results:
###############################################################################################################################
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
###############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_0  #
###############################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 12055)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 235.20259 +- 0.000001. Correct Result: 235.202586
Configuration              
Number of Threads:         2
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt
Time measurements          
Total experiment time:     334.236
Minimum kernel time:       0.0032774
Maximum kernel time:       0.00420159
Arithm. Mean kernel time:  0.00334226
Performance results        
Total GFlops/s:            4.3363
Minimum GFlops/s:          3.44952
Maximum GFlops/s:          4.42225
Arithm. Mean GFlops/s:     4.33644
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 12055)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 12055)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 12055)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 12055)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1
To display your profiling results:
###############################################################################################################################
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
###############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_1  #
###############################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 12293)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.04187 +- 0.000001. Correct Result: 234.041865
Configuration              
Number of Threads:         4
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt
Time measurements          
Total experiment time:     212.311
Minimum kernel time:       0.00210197
Maximum kernel time:       0.00284789
Arithm. Mean kernel time:  0.00212304
Performance results        
Total GFlops/s:            6.82655
Minimum GFlops/s:          5.0892
Maximum GFlops/s:          6.89521
Arithm. Mean GFlops/s:     6.82678
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 12293)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 12293)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 12293)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 12293)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2
To display your profiling results:
###############################################################################################################################
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
###############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_2  #
###############################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 12530)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 235.03352 +- 0.000001. Correct Result: 235.033523
Configuration              
Number of Threads:         8
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt
Time measurements          
Total experiment time:     138.754
Minimum kernel time:       0.00137029
Maximum kernel time:       0.00198869
Arithm. Mean kernel time:  0.00138747
Performance results        
Total GFlops/s:            10.4455
Minimum GFlops/s:          7.28797
Maximum GFlops/s:          10.577
Arithm. Mean GFlops/s:     10.446
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 12530)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 12530)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 12530)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 12530)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3
To display your profiling results:
###############################################################################################################################
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
###############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_3  #
###############################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 12767)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.96475 +- 0.000001. Correct Result: 233.964750
Configuration              
Number of Threads:         16
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt
Time measurements          
Total experiment time:     90.8681
Minimum kernel time:       0.000709572
Maximum kernel time:       0.00492626
Arithm. Mean kernel time:  0.000908622
Performance results        
Total GFlops/s:            15.95
Minimum GFlops/s:          2.94209
Maximum GFlops/s:          20.4257
Arithm. Mean GFlops/s:     15.9511
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 12767)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 12767)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 12767)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 12767)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4
To display your profiling results:
###############################################################################################################################
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
###############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_4  #
###############################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 13009)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 235.42291 +- 0.000001. Correct Result: 235.422912
Configuration              
Number of Threads:         32
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt
Time measurements          
Total experiment time:     61.4245
Minimum kernel time:       0.000445407
Maximum kernel time:       0.00910249
Arithm. Mean kernel time:  0.000614174
Performance results        
Total GFlops/s:            23.5956
Minimum GFlops/s:          1.59226
Maximum GFlops/s:          32.5399
Arithm. Mean GFlops/s:     23.5984
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 13009)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 13009)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 13009)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 13009)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5
To display your profiling results:
###############################################################################################################################
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
###############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_5  #
###############################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 13267)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.66143 +- 0.000001. Correct Result: 233.661434
Configuration              
Number of Threads:         52
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt
Time measurements          
Total experiment time:     28.2733
Minimum kernel time:       0.00019818
Maximum kernel time:       0.0047793
Arithm. Mean kernel time:  0.000282656
Performance results        
Total GFlops/s:            51.2621
Minimum GFlops/s:          3.03256
Maximum GFlops/s:          73.133
Arithm. Mean GFlops/s:     51.276
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 13267)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 13267)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 13267)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 13267)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6
To display your profiling results:
###############################################################################################################################
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
###############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_6  #
###############################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 13544)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.57167 +- 0.000001. Correct Result: 234.571672
Configuration              
Number of Threads:         104
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt
Time measurements          
Total experiment time:     8.26536
Minimum kernel time:       7.0994e-05
Maximum kernel time:       0.00580142
Arithm. Mean kernel time:  8.25941e-05
Performance results        
Total GFlops/s:            175.352
Minimum GFlops/s:          2.49826
Maximum GFlops/s:          204.151
Arithm. Mean GFlops/s:     175.479
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 13544)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 13544)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 13544)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 13544)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7
To display your profiling results:
###############################################################################################################################
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
###############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_7  #
###############################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 13869)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 235.32236 +- 0.000001. Correct Result: 235.322355
Configuration              
Number of Threads:         208
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt
Time measurements          
Total experiment time:     11.1123
Minimum kernel time:       8.5977e-05
Maximum kernel time:       0.00930715
Arithm. Mean kernel time:  0.000111039
Performance results        
Total GFlops/s:            130.427
Minimum GFlops/s:          1.55724
Maximum GFlops/s:          168.574
Arithm. Mean GFlops/s:     130.526
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 13869)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 13869)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 13869)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 13869)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8
To display your profiling results:
###############################################################################################################################
#    LEVEL    |     REPORT     |                                           COMMAND                                            #
###############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_o3/tools/lprof_npsu_run_8  #
###############################################################################################################################