options

Executable Output


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 14336)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.04887 +- 0.000001. Correct Result: 234.048872

Configuration              
Number of Threads:         1
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     592.08
Minimum kernel time:       0.00589174
Maximum kernel time:       0.00665916
Arithm. Mean kernel time:  0.00592068

Performance results        
Total GFlops/s:            2.44789
Minimum GFlops/s:          2.17648
Maximum GFlops/s:          2.45997
Arithm. Mean GFlops/s:     2.44795


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 14336)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 14336)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 14336)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 14336)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0

To display your profiling results:
##################################################################################################################################
#    LEVEL    |     REPORT     |                                             COMMAND                                             #
##################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0  #
##################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 14591)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 233.59921 +- 0.000001. Correct Result: 233.599206

Configuration              
Number of Threads:         2
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     320.312
Minimum kernel time:       0.00316651
Maximum kernel time:       0.00430069
Arithm. Mean kernel time:  0.00320301

Performance results        
Total GFlops/s:            4.5248
Minimum GFlops/s:          3.37004
Maximum GFlops/s:          4.57712
Arithm. Mean GFlops/s:     4.52496


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 14591)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 14591)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 14591)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 14591)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1

To display your profiling results:
##################################################################################################################################
#    LEVEL    |     REPORT     |                                             COMMAND                                             #
##################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1  #
##################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 14834)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 233.94766 +- 0.000001. Correct Result: 233.947659

Configuration              
Number of Threads:         4
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     208.684
Minimum kernel time:       0.00207006
Maximum kernel time:       0.00267596
Arithm. Mean kernel time:  0.00208676

Performance results        
Total GFlops/s:            6.9452
Minimum GFlops/s:          5.41619
Maximum GFlops/s:          7.00147
Arithm. Mean GFlops/s:     6.94546


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 14834)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 14834)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 14834)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 14834)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2

To display your profiling results:
##################################################################################################################################
#    LEVEL    |     REPORT     |                                             COMMAND                                             #
##################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2  #
##################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 15070)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.17492 +- 0.000001. Correct Result: 234.174919

Configuration              
Number of Threads:         8
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     135.675
Minimum kernel time:       0.00134281
Maximum kernel time:       0.00191158
Arithm. Mean kernel time:  0.00135669

Performance results        
Total GFlops/s:            10.6825
Minimum GFlops/s:          7.58196
Maximum GFlops/s:          10.7934
Arithm. Mean GFlops/s:     10.683


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 15070)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 15070)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 15070)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 15070)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3

To display your profiling results:
##################################################################################################################################
#    LEVEL    |     REPORT     |                                             COMMAND                                             #
##################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3  #
##################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 15306)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 233.56038 +- 0.000001. Correct Result: 233.560378

Configuration              
Number of Threads:         16
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     121.377
Minimum kernel time:       0.000676864
Maximum kernel time:       0.002936
Arithm. Mean kernel time:  0.00121371

Performance results        
Total GFlops/s:            11.9409
Minimum GFlops/s:          4.93647
Maximum GFlops/s:          21.4127
Arithm. Mean GFlops/s:     11.9415


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 15306)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 15306)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 15306)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 15306)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4

To display your profiling results:
##################################################################################################################################
#    LEVEL    |     REPORT     |                                             COMMAND                                             #
##################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4  #
##################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 15551)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.47951 +- 0.000001. Correct Result: 234.479514

Configuration              
Number of Threads:         32
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     51.365
Minimum kernel time:       0.000371371
Maximum kernel time:       0.00409776
Arithm. Mean kernel time:  0.000513581

Performance results        
Total GFlops/s:            28.2167
Minimum GFlops/s:          3.53693
Maximum GFlops/s:          39.027
Arithm. Mean GFlops/s:     28.2204


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 15551)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 15551)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 15551)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 15551)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5

To display your profiling results:
##################################################################################################################################
#    LEVEL    |     REPORT     |                                             COMMAND                                             #
##################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5  #
##################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 15806)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 233.56348 +- 0.000001. Correct Result: 233.563482

Configuration              
Number of Threads:         52
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     35.5334
Minimum kernel time:       0.00020388
Maximum kernel time:       0.00993908
Arithm. Mean kernel time:  0.000355245

Performance results        
Total GFlops/s:            40.7884
Minimum GFlops/s:          1.45823
Maximum GFlops/s:          71.0884
Arithm. Mean GFlops/s:     40.7986


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 15806)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 15806)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 15806)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 15806)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6

To display your profiling results:
##################################################################################################################################
#    LEVEL    |     REPORT     |                                             COMMAND                                             #
##################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6  #
##################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 16081)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 235.62510 +- 0.000001. Correct Result: 235.625101

Configuration              
Number of Threads:         104
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     7.89645
Minimum kernel time:       6.4954e-05
Maximum kernel time:       0.00669264
Arithm. Mean kernel time:  7.88913e-05

Performance results        
Total GFlops/s:            183.544
Minimum GFlops/s:          2.16559
Maximum GFlops/s:          223.135
Arithm. Mean GFlops/s:     183.715


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 16081)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 16081)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 16081)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 16081)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7

To display your profiling results:
##################################################################################################################################
#    LEVEL    |     REPORT     |                                             COMMAND                                             #
##################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7  #
##################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 16411)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.85291 +- 0.000001. Correct Result: 234.852910

Configuration              
Number of Threads:         208
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     9.13867
Minimum kernel time:       7.5579e-05
Maximum kernel time:       0.00740341
Arithm. Mean kernel time:  9.11083e-05

Performance results        
Total GFlops/s:            158.595
Minimum GFlops/s:          1.95768
Maximum GFlops/s:          191.766
Arithm. Mean GFlops/s:     159.08


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 16411)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 16411)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 16411)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 16411)

Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8

To display your profiling results:
##################################################################################################################################
#    LEVEL    |     REPORT     |                                             COMMAND                                             #
##################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8  #
##################################################################################################################################

×