Executable Output


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 9107)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 235.03761 +- 0.000001. Correct Result: 235.037611

Configuration              
Number of Threads:         1
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     590.411
Minimum kernel time:       0.00587606
Maximum kernel time:       0.00658798
Arithm. Mean kernel time:  0.00590402

Performance results        
Total GFlops/s:            2.45481
Minimum GFlops/s:          2.19999
Maximum GFlops/s:          2.46653
Arithm. Mean GFlops/s:     2.45485


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 9107)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 9107)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 9107)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 9107)

Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0

To display your profiling results:
############################################################################################################################
#    LEVEL    |     REPORT     |                                          COMMAND                                          #
############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0  #
############################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 9359)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.09150 +- 0.000001. Correct Result: 234.091499

Configuration              
Number of Threads:         2
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     320.897
Minimum kernel time:       0.00317192
Maximum kernel time:       0.00428486
Arithm. Mean kernel time:  0.00320889

Performance results        
Total GFlops/s:            4.51656
Minimum GFlops/s:          3.38249
Maximum GFlops/s:          4.56931
Arithm. Mean GFlops/s:     4.51667


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 9359)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 9359)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 9359)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 9359)

Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1

To display your profiling results:
############################################################################################################################
#    LEVEL    |     REPORT     |                                          COMMAND                                          #
############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1  #
############################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 9606)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.94635 +- 0.000001. Correct Result: 234.946347

Configuration              
Number of Threads:         4
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     208.941
Minimum kernel time:       0.0020709
Maximum kernel time:       0.00258303
Arithm. Mean kernel time:  0.00208934

Performance results        
Total GFlops/s:            6.93665
Minimum GFlops/s:          5.61105
Maximum GFlops/s:          6.99863
Arithm. Mean GFlops/s:     6.93687


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 9606)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 9606)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 9606)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 9606)

Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2

To display your profiling results:
############################################################################################################################
#    LEVEL    |     REPORT     |                                          COMMAND                                          #
############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2  #
############################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 9841)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.57306 +- 0.000001. Correct Result: 234.573063

Configuration              
Number of Threads:         8
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     136.976
Minimum kernel time:       0.00135493
Maximum kernel time:       0.00198197
Arithm. Mean kernel time:  0.00136969

Performance results        
Total GFlops/s:            10.5811
Minimum GFlops/s:          7.31266
Maximum GFlops/s:          10.6968
Arithm. Mean GFlops/s:     10.5816


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 9841)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 9841)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 9841)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 9841)

Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3

To display your profiling results:
############################################################################################################################
#    LEVEL    |     REPORT     |                                          COMMAND                                          #
############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3  #
############################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 10081)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 233.26021 +- 0.000001. Correct Result: 233.260206

Configuration              
Number of Threads:         16
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     78.9088
Minimum kernel time:       0.000714779
Maximum kernel time:       0.00503993
Arithm. Mean kernel time:  0.000788997

Performance results        
Total GFlops/s:            18.3674
Minimum GFlops/s:          2.87573
Maximum GFlops/s:          20.2769
Arithm. Mean GFlops/s:     18.3695


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 10081)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 10081)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 10081)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 10081)

Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4

To display your profiling results:
############################################################################################################################
#    LEVEL    |     REPORT     |                                          COMMAND                                          #
############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4  #
############################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 10322)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 235.30691 +- 0.000001. Correct Result: 235.306908

Configuration              
Number of Threads:         32
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     52.0227
Minimum kernel time:       0.000365019
Maximum kernel time:       0.00592399
Arithm. Mean kernel time:  0.00052011

Performance results        
Total GFlops/s:            27.8599
Minimum GFlops/s:          2.44658
Maximum GFlops/s:          39.7062
Arithm. Mean GFlops/s:     27.8662


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 10322)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 10322)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 10322)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 10322)

Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5

To display your profiling results:
############################################################################################################################
#    LEVEL    |     REPORT     |                                          COMMAND                                          #
############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5  #
############################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 10578)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.59948 +- 0.000001. Correct Result: 234.599480

Configuration              
Number of Threads:         52
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     27.3909
Minimum kernel time:       0.000200033
Maximum kernel time:       0.0037992
Arithm. Mean kernel time:  0.000273877

Performance results        
Total GFlops/s:            52.9135
Minimum GFlops/s:          3.81488
Maximum GFlops/s:          72.4554
Arithm. Mean GFlops/s:     52.9197


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 10578)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 10578)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 10578)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 10578)

Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6

To display your profiling results:
############################################################################################################################
#    LEVEL    |     REPORT     |                                          COMMAND                                          #
############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6  #
############################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 10853)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.40563 +- 0.000001. Correct Result: 234.405628

Configuration              
Number of Threads:         104
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     5.8824
Minimum kernel time:       4.60148e-05
Maximum kernel time:       0.0122659
Arithm. Mean kernel time:  5.87934e-05

Performance results        
Total GFlops/s:            246.388
Minimum GFlops/s:          1.18161
Maximum GFlops/s:          314.975
Arithm. Mean GFlops/s:     246.516


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 10853)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 10853)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 10853)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 10853)

Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7

To display your profiling results:
############################################################################################################################
#    LEVEL    |     REPORT     |                                          COMMAND                                          #
############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7  #
############################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com

* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 11179)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.27556 +- 0.000001. Correct Result: 234.275559

Configuration              
Number of Threads:         208
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     5.22391
Minimum kernel time:       3.88622e-05
Maximum kernel time:       0.012706
Arithm. Mean kernel time:  5.22014e-05

Performance results        
Total GFlops/s:            277.445
Minimum GFlops/s:          1.14068
Maximum GFlops/s:          372.946
Arithm. Mean GFlops/s:     277.646


* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 11179)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 11179)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 11179)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 11179)

Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8

To display your profiling results:
############################################################################################################################
#    LEVEL    |     REPORT     |                                          COMMAND                                          #
############################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8  #
############################################################################################################################
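
The runs above differ only in "Number of Threads", so speedup and parallel efficiency relative to the first run can be derived from the "Total experiment time" lines. The following is a minimal sketch of that arithmetic, not part of the benchmark or of MAQAO. The helper names parse_runs and scaling_table are hypothetical, the SAMPLE text copies only the 1-thread and 2-thread values from the output above, and the time unit is left unspecified because the log does not state it.

```python
import re

# Patterns match the field labels exactly as they appear in the output above.
THREADS_RE = re.compile(r"^Number of Threads:\s+(\d+)\s*$", re.MULTILINE)
TOTAL_TIME_RE = re.compile(r"^Total experiment time:\s+([0-9.eE+-]+)\s*$", re.MULTILINE)


def parse_runs(log_text):
    """Return a list of (threads, total_time) pairs, one per run, in log order."""
    threads = [int(m) for m in THREADS_RE.findall(log_text)]
    times = [float(m) for m in TOTAL_TIME_RE.findall(log_text)]
    if len(threads) != len(times):
        raise ValueError("mismatched number of thread and time entries")
    return list(zip(threads, times))


def scaling_table(runs):
    """Compute speedup and parallel efficiency relative to the first run."""
    base_threads, base_time = runs[0]
    rows = []
    for threads, total_time in runs:
        speedup = base_time / total_time
        # Efficiency is speedup divided by the increase in thread count.
        efficiency = speedup / (threads / base_threads)
        rows.append((threads, total_time, speedup, efficiency))
    return rows


# Excerpt of the first two runs; values copied from the log above.
SAMPLE = """\
Number of Threads:         1
Total experiment time:     590.411
Number of Threads:         2
Total experiment time:     320.897
"""

if __name__ == "__main__":
    for threads, total_time, speedup, efficiency in scaling_table(parse_runs(SAMPLE)):
        print(f"{threads:>4} threads  time {total_time:>9.3f}  "
              f"speedup {speedup:6.2f}  efficiency {efficiency:5.2f}")
```

Passing the full log text to parse_runs instead of SAMPLE should yield one row per thread count, assuming every run prints both fields as shown above.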
