options

Executable Output


* Info: Selecting the 'perf-low-ppn' engine for node p11-grace01.cs.it4i.cz

* Info: "ref-cycles" not supported on p11-grace01.cs.it4i.cz: fallback to "cpu-clock"
* Info: Process launched (host p11-grace01.cs.it4i.cz, process 1662918)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.22918 +- 0.000001. Correct Result: 234.229181

Configuration              
Number of Threads:         1
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     434.459
Minimum kernel time:       0.00396379
Maximum kernel time:       0.00455897
Arithm. Mean kernel time:  0.00434454

Performance results        
Total GFlops/s:            3.33599
Minimum GFlops/s:          3.17912
Maximum GFlops/s:          3.65648
Arithm. Mean GFlops/s:     3.33602


* Info: Process finished (host p11-grace01.cs.it4i.cz, process 1662918)
* Info: Dumping samples (host p11-grace01.cs.it4i.cz, process 1662918)
* Info: Dumping source info for callchain nodes (host p11-grace01.cs.it4i.cz, process 1662918)
* Info: Building/writing metadata (host p11-grace01.cs.it4i.cz)
* Info: Finished collect step (host p11-grace01.cs.it4i.cz, process 1662918)

Your experiment path is /home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_0

To display your profiling results:
#########################################################################################################################################
#    LEVEL    |     REPORT     |                                                COMMAND                                                 #
#########################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_0  #
#########################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node p11-grace01.cs.it4i.cz

* Info: "ref-cycles" not supported on p11-grace01.cs.it4i.cz: fallback to "cpu-clock"
* Info: Process launched (host p11-grace01.cs.it4i.cz, process 1663086)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 233.79239 +- 0.000001. Correct Result: 233.792395

Configuration              
Number of Threads:         2
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     249.705
Minimum kernel time:       0.00238954
Maximum kernel time:       0.00452988
Arithm. Mean kernel time:  0.00249701

Performance results        
Total GFlops/s:            5.80426
Minimum GFlops/s:          3.19953
Maximum GFlops/s:          6.06538
Arithm. Mean GFlops/s:     5.80434


* Info: Process finished (host p11-grace01.cs.it4i.cz, process 1663086)
* Info: Dumping samples (host p11-grace01.cs.it4i.cz, process 1663086)
* Info: Dumping source info for callchain nodes (host p11-grace01.cs.it4i.cz, process 1663086)
* Info: Building/writing metadata (host p11-grace01.cs.it4i.cz)
* Info: Finished collect step (host p11-grace01.cs.it4i.cz, process 1663086)

Your experiment path is /home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_1

To display your profiling results:
#########################################################################################################################################
#    LEVEL    |     REPORT     |                                                COMMAND                                                 #
#########################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_1  #
#########################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node p11-grace01.cs.it4i.cz

* Info: "ref-cycles" not supported on p11-grace01.cs.it4i.cz: fallback to "cpu-clock"
* Info: Process launched (host p11-grace01.cs.it4i.cz, process 1663255)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.37530 +- 0.000001. Correct Result: 234.375303

Configuration              
Number of Threads:         4
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     126.032
Minimum kernel time:       0.00121816
Maximum kernel time:       0.00328189
Arithm. Mean kernel time:  0.00126028

Performance results        
Total GFlops/s:            11.4998
Minimum GFlops/s:          4.4162
Maximum GFlops/s:          11.8979
Arithm. Mean GFlops/s:     11.5002


* Info: Process finished (host p11-grace01.cs.it4i.cz, process 1663255)
* Info: Dumping samples (host p11-grace01.cs.it4i.cz, process 1663255)
* Info: Dumping source info for callchain nodes (host p11-grace01.cs.it4i.cz, process 1663255)
* Info: Building/writing metadata (host p11-grace01.cs.it4i.cz)
* Info: Finished collect step (host p11-grace01.cs.it4i.cz, process 1663255)

Your experiment path is /home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_2

To display your profiling results:
#########################################################################################################################################
#    LEVEL    |     REPORT     |                                                COMMAND                                                 #
#########################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_2  #
#########################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node p11-grace01.cs.it4i.cz

* Info: "ref-cycles" not supported on p11-grace01.cs.it4i.cz: fallback to "cpu-clock"
* Info: Process launched (host p11-grace01.cs.it4i.cz, process 1663411)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 233.24883 +- 0.000001. Correct Result: 233.248830

Configuration              
Number of Threads:         8
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     65.3614
Minimum kernel time:       0.000626184
Maximum kernel time:       0.00265997
Arithm. Mean kernel time:  0.000653578

Performance results        
Total GFlops/s:            22.1744
Minimum GFlops/s:          5.44875
Maximum GFlops/s:          23.1457
Arithm. Mean GFlops/s:     22.1756


* Info: Process finished (host p11-grace01.cs.it4i.cz, process 1663411)
* Info: Dumping samples (host p11-grace01.cs.it4i.cz, process 1663411)
* Info: Dumping source info for callchain nodes (host p11-grace01.cs.it4i.cz, process 1663411)
* Info: Building/writing metadata (host p11-grace01.cs.it4i.cz)
* Info: Finished collect step (host p11-grace01.cs.it4i.cz, process 1663411)

Your experiment path is /home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_3

To display your profiling results:
#########################################################################################################################################
#    LEVEL    |     REPORT     |                                                COMMAND                                                 #
#########################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_3  #
#########################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node p11-grace01.cs.it4i.cz

* Info: "ref-cycles" not supported on p11-grace01.cs.it4i.cz: fallback to "cpu-clock"
* Info: Process launched (host p11-grace01.cs.it4i.cz, process 1663572)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.84873 +- 0.000001. Correct Result: 234.848734

Configuration              
Number of Threads:         16
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     34.2205
Minimum kernel time:       0.000320757
Maximum kernel time:       0.00369247
Arithm. Mean kernel time:  0.000342165

Performance results        
Total GFlops/s:            42.3532
Minimum GFlops/s:          3.92515
Maximum GFlops/s:          45.1853
Arithm. Mean GFlops/s:     42.3582


* Info: Process finished (host p11-grace01.cs.it4i.cz, process 1663572)
* Info: Dumping samples (host p11-grace01.cs.it4i.cz, process 1663572)
* Info: Dumping source info for callchain nodes (host p11-grace01.cs.it4i.cz, process 1663572)
* Info: Building/writing metadata (host p11-grace01.cs.it4i.cz)
* Info: Finished collect step (host p11-grace01.cs.it4i.cz, process 1663572)

Your experiment path is /home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_4

To display your profiling results:
#########################################################################################################################################
#    LEVEL    |     REPORT     |                                                COMMAND                                                 #
#########################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_4  #
#########################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node p11-grace01.cs.it4i.cz

* Info: "ref-cycles" not supported on p11-grace01.cs.it4i.cz: fallback to "cpu-clock"
* Info: Process launched (host p11-grace01.cs.it4i.cz, process 1663741)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.12323 +- 0.000001. Correct Result: 234.123235

Configuration              
Number of Threads:         32
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     18.5399
Minimum kernel time:       0.000166649
Maximum kernel time:       0.00374482
Arithm. Mean kernel time:  0.000185354

Performance results        
Total GFlops/s:            78.1747
Minimum GFlops/s:          3.87028
Maximum GFlops/s:          86.9702
Arithm. Mean GFlops/s:     78.1937


* Info: Process finished (host p11-grace01.cs.it4i.cz, process 1663741)
* Info: Dumping samples (host p11-grace01.cs.it4i.cz, process 1663741)
* Info: Dumping source info for callchain nodes (host p11-grace01.cs.it4i.cz, process 1663741)
* Info: Building/writing metadata (host p11-grace01.cs.it4i.cz)
* Info: Finished collect step (host p11-grace01.cs.it4i.cz, process 1663741)

Your experiment path is /home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_5

To display your profiling results:
#########################################################################################################################################
#    LEVEL    |     REPORT     |                                                COMMAND                                                 #
#########################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_5  #
#########################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node p11-grace01.cs.it4i.cz

* Info: "ref-cycles" not supported on p11-grace01.cs.it4i.cz: fallback to "cpu-clock"
* Info: Process launched (host p11-grace01.cs.it4i.cz, process 1663925)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.84358 +- 0.000001. Correct Result: 234.843575

Configuration              
Number of Threads:         64
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     10.2528
Minimum kernel time:       9.03644e-05
Maximum kernel time:       0.00561976
Arithm. Mean kernel time:  0.000102489

Performance results        
Total GFlops/s:            141.361
Minimum GFlops/s:          2.57902
Maximum GFlops/s:          160.389
Arithm. Mean GFlops/s:     141.415


* Info: Process finished (host p11-grace01.cs.it4i.cz, process 1663925)
* Info: Dumping samples (host p11-grace01.cs.it4i.cz, process 1663925)
* Info: Dumping source info for callchain nodes (host p11-grace01.cs.it4i.cz, process 1663925)
* Info: Building/writing metadata (host p11-grace01.cs.it4i.cz)
* Info: Finished collect step (host p11-grace01.cs.it4i.cz, process 1663925)

Your experiment path is /home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_6

To display your profiling results:
#########################################################################################################################################
#    LEVEL    |     REPORT     |                                                COMMAND                                                 #
#########################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_6      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_6  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_6  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_6  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_6      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_6  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_6  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_6  #
#########################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node p11-grace01.cs.it4i.cz

* Info: "ref-cycles" not supported on p11-grace01.cs.it4i.cz: fallback to "cpu-clock"
* Info: Process launched (host p11-grace01.cs.it4i.cz, process 1664141)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 235.00305 +- 0.000001. Correct Result: 235.003047

Configuration              
Number of Threads:         72
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     9.36641
Minimum kernel time:       8.21427e-05
Maximum kernel time:       0.0100872
Arithm. Mean kernel time:  9.36226e-05

Performance results        
Total GFlops/s:            154.739
Minimum GFlops/s:          1.43682
Maximum GFlops/s:          176.443
Arithm. Mean GFlops/s:     154.808


* Info: Process finished (host p11-grace01.cs.it4i.cz, process 1664141)
* Info: Dumping samples (host p11-grace01.cs.it4i.cz, process 1664141)
* Info: Dumping source info for callchain nodes (host p11-grace01.cs.it4i.cz, process 1664141)
* Info: Building/writing metadata (host p11-grace01.cs.it4i.cz)
* Info: Finished collect step (host p11-grace01.cs.it4i.cz, process 1664141)

Your experiment path is /home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_7

To display your profiling results:
#########################################################################################################################################
#    LEVEL    |     REPORT     |                                                COMMAND                                                 #
#########################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_7      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_7  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_7  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_7  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_7      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_7  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_7  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_7  #
#########################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node p11-grace01.cs.it4i.cz

* Info: "ref-cycles" not supported on p11-grace01.cs.it4i.cz: fallback to "cpu-clock"
* Info: Process launched (host p11-grace01.cs.it4i.cz, process 1664365)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 233.69213 +- 0.000001. Correct Result: 233.692128

Configuration              
Number of Threads:         144
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     6.87415
Minimum kernel time:       5.78221e-05
Maximum kernel time:       0.0135659
Arithm. Mean kernel time:  6.87058e-05

Performance results        
Total GFlops/s:            210.841
Minimum GFlops/s:          1.06837
Maximum GFlops/s:          250.657
Arithm. Mean GFlops/s:     210.95


* Info: Process finished (host p11-grace01.cs.it4i.cz, process 1664365)
* Info: Dumping samples (host p11-grace01.cs.it4i.cz, process 1664365)
* Info: Dumping source info for callchain nodes (host p11-grace01.cs.it4i.cz, process 1664365)
* Info: Building/writing metadata (host p11-grace01.cs.it4i.cz)
* Info: Finished collect step (host p11-grace01.cs.it4i.cz, process 1664365)

Your experiment path is /home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_8

To display your profiling results:
#########################################################################################################################################
#    LEVEL    |     REPORT     |                                                COMMAND                                                 #
#########################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_8      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_8  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_8  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_8  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_8      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_8  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_8  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/it4i-hugobol/pop3/epi-spmxv-main/spmxv_large_gcc/tools/lprof_npsu_run_8  #
#########################################################################################################################################

×