
Executable Output


* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 712613)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 235.14376 +- 0.000001. Correct Result: 235.143764

Configuration              
Number of Threads:         1
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     1086.96
Minimum kernel time:       0.0106314
Maximum kernel time:       0.0116609
Arithm. Mean kernel time:  0.0108695

Performance results        
Total GFlops/s:            1.33339
Minimum GFlops/s:          1.24292
Maximum GFlops/s:          1.36327
Arithm. Mean GFlops/s:     1.33341


* Info: Process finished (host skylake, process 712613)
* Info: Dumping samples (host skylake, process 712613)
* Info: Dumping source info for callchain nodes (host skylake, process 712613)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 712613)

Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0

To display your profiling results:
###############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                    #
###############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0  #
###############################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 712749)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 233.20245 +- 0.000001. Correct Result: 233.202453

Configuration              
Number of Threads:         2
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     561.626
Minimum kernel time:       0.00550001
Maximum kernel time:       0.0079923
Arithm. Mean kernel time:  0.00561613

Performance results        
Total GFlops/s:            2.58063
Minimum GFlops/s:          1.81343
Maximum GFlops/s:          2.63517
Arithm. Mean GFlops/s:     2.58069


* Info: Process finished (host skylake, process 712749)
* Info: Dumping samples (host skylake, process 712749)
* Info: Dumping source info for callchain nodes (host skylake, process 712749)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 712749)

Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1

To display your profiling results:
###############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                    #
###############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1  #
###############################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 712850)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.42015 +- 0.000001. Correct Result: 234.420152

Configuration              
Number of Threads:         4
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     292.985
Minimum kernel time:       0.00278983
Maximum kernel time:       0.0221535
Arithm. Mean kernel time:  0.00292976

Performance results        
Total GFlops/s:            4.94684
Minimum GFlops/s:          0.654231
Maximum GFlops/s:          5.19512
Arithm. Mean GFlops/s:     4.947


* Info: Process finished (host skylake, process 712850)
* Info: Dumping samples (host skylake, process 712850)
* Info: Dumping source info for callchain nodes (host skylake, process 712850)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 712850)

Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2

To display your profiling results:
###############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                    #
###############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2  #
###############################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 712938)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 233.66557 +- 0.000001. Correct Result: 233.665565

Configuration              
Number of Threads:         8
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     156.983
Minimum kernel time:       0.00147383
Maximum kernel time:       0.00450785
Arithm. Mean kernel time:  0.0015697

Performance results        
Total GFlops/s:            9.23255
Minimum GFlops/s:          3.21517
Maximum GFlops/s:          9.8339
Arithm. Mean GFlops/s:     9.23327


* Info: Process finished (host skylake, process 712938)
* Info: Dumping samples (host skylake, process 712938)
* Info: Dumping source info for callchain nodes (host skylake, process 712938)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 712938)

Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3

To display your profiling results:
###############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                    #
###############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3  #
###############################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 713023)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.58561 +- 0.000001. Correct Result: 234.585612

Configuration              
Number of Threads:         16
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     86.2404
Minimum kernel time:       0.00082339
Maximum kernel time:       0.00680713
Arithm. Mean kernel time:  0.000862338

Performance results        
Total GFlops/s:            16.8059
Minimum GFlops/s:          2.12916
Maximum GFlops/s:          17.6022
Arithm. Mean GFlops/s:     16.8072


* Info: Process finished (host skylake, process 713023)
* Info: Dumping samples (host skylake, process 713023)
* Info: Dumping source info for callchain nodes (host skylake, process 713023)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 713023)

Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4

To display your profiling results:
###############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                    #
###############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4  #
###############################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 713108)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.05586 +- 0.000001. Correct Result: 234.055865

Configuration              
Number of Threads:         26
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     70.8301
Minimum kernel time:       0.000689473
Maximum kernel time:       0.0041108
Arithm. Mean kernel time:  0.000708186

Performance results        
Total GFlops/s:            20.4623
Minimum GFlops/s:          3.52571
Maximum GFlops/s:          21.0211
Arithm. Mean GFlops/s:     20.4657


* Info: Process finished (host skylake, process 713108)
* Info: Dumping samples (host skylake, process 713108)
* Info: Dumping source info for callchain nodes (host skylake, process 713108)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 713108)

Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5

To display your profiling results:
###############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                    #
###############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5  #
###############################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 713205)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 233.71083 +- 0.000001. Correct Result: 233.710830

Configuration              
Number of Threads:         52
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     33.2772
Minimum kernel time:       0.000286154
Maximum kernel time:       0.0131419
Arithm. Mean kernel time:  0.000332461

Performance results        
Total GFlops/s:            43.5538
Minimum GFlops/s:          1.10285
Maximum GFlops/s:          50.6492
Arithm. Mean GFlops/s:     43.5946


* Info: Process finished (host skylake, process 713205)
* Info: Dumping samples (host skylake, process 713205)
* Info: Dumping source info for callchain nodes (host skylake, process 713205)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 713205)

Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6

To display your profiling results:
###############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                    #
###############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6  #
###############################################################################################################################################
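
Scaling summary (illustrative)

The seven runs above differ only in thread count, so their "Arithm. Mean kernel time" values can be compared directly. The sketch below is not part of the MAQAO output. It is a minimal example that copies those values by hand and derives speedup and parallel efficiency relative to the 1-thread run. The names mean_kernel_time and scaling_table are hypothetical, and the times are assumed to be in seconds, which the log does not state.

```python
# Arithmetic mean kernel times copied from the seven runs above,
# keyed by "Number of Threads". Unit assumed to be seconds (not stated in the log).
mean_kernel_time = {
    1: 0.0108695,
    2: 0.00561613,
    4: 0.00292976,
    8: 0.0015697,
    16: 0.000862338,
    26: 0.000708186,
    52: 0.000332461,
}


def scaling_table(times):
    """Return {threads: (speedup, efficiency)} relative to the smallest thread count."""
    base_threads = min(times)
    base_time = times[base_threads]
    table = {}
    for threads in sorted(times):
        # Speedup: how many times faster than the baseline run.
        speedup = base_time / times[threads]
        # Efficiency: speedup divided by the increase in thread count.
        efficiency = speedup * base_threads / threads
        table[threads] = (speedup, efficiency)
    return table


scaling = scaling_table(mean_kernel_time)
for threads, (speedup, efficiency) in scaling.items():
    print(f"{threads:>3} threads: speedup {speedup:6.2f}, efficiency {efficiency:6.1%}")
```

The same function works on the "Minimum kernel time" values if best-case rather than average behaviour is of interest.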
