* Info: Selecting the 'perf-low-ppn' engine for node ip-172-31-42-13
* Info: "ref-cycles" not supported on ip-172-31-42-13: fallback to "cpu-clock"
* Info: Process launched (host ip-172-31-42-13, process 2816)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.66672 +- 0.000001. Correct Result: 234.666724
Configuration
Number of Threads: 1
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 557.197
Minimum kernel time: 0.00554326
Maximum kernel time: 0.00610639
Arithm. Mean kernel time: 0.00557187
Performance results
Total GFlops/s: 2.60114
Minimum GFlops/s: 2.3735
Maximum GFlops/s: 2.61462
Arithm. Mean GFlops/s: 2.60119
* Info: Process finished (host ip-172-31-42-13, process 2816)
* Info: Dumping samples (host ip-172-31-42-13, process 2816)
* Info: Dumping source info for callchain nodes (host ip-172-31-42-13, process 2816)
* Info: Building/writing metadata (host ip-172-31-42-13)
* Info: Finished collect step (host ip-172-31-42-13, process 2816)
Your experiment path is /home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_0
To display your profiling results:
#####################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#####################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_0 #
#####################################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ip-172-31-42-13
* Info: "ref-cycles" not supported on ip-172-31-42-13: fallback to "cpu-clock"
* Info: Process launched (host ip-172-31-42-13, process 2978)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.89722 +- 0.000001. Correct Result: 233.897216
Configuration
Number of Threads: 2
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 353.862
Minimum kernel time: 0.00347217
Maximum kernel time: 0.00849633
Arithm. Mean kernel time: 0.0035385
Performance results
Total GFlops/s: 4.0958
Minimum GFlops/s: 1.70585
Maximum GFlops/s: 4.17419
Arithm. Mean GFlops/s: 4.09594
* Info: Process finished (host ip-172-31-42-13, process 2978)
* Info: Dumping samples (host ip-172-31-42-13, process 2978)
* Info: Dumping source info for callchain nodes (host ip-172-31-42-13, process 2978)
* Info: Building/writing metadata (host ip-172-31-42-13)
* Info: Finished collect step (host ip-172-31-42-13, process 2978)
Your experiment path is /home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_1
To display your profiling results:
#####################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#####################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_1 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_1 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_1 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_1 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_1 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_1 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_1 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_1 #
#####################################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ip-172-31-42-13
* Info: "ref-cycles" not supported on ip-172-31-42-13: fallback to "cpu-clock"
* Info: Process launched (host ip-172-31-42-13, process 3065)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.81876 +- 0.000001. Correct Result: 234.818758
Configuration
Number of Threads: 4
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 191.928
Minimum kernel time: 0.00188861
Maximum kernel time: 0.006898
Arithm. Mean kernel time: 0.00191917
Performance results
Total GFlops/s: 7.55153
Minimum GFlops/s: 2.10112
Maximum GFlops/s: 7.67416
Arithm. Mean GFlops/s: 7.55197
* Info: Process finished (host ip-172-31-42-13, process 3065)
* Info: Dumping samples (host ip-172-31-42-13, process 3065)
* Info: Dumping source info for callchain nodes (host ip-172-31-42-13, process 3065)
* Info: Building/writing metadata (host ip-172-31-42-13)
* Info: Finished collect step (host ip-172-31-42-13, process 3065)
Your experiment path is /home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_2
To display your profiling results:
#####################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#####################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_2 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_2 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_2 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_2 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_2 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_2 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_2 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_2 #
#####################################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ip-172-31-42-13
* Info: "ref-cycles" not supported on ip-172-31-42-13: fallback to "cpu-clock"
* Info: Process launched (host ip-172-31-42-13, process 3186)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.08047 +- 0.000001. Correct Result: 234.080470
Configuration
Number of Threads: 8
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 112.809
Minimum kernel time: 0.0011082
Maximum kernel time: 0.00598793
Arithm. Mean kernel time: 0.00112795
Performance results
Total GFlops/s: 12.8478
Minimum GFlops/s: 2.42045
Maximum GFlops/s: 13.0784
Arithm. Mean GFlops/s: 12.8495
* Info: Process finished (host ip-172-31-42-13, process 3186)
* Info: Dumping samples (host ip-172-31-42-13, process 3186)
* Info: Dumping source info for callchain nodes (host ip-172-31-42-13, process 3186)
* Info: Building/writing metadata (host ip-172-31-42-13)
* Info: Finished collect step (host ip-172-31-42-13, process 3186)
Your experiment path is /home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_3
To display your profiling results:
#####################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#####################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_3 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_3 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_3 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_3 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_3 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_3 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_3 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_3 #
#####################################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ip-172-31-42-13
* Info: "ref-cycles" not supported on ip-172-31-42-13: fallback to "cpu-clock"
* Info: Process launched (host ip-172-31-42-13, process 3275)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.62014 +- 0.000001. Correct Result: 234.620139
Configuration
Number of Threads: 16
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 86.429
Minimum kernel time: 0.000848328
Maximum kernel time: 0.00559589
Arithm. Mean kernel time: 0.000864176
Performance results
Total GFlops/s: 16.7692
Minimum GFlops/s: 2.59003
Maximum GFlops/s: 17.0848
Arithm. Mean GFlops/s: 16.7715
* Info: Process finished (host ip-172-31-42-13, process 3275)
* Info: Dumping samples (host ip-172-31-42-13, process 3275)
* Info: Dumping source info for callchain nodes (host ip-172-31-42-13, process 3275)
* Info: Building/writing metadata (host ip-172-31-42-13)
* Info: Finished collect step (host ip-172-31-42-13, process 3275)
Your experiment path is /home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_4
To display your profiling results:
#####################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#####################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_4 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_4 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_4 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_4 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_4 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_4 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_4 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_4 #
#####################################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ip-172-31-42-13
* Info: "ref-cycles" not supported on ip-172-31-42-13: fallback to "cpu-clock"
* Info: Process launched (host ip-172-31-42-13, process 3372)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.48921 +- 0.000001. Correct Result: 233.489209
Configuration
Number of Threads: 32
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 54.3523
Minimum kernel time: 0.000504752
Maximum kernel time: 0.00543353
Arithm. Mean kernel time: 0.000543454
Performance results
Total GFlops/s: 26.6658
Minimum GFlops/s: 2.66742
Maximum GFlops/s: 28.7141
Arithm. Mean GFlops/s: 26.6692
* Info: Process finished (host ip-172-31-42-13, process 3372)
* Info: Dumping samples (host ip-172-31-42-13, process 3372)
* Info: Dumping source info for callchain nodes (host ip-172-31-42-13, process 3372)
* Info: Building/writing metadata (host ip-172-31-42-13)
* Info: Finished collect step (host ip-172-31-42-13, process 3372)
Your experiment path is /home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_5
To display your profiling results:
#####################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#####################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_5 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_5 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_5 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_5 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_5 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_5 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_5 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_5 #
#####################################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ip-172-31-42-13
* Info: "ref-cycles" not supported on ip-172-31-42-13: fallback to "cpu-clock"
* Info: Process launched (host ip-172-31-42-13, process 3484)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.67882 +- 0.000001. Correct Result: 233.678816
Configuration
Number of Threads: 64
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 24.648
Minimum kernel time: 0.000229732
Maximum kernel time: 0.0080629
Arithm. Mean kernel time: 0.000246436
Performance results
Total GFlops/s: 58.8018
Minimum GFlops/s: 1.79755
Maximum GFlops/s: 63.0887
Arithm. Mean GFlops/s: 58.8123
* Info: Process finished (host ip-172-31-42-13, process 3484)
* Info: Dumping samples (host ip-172-31-42-13, process 3484)
* Info: Dumping source info for callchain nodes (host ip-172-31-42-13, process 3484)
* Info: Building/writing metadata (host ip-172-31-42-13)
* Info: Finished collect step (host ip-172-31-42-13, process 3484)
Your experiment path is /home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_6
To display your profiling results:
#####################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#####################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_6 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_6 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_6 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_6 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_6 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_6 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_6 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/hbollore/pop3/spmxv/epi-spmxv-main/spmxv_large_g3e_gcc_armpl/tools/lprof_npsu_run_6 #
#####################################################################################################################################################