Run 1x1 | Number processes: 1Number nodes: 1Number processes per node: 1Run Command: <executable> MPI Command: mpirun -np <number_processes>Dataset: Run Directory: /home/eoseret/qaas_runs_CPU_9468/171-111-6305/intel/stream/run/oneview_runs/compilers/icx_10/oneview_run_1711118278I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threadsOMP_PROC_BIND: spreadOMP_NUM_THREADS: 1 |
---|---|
Run 1x2 | OMP_NUM_THREADS: 2I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threadsOMP_PROC_BIND: spread |
Run 1x4 | OMP_NUM_THREADS: 4I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threadsOMP_PROC_BIND: spread |
Run 1x8 | OMP_NUM_THREADS: 8I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threadsOMP_PROC_BIND: spread |
Run 1x16 | OMP_NUM_THREADS: 16I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threadsOMP_PROC_BIND: spread |
Run 1x32 | OMP_NUM_THREADS: 32I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threadsOMP_PROC_BIND: spread |
Run 1x48 | OMP_NUM_THREADS: 48I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threadsOMP_PROC_BIND: spread |
Run 1x96 | OMP_NUM_THREADS: 96I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threadsOMP_PROC_BIND: spread |
Name | Module | Coverage 1x1 (%) | Coverage 1x2 (%) | Coverage 1x4 (%) | Coverage 1x8 (%) | Coverage 1x16 (%) | Coverage 1x32 (%) | Coverage 1x48 (%) | Coverage 1x96 (%) | Max Time Over Threads 1x1 (s) | Max Time Over Threads 1x2 (s) | Max Time Over Threads 1x4 (s) | Max Time Over Threads 1x8 (s) | Max Time Over Threads 1x16 (s) | Max Time Over Threads 1x32 (s) | Max Time Over Threads 1x48 (s) | Max Time Over Threads 1x96 (s) | Time w.r.t. Wall Time 1x1 (s) | Time w.r.t. Wall Time 1x2 (s) | Time w.r.t. Wall Time 1x4 (s) | Time w.r.t. Wall Time 1x8 (s) | Time w.r.t. Wall Time 1x16 (s) | Time w.r.t. Wall Time 1x32 (s) | Time w.r.t. Wall Time 1x48 (s) | Time w.r.t. Wall Time 1x96 (s) | Nb Threads 1x1 | Nb Threads 1x2 | Nb Threads 1x4 | Nb Threads 1x8 | Nb Threads 1x16 | Nb Threads 1x32 | Nb Threads 1x48 | Nb Threads 1x96 | Deviation (coverage) 1x1 | Deviation (coverage) 1x2 | Deviation (coverage) 1x4 | Deviation (coverage) 1x8 | Deviation (coverage) 1x16 | Deviation (coverage) 1x32 | Deviation (coverage) 1x48 | Deviation (coverage) 1x96 | Deviation (walltime) 1x1 | Deviation (walltime) 1x2 | Deviation (walltime) 1x4 | Deviation (walltime) 1x8 | Deviation (walltime) 1x16 | Deviation (walltime) 1x32 | Deviation (walltime) 1x48 | Deviation (walltime) 1x96 | Categories 1x1 | Categories 1x2 | Categories 1x4 | Categories 1x8 | Categories 1x16 | Categories 1x32 | Categories 1x48 | Categories 1x96 | GFLOPS 1x1 | GFLOPS 1x2 | GFLOPS 1x4 | GFLOPS 1x8 | GFLOPS 1x16 | GFLOPS 1x32 | GFLOPS 1x48 | GFLOPS 1x96 | Compilation Options | (1x1) Efficiency | (1x1) Potential Speed-Up (%) | (1x2) Efficiency | (1x2) Potential Speed-Up (%) | (1x4) Efficiency | (1x4) Potential Speed-Up (%) | (1x8) Efficiency | (1x8) Potential Speed-Up (%) | (1x16) Efficiency | (1x16) Potential Speed-Up (%) | (1x32) Efficiency | (1x32) Potential Speed-Up (%) | (1x48) Efficiency | (1x48) Potential Speed-Up (%) | (1x96) Efficiency | (1x96) Potential Speed-Up (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
►main.extracted.65 | exec | 27.12 | 27.1 | 25.41 | 25.45 | 25.6 | 26.81 | 27.34 | 28.45 | 120.33 | 60.27 | 32.03 | 16 | 8.36 | 5.02 | 4.33 | 4.3 | 120.33 | 60.29 | 31.29 | 15.91 | 8.47 | 5.36 | 4.72 | 4.65 | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 96 | 0.00 | 0.24 | 0.99 | 0.78 | 0.79 | 0.53 | 0.45 | 0.36 | 0.00 | 0.25 | 1.12 | 0.39 | 0.20 | 0.06 | 0.03 | 0.04 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 1.43 | 2.85 | 5.50 | 10.81 | 20.31 | 32.09 | 36.45 | 37.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) --intel -O3 -D NTIMES=100 -D STREAM_ARRAY_SIZE=860160000 -O3 -x SAPPHIRERAPIDS -mprefer-vector-width=512 -flto=full -g -fno-omit-frame-pointer -fcf-protection=none -nopie -ffreest... | 1 | 0 | 1 | 0.06 | 0.96 | 0.98 | 0.95 | 1.39 | 0.89 | 2.87 | 0.7 | 8 | 0.53 | 12.82 | 0.27 | 20.78 |
○Loop 21 - stream.c:354-356 - exec | 27.12 | 27.1 | 25.41 | 25.45 | 25.6 | 26.81 | 27.34 | 28.45 | 120.33 | 60.27 | 32.03 | 16 | 8.36 | 5.02 | 4.33 | 4.3 | 120.33 | 60.29 | 31.29 | 15.91 | 8.47 | 5.36 | 4.72 | 4.65 | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 96 | 0.00 | 0.24 | 0.99 | 0.78 | 0.79 | 0.53 | 0.45 | 0.36 | 0.00 | 0.25 | 1.12 | 0.39 | 0.20 | 0.06 | 0.03 | 0.04 | 1.43 | 2.85 | 5.50 | 10.81 | 20.31 | 32.09 | 36.45 | 37.00 | 1 | 0 | 1 | 0.06 | 0.96 | 0.98 | 0.95 | 1.39 | 0.89 | 2.87 | 0.7 | 8 | 0.53 | 12.82 | 0.27 | 20.78 | ||||||||||
►main.extracted.60 | exec | 27.08 | 27.02 | 25.37 | 25.41 | 25.54 | 26.45 | 27.32 | 28.63 | 120.18 | 60.06 | 31.98 | 16.02 | 8.28 | 4.96 | 4.35 | 4.3 | 120.17 | 60.12 | 31.24 | 15.89 | 8.46 | 5.28 | 4.71 | 4.68 | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 96 | 0.00 | 0.21 | 0.99 | 0.78 | 0.75 | 0.55 | 0.44 | 0.34 | 0.00 | 0.19 | 1.11 | 0.39 | 0.18 | 0.07 | 0.03 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.72 | 1.43 | 2.75 | 5.42 | 10.18 | 16.29 | 18.30 | 18.38 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) --intel -O3 -D NTIMES=100 -D STREAM_ARRAY_SIZE=860160000 -O3 -x SAPPHIRERAPIDS -mprefer-vector-width=512 -flto=full -g -fno-omit-frame-pointer -fcf-protection=none -nopie -ffreest... | 1 | 0 | 1 | 0.02 | 0.96 | 0.97 | 0.95 | 1.39 | 0.89 | 2.87 | 0.71 | 7.64 | 0.53 | 12.8 | 0.27 | 20.97 |
○Loop 20 - stream.c:342-344 - exec | 27.08 | 27.02 | 25.37 | 25.41 | 25.54 | 26.45 | 27.32 | 28.63 | 120.18 | 60.06 | 31.98 | 16.02 | 8.28 | 4.96 | 4.35 | 4.3 | 120.17 | 60.12 | 31.24 | 15.89 | 8.46 | 5.28 | 4.71 | 4.68 | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 96 | 0.00 | 0.21 | 0.99 | 0.78 | 0.75 | 0.55 | 0.44 | 0.34 | 0.00 | 0.19 | 1.11 | 0.39 | 0.18 | 0.07 | 0.03 | 0.03 | 0.72 | 1.43 | 2.75 | 5.42 | 10.18 | 16.29 | 18.30 | 18.38 | 1 | 0 | 1 | 0.02 | 0.96 | 0.97 | 0.95 | 1.39 | 0.89 | 2.87 | 0.71 | 7.64 | 0.53 | 12.8 | 0.27 | 20.97 | ||||||||||
►main.extracted.50 | exec | 22.48 | 22.34 | 21.81 | 21.81 | 21.53 | 20.96 | 20.73 | 19.72 | 99.73 | 49.56 | 28.45 | 14.19 | 7.24 | 4.06 | 3.35 | 2.96 | 99.73 | 49.7 | 26.85 | 13.63 | 7.13 | 4.19 | 3.58 | 3.23 | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 96 | 0.00 | 0.09 | 1.78 | 1.24 | 1.15 | 0.88 | 0.57 | 0.25 | 0.00 | 0.04 | 2.09 | 0.69 | 0.33 | 0.14 | 0.07 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) --intel -O3 -D NTIMES=100 -D STREAM_ARRAY_SIZE=860160000 -O3 -x SAPPHIRERAPIDS -mprefer-vector-width=512 -flto=full -g -fno-omit-frame-pointer -fcf-protection=none -nopie -ffreest... | 1 | 0 | 1 | 0 | 0.93 | 1.56 | 0.91 | 1.86 | 0.87 | 2.71 | 0.74 | 5.37 | 0.58 | 8.7 | 0.32 | 13.38 |
○Loop 18 - stream.c:318-320 - exec | 22.48 | 22.34 | 21.81 | 21.81 | 21.53 | 20.96 | 20.73 | 19.72 | 99.73 | 49.56 | 28.45 | 14.19 | 7.24 | 4.06 | 3.35 | 2.96 | 99.73 | 49.7 | 26.85 | 13.63 | 7.13 | 4.19 | 3.58 | 3.23 | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 96 | 0.00 | 0.09 | 1.78 | 1.24 | 1.15 | 0.88 | 0.57 | 0.25 | 0.00 | 0.04 | 2.09 | 0.69 | 0.33 | 0.14 | 0.07 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 0.93 | 1.56 | 0.91 | 1.86 | 0.87 | 2.71 | 0.74 | 5.37 | 0.58 | 8.7 | 0.32 | 13.38 | ||||||||||
►main.extracted.55 | exec | 22.22 | 22.1 | 21.54 | 21.6 | 21.25 | 20.52 | 20.45 | 19.76 | 98.6 | 49.04 | 28.08 | 14 | 7.15 | 3.97 | 3.3 | 2.98 | 98.6 | 49.17 | 26.52 | 13.5 | 7.03 | 4.1 | 3.53 | 3.23 | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 96 | 0.00 | 0.08 | 1.73 | 1.21 | 1.15 | 0.83 | 0.52 | 0.27 | 0.00 | 0.05 | 2.04 | 0.67 | 0.33 | 0.13 | 0.07 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.87 | 1.75 | 3.24 | 6.37 | 12.23 | 20.98 | 24.31 | 26.63 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) --intel -O3 -D NTIMES=100 -D STREAM_ARRAY_SIZE=860160000 -O3 -x SAPPHIRERAPIDS -mprefer-vector-width=512 -flto=full -g -fno-omit-frame-pointer -fcf-protection=none -nopie -ffreest... | 1 | 0 | 1 | 0 | 0.93 | 1.52 | 0.91 | 1.88 | 0.88 | 2.62 | 0.75 | 5.1 | 0.58 | 8.55 | 0.32 | 13.48 |
○Loop 19 - stream.c:330-332 - exec | 22.22 | 22.1 | 21.54 | 21.6 | 21.25 | 20.52 | 20.45 | 19.76 | 98.6 | 49.04 | 28.08 | 14 | 7.15 | 3.97 | 3.3 | 2.98 | 98.6 | 49.17 | 26.52 | 13.5 | 7.03 | 4.1 | 3.53 | 3.23 | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 96 | 0.00 | 0.08 | 1.73 | 1.21 | 1.15 | 0.83 | 0.52 | 0.27 | 0.00 | 0.05 | 2.04 | 0.67 | 0.33 | 0.13 | 0.07 | 0.02 | 0.87 | 1.75 | 3.24 | 6.37 | 12.23 | 20.98 | 24.31 | 26.63 | 1 | 0 | 1 | 0 | 0.93 | 1.52 | 0.91 | 1.88 | 0.88 | 2.62 | 0.75 | 5.1 | 0.58 | 8.55 | 0.32 | 13.48 | ||||||||||
►main.extracted.40 | exec | 0.69 | 0.69 | 0.67 | 0.68 | 0.66 | 0.58 | 0.48 | 0.37 | 3.06 | 1.54 | 0.87 | 0.43 | 0.22 | 0.12 | 0.09 | 0.06 | 3.06 | 1.54 | 0.82 | 0.43 | 0.22 | 0.12 | 0.08 | 0.06 | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 96 | 0.00 | 0.00 | 0.05 | 0.04 | 0.04 | 0.04 | 0.03 | 0.02 | 0.00 | 0.01 | 0.06 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) --intel -O3 -D NTIMES=100 -D STREAM_ARRAY_SIZE=860160000 -O3 -x SAPPHIRERAPIDS -mprefer-vector-width=512 -flto=full -g -fno-omit-frame-pointer -fcf-protection=none -nopie -ffreest... | 1 | 0 | 0.99 | 0 | 0.93 | 0.04 | 0.89 | 0.08 | 0.87 | 0.09 | 0.8 | 0.12 | 0.8 | 0.1 | 0.53 | 0.17 |
○Loop 16 - stream.c:271-275 - exec | 0.69 | 0.69 | 0.67 | 0.68 | 0.66 | 0.58 | 0.48 | 0.37 | 3.06 | 1.54 | 0.87 | 0.43 | 0.22 | 0.12 | 0.09 | 0.06 | 3.06 | 1.54 | 0.82 | 0.43 | 0.22 | 0.12 | 0.08 | 0.06 | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 96 | 0.00 | 0.00 | 0.05 | 0.04 | 0.04 | 0.04 | 0.03 | 0.02 | 0.00 | 0.01 | 0.06 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.99 | 0 | 0.93 | 0.04 | 0.89 | 0.08 | 0.87 | 0.09 | 0.8 | 0.12 | 0.8 | 0.1 | 0.53 | 0.17 | ||||||||||
►main | exec | 0.27 | 0.38 | 0.36 | 0.38 | 0.37 | 0.32 | 0.24 | 0.13 | 1.21 | 1.66 | 1.78 | 1.87 | 1.86 | 1.86 | 1.84 | 1.85 | 1.21 | 0.84 | 0.45 | 0.24 | 0.12 | 0.06 | 0.04 | 0.02 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 4.26 | 6.14 | 11.47 | 21.50 | 43.00 | 86.00 | 129.00 | 258.01 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) --intel -O3 -D NTIMES=100 -D STREAM_ARRAY_SIZE=860160000 -O3 -x SAPPHIRERAPIDS -mprefer-vector-width=512 -flto=full -g -fno-omit-frame-pointer -fcf-protection=none -nopie -ffreest... | 1 | 0 | 0.72 | 0.11 | 0.67 | 0.12 | 0.63 | 0.14 | 0.63 | 0.14 | 0.63 | 0.12 | 0.63 | 0.09 | 0.63 | 0.05 |
○Loop 12 - stream.c:476-479 - exec | 0.27 | 0.38 | 0.36 | 0.38 | 0.37 | 0.32 | 0.24 | 0.13 | 1.21 | 1.66 | 1.78 | 1.87 | 1.86 | 1.86 | 1.84 | 1.85 | 1.21 | 0.84 | 0.45 | 0.24 | 0.12 | 0.06 | 0.04 | 0.02 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 4.26 | 6.14 | 11.47 | 21.50 | 43.00 | 86.00 | 129.00 | 258.01 | 1 | 0 | 0.72 | 0.11 | 0.67 | 0.12 | 0.63 | 0.14 | 0.63 | 0.14 | 0.63 | 0.12 | 0.63 | 0.09 | 0.63 | 0.05 | ||||||||||
○Loop 6 - stream.c:366-405 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
○Loop 11 - stream.c:384-469 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
○Loop 10 - stream.c:366-405 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
○Loop 5 - stream.c:366-405 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
○Loop 15 - stream.c:405-504 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
○Loop 14 - stream.c:405-523 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
○Loop 3 - stream.c:366-405 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
►Loop 1 - stream.c:405-441 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
○Loop 0 - stream.c:407-441 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
○Loop 8 - stream.c:366-405 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
○Loop 9 - stream.c:366-405 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
○Loop 2 - stream.c:306-441 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
○Loop 7 - stream.c:366-405 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
○Loop 4 - stream.c:366-405 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
○Loop 13 - stream.c:405-542 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
►main.extracted.45 | exec | 0.09 | 0.09 | 0.08 | 0.08 | 0.11 | 0.16 | 0.18 | 0.19 | 0.41 | 0.21 | 0.11 | 0.05 | 0.04 | 0.03 | 0.03 | 0.03 | 0.41 | 0.21 | 0.1 | 0.05 | 0.04 | 0.03 | 0.03 | 0.03 | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 96 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 2.10 | 4.08 | 8.48 | 16.64 | 20.80 | 25.60 | 25.60 | 25.60 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) --intel -O3 -D NTIMES=100 -D STREAM_ARRAY_SIZE=860160000 -O3 -x SAPPHIRERAPIDS -mprefer-vector-width=512 -flto=full -g -fno-omit-frame-pointer -fcf-protection=none -nopie -ffreest... | 1 | 0 | 0.98 | 0 | 1.02 | -0 | 1.02 | -0 | 0.64 | 0.04 | 0.43 | 0.09 | 0.28 | 0.13 | 0.14 | 0.16 |
○Loop 17 - stream.c:290-292 - exec | 0.09 | 0.09 | 0.08 | 0.08 | 0.11 | 0.16 | 0.18 | 0.19 | 0.41 | 0.21 | 0.11 | 0.05 | 0.04 | 0.03 | 0.03 | 0.03 | 0.41 | 0.21 | 0.1 | 0.05 | 0.04 | 0.03 | 0.03 | 0.03 | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 96 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2.10 | 4.08 | 8.48 | 16.64 | 20.80 | 25.60 | 25.60 | 25.60 | 1 | 0 | 0.98 | 0 | 1.02 | -0 | 1.02 | -0 | 0.64 | 0.04 | 0.43 | 0.09 | 0.28 | 0.13 | 0.14 | 0.16 | ||||||||||
○unknown_kernel_region | kernel | 0.04 | 0.05 | 0.07 | 0.07 | 0.06 | 0.08 | 0.09 | 0.2 | 0.19 | 0.12 | 0.1 | 0.06 | 0.03 | 0.04 | 0.04 | 0.06 | 0.19 | 0.12 | 0.09 | 0.04 | 0.02 | 0.02 | 0.02 | 0.03 | 1 | 2 | 4 | 8 | 16 | 31 | 43 | 96 | 0.00 | 0.00 | 0.01 | 0.03 | 0.03 | 0.05 | 0.07 | 0.08 | 0.00 | 0.01 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 99.83 OMP (%): 0.17 | 0.04 | 0.03 | 0.04 | 0.10 | 0.20 | 0.40 | 0.80 | 0.13 | 1 | 0 | 0.79 | 0.01 | 0.53 | 0.03 | 0.59 | 0.03 | 0.59 | 0.02 | 0.3 | 0.06 | 0.2 | 0.07 | 0.07 | 0.19 | |
○_ZN17_INTERNAL021345c126__kmp_hyper_barrier_gatherE12barrier_typeP8kmp_infoiiPFvPvS3_ES3_..0 | libiomp5.so | 0 | 0.14 | 2.24 | 2.19 | 0.72 | 0.37 | 0.25 | 0.1 | 0 | 0.61 | 10.92 | 5.48 | 2.71 | 1.18 | 0.46 | 0.23 | 0 | 0.31 | 2.75 | 1.37 | 0.24 | 0.07 | 0.04 | 0.02 | 0 | 1 | 1 | 2 | 4 | 6 | 13 | 24 | 0.00 | 0.00 | 0.00 | 0.09 | 3.85 | 2.82 | 0.96 | 0.42 | 0.00 | 0.00 | 0.00 | 0.16 | 1.27 | 0.53 | 0.16 | 0.06 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○__GI___sched_yield | libc.so.6 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 3 | 3 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | NA | NA | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||
○_ZN17_INTERNAL021345c119__kmp_wait_templateI11kmp_flag_64ILb0ELb1EELb1ELb0ELb1EEEbP8kmp_infoPT_Pv | libiomp5.so | 0 | 0.07 | 2.38 | 2.25 | 4.09 | 3.69 | 2.86 | 2.4 | 0 | 0.31 | 10.98 | 3.14 | 3.29 | 1.65 | 0.86 | 0.49 | 0 | 0.16 | 2.93 | 1.4 | 1.35 | 0.74 | 0.49 | 0.39 | 0 | 1 | 3 | 7 | 15 | 31 | 47 | 95 | 0.00 | 0.00 | 5.06 | 2.05 | 2.92 | 2.03 | 1.14 | 0.46 | 0.00 | 0.00 | 6.15 | 1.25 | 0.92 | 0.37 | 0.18 | 0.07 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |