Run 1x1 | Number processes: 1Number nodes: 1Number processes per node: 1Run Command: <executable> MPI Command: mpirun -np <number_processes>Dataset: Run Directory: /scratch_na/users/xoserete/qaas_runs/171-317-5776/intel/HACCmk/run/oneview_runs/compilers/icx_5/oneview_run_1713179458I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threadsOMP_PROC_BIND: spreadOMP_NUM_THREADS: 1 |
---|---|
Run 1x2 | OMP_NUM_THREADS: 2I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threadsOMP_PROC_BIND: spread |
Run 1x4 | OMP_NUM_THREADS: 4I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threadsOMP_PROC_BIND: spread |
Run 1x8 | OMP_NUM_THREADS: 8I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threadsOMP_PROC_BIND: spread |
Run 1x16 | OMP_NUM_THREADS: 16I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threadsOMP_PROC_BIND: spread |
Run 1x32 | OMP_NUM_THREADS: 32I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threadsOMP_PROC_BIND: spread |
Run 1x64 | OMP_NUM_THREADS: 64I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threadsOMP_PROC_BIND: spread |
Run 1x112 | OMP_NUM_THREADS: 112I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threadsOMP_PROC_BIND: spread |
Name | Module | Coverage 1x1 (%) | Coverage 1x2 (%) | Coverage 1x4 (%) | Coverage 1x8 (%) | Coverage 1x16 (%) | Coverage 1x32 (%) | Coverage 1x64 (%) | Coverage 1x112 (%) | Max Time Over Threads 1x1 (s) | Max Time Over Threads 1x2 (s) | Max Time Over Threads 1x4 (s) | Max Time Over Threads 1x8 (s) | Max Time Over Threads 1x16 (s) | Max Time Over Threads 1x32 (s) | Max Time Over Threads 1x64 (s) | Max Time Over Threads 1x112 (s) | Time w.r.t. Wall Time 1x1 (s) | Time w.r.t. Wall Time 1x2 (s) | Time w.r.t. Wall Time 1x4 (s) | Time w.r.t. Wall Time 1x8 (s) | Time w.r.t. Wall Time 1x16 (s) | Time w.r.t. Wall Time 1x32 (s) | Time w.r.t. Wall Time 1x64 (s) | Time w.r.t. Wall Time 1x112 (s) | Nb Threads 1x1 | Nb Threads 1x2 | Nb Threads 1x4 | Nb Threads 1x8 | Nb Threads 1x16 | Nb Threads 1x32 | Nb Threads 1x64 | Nb Threads 1x112 | Deviation (coverage) 1x1 | Deviation (coverage) 1x2 | Deviation (coverage) 1x4 | Deviation (coverage) 1x8 | Deviation (coverage) 1x16 | Deviation (coverage) 1x32 | Deviation (coverage) 1x64 | Deviation (coverage) 1x112 | Deviation (walltime) 1x1 | Deviation (walltime) 1x2 | Deviation (walltime) 1x4 | Deviation (walltime) 1x8 | Deviation (walltime) 1x16 | Deviation (walltime) 1x32 | Deviation (walltime) 1x64 | Deviation (walltime) 1x112 | Categories 1x1 | Categories 1x2 | Categories 1x4 | Categories 1x8 | Categories 1x16 | Categories 1x32 | Categories 1x64 | Categories 1x112 | Compilation Options | (1x1) Efficiency | (1x1) Potential Speed-Up (%) | (1x2) Efficiency | (1x2) Potential Speed-Up (%) | (1x4) Efficiency | (1x4) Potential Speed-Up (%) | (1x8) Efficiency | (1x8) Potential Speed-Up (%) | (1x16) Efficiency | (1x16) Potential Speed-Up (%) | (1x32) Efficiency | (1x32) Potential Speed-Up (%) | (1x64) Efficiency | (1x64) Potential Speed-Up (%) | (1x112) Efficiency | (1x112) Potential Speed-Up (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
►Step10_orig | exec | 99.96 | 99.78 | 99.66 | 99.44 | 98.82 | 97.74 | 95.93 | 92.63 | 1984.05 | 994.31 | 497.37 | 248.96 | 124.79 | 62.5 | 31.35 | 18.13 | 1984.05 | 994.07 | 497.15 | 248.78 | 124.41 | 62.22 | 31.11 | 17.83 | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.04 | 0.05 | 0.04 | 0.27 | 0.44 | 0.74 | 1.43 | 0.00 | 0.39 | 0.37 | 0.13 | 0.36 | 0.29 | 0.24 | 0.27 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --intel -I /scratch_na/users/xoserete/qaas_runs/171-317-5776/intel/HACCmk/build/HACCmk/CoMD/src-openmp -I /scratch_na/users/xoserete/qaas_runs/171-317-5776/intel/HACCmk/build/icx_... | 1 | 0 | 1 | 0.21 | 1 | 0.23 | 1 | 0.31 | 1 | 0.32 | 1 | 0.34 | 1 | 0.34 | 0.99 | 0.6 |
○Loop 5 - Step10_orig.c:19-35 - exec | 99.94 | 99.76 | 99.64 | 99.42 | 98.81 | 97.72 | 95.91 | 92.61 | 1983.63 | 994.09 | 497.27 | 248.88 | 124.77 | 62.48 | 31.35 | 18.13 | 1983.63 | 993.86 | 497.05 | 248.73 | 124.39 | 62.21 | 31.1 | 17.83 | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.04 | 0.05 | 0.04 | 0.27 | 0.44 | 0.74 | 1.43 | 0.00 | 0.37 | 0.36 | 0.13 | 0.36 | 0.29 | 0.24 | 0.27 | 1 | 0 | 1 | 0.21 | 1 | 0.23 | 1 | 0.31 | 1 | 0.33 | 1 | 0.35 | 1 | 0.33 | 0.99 | 0.62 | ||||||||||
○unknown_kernel_region | kernel | 0.02 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.03 | 0.3 | 0.14 | 0.07 | 0.02 | 0.04 | 0.03 | 0.02 | 0.02 | 0.3 | 0.14 | 0.06 | 0.02 | 0.02 | 0.01 | 0.01 | 0 | 1 | 2 | 4 | 8 | 16 | 27 | 46 | 70 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.02 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 98.44 OMP (%): 1.56 | System (%): 100.00 | System (%): 98.88 OMP (%): 1.12 | System (%): 98.18 OMP (%): 1.82 | 1 | 0 | 1.07 | -0 | 1.25 | -0 | 1.88 | 0 | 0.94 | 0 | 0.94 | 0 | 0.47 | 0.01 | 1 | 0 | |
►__intel_avx_rep_memset | exec | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.14 | 0.15 | 0.18 | 0.15 | 0.19 | 0.18 | 0.18 | 0.16 | 0.14 | 0.07 | 0.05 | 0.02 | 0.01 | 0.01 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | 1 | 0 | 1 | 0 | 0.7 | 0 | 0.88 | 0 | 0.88 | 0 | 0.44 | 0.01 | 1 | 0 | 1 | 0 | |
○Loop 7 - - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
►main | exec | 0.01 | 0.03 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.19 | 0.65 | 0.88 | 0.83 | 0.77 | 0.86 | 0.81 | 0.8 | 0.19 | 0.33 | 0.22 | 0.1 | 0.05 | 0.03 | 0.01 | 0.01 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --intel -I /scratch_na/users/xoserete/qaas_runs/171-317-5776/intel/HACCmk/build/HACCmk/CoMD/src-openmp -I /scratch_na/users/xoserete/qaas_runs/171-317-5776/intel/HACCmk/build/icx_... | 1 | 0 | 0.29 | 0.02 | 0.22 | 0.03 | 0.24 | 0.03 | 0.24 | 0.03 | 0.2 | 0.03 | 0.3 | 0.03 | 0.17 | 0.03 |
►Loop 2 - main.c:77-169 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
○Loop 1 - main.c:111-116 - exec | 0.01 | 0.03 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.19 | 0.65 | 0.87 | 0.83 | 0.77 | 0.86 | 0.81 | 0.8 | 0.19 | 0.33 | 0.22 | 0.1 | 0.05 | 0.03 | 0.01 | 0.01 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.29 | 0.02 | 0.22 | 0.03 | 0.24 | 0.03 | 0.24 | 0.03 | 0.2 | 0.03 | 0.3 | 0.03 | 0.17 | 0.03 | ||||||||||
○Loop 3 - main.c:111-116 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
○Loop 0 - main.c:111-116 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||||||||
○kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libiomp5.so | 0 | 0.16 | 0.26 | 0.49 | 1.08 | 2.13 | 3.9 | 7.12 | 0 | 1.67 | 1.65 | 1.53 | 1.98 | 1.94 | 1.91 | 1.99 | 0 | 1.56 | 1.32 | 1.21 | 1.36 | 1.36 | 1.26 | 1.37 | 0 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.02 | 0.07 | 0.13 | 0.35 | 0.53 | 0.85 | 1.54 | 0.00 | 0.16 | 0.33 | 0.33 | 0.44 | 0.34 | 0.27 | 0.30 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |