Run | Configuration |
---|---|
Run 1x8 | Number processes: 1; Number nodes: 1; Run Command: <executable>; MPI Command: mpirun -n <number_processes>; Dataset: (empty); Run Directory: /home/eoseret/qaas_runs_CPU_9468/172-289-8348/intel/HACCmk/run/oneview_runs/multicore/icx_3/oneview_run_1722906572; OMP_PROC_BIND: spread; OMP_NUM_THREADS: 8; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads |
Run 1x16 | Number processes: 1; OMP_NUM_THREADS: 16; OMP_PROC_BIND: spread; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads |
Run 1x32 | Number processes: 1; OMP_NUM_THREADS: 32; OMP_PROC_BIND: spread; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads |
Run 1x64 | Number processes: 1; OMP_NUM_THREADS: 64; OMP_PROC_BIND: spread; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads |
Run 1x128 | Number processes: 1; OMP_NUM_THREADS: 128; OMP_PROC_BIND: spread; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads |
Run 1x192 | Number processes: 1; OMP_NUM_THREADS: 192; OMP_PROC_BIND: spread; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads |

Name | Module | Exclusive Coverage 1x8 (%) | Exclusive Coverage 1x16 (%) | Exclusive Coverage 1x32 (%) | Exclusive Coverage 1x64 (%) | Exclusive Coverage 1x128 (%) | Exclusive Coverage 1x192 (%) | Inclusive Coverage 1x8 (%) | Inclusive Coverage 1x16 (%) | Inclusive Coverage 1x32 (%) | Inclusive Coverage 1x64 (%) | Inclusive Coverage 1x128 (%) | Inclusive Coverage 1x192 (%) | Max Exclusive Time Over Threads 1x8 (s) | Max Exclusive Time Over Threads 1x16 (s) | Max Exclusive Time Over Threads 1x32 (s) | Max Exclusive Time Over Threads 1x64 (s) | Max Exclusive Time Over Threads 1x128 (s) | Max Exclusive Time Over Threads 1x192 (s) | Max Inclusive Time Over Threads 1x8 (s) | Max Inclusive Time Over Threads 1x16 (s) | Max Inclusive Time Over Threads 1x32 (s) | Max Inclusive Time Over Threads 1x64 (s) | Max Inclusive Time Over Threads 1x128 (s) | Max Inclusive Time Over Threads 1x192 (s) | Exclusive Time w.r.t. Wall Time 1x8 (s) | Exclusive Time w.r.t. Wall Time 1x16 (s) | Exclusive Time w.r.t. Wall Time 1x32 (s) | Exclusive Time w.r.t. Wall Time 1x64 (s) | Exclusive Time w.r.t. Wall Time 1x128 (s) | Exclusive Time w.r.t. Wall Time 1x192 (s) | Inclusive Time w.r.t. Wall Time 1x8 (s) | Inclusive Time w.r.t. Wall Time 1x16 (s) | Inclusive Time w.r.t. Wall Time 1x32 (s) | Inclusive Time w.r.t. Wall Time 1x64 (s) | Inclusive Time w.r.t. Wall Time 1x128 (s) | Inclusive Time w.r.t. Wall Time 1x192 (s) | Nb Threads 1x8 | Nb Threads 1x16 | Nb Threads 1x32 | Nb Threads 1x64 | Nb Threads 1x128 | Nb Threads 1x192 | Deviation (coverage) 1x8 | Deviation (coverage) 1x16 | Deviation (coverage) 1x32 | Deviation (coverage) 1x64 | Deviation (coverage) 1x128 | Deviation (coverage) 1x192 | Deviation (walltime) 1x8 | Deviation (walltime) 1x16 | Deviation (walltime) 1x32 | Deviation (walltime) 1x64 | Deviation (walltime) 1x128 | Deviation (walltime) 1x192 | Categories 1x8 | Categories 1x16 | Categories 1x32 | Categories 1x64 | Categories 1x128 | Categories 1x192 | GFLOPS 1x8 | GFLOPS 1x16 | GFLOPS 1x32 | GFLOPS 1x64 | GFLOPS 1x128 | GFLOPS 1x192 | Compilation Options | (1x8) Efficiency | (1x8) Potential Speed-Up (%) | (1x16) Efficiency | (1x16) Potential Speed-Up (%) | (1x32) Efficiency | (1x32) Potential Speed-Up (%) | (1x64) Efficiency | (1x64) Potential Speed-Up (%) | (1x128) Efficiency | (1x128) Potential Speed-Up (%) | (1x192) Efficiency | (1x192) Potential Speed-Up (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
►Step10_orig | exec | 0.04 | 0.04 | 0.04 | 0.03 | 0.02 | 0.02 | 97.09 | 94.73 | 88.97 | 80.05 | 67.71 | 55.94 | 0.07 | 0.04 | 0.03 | 0.02 | 0.01 | 0.01 | 93.55 | 46.78 | 24.06 | 12.47 | 6.92 | 4.98 | 0.04 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 93.53 | 46.73 | 23.82 | 12.25 | 6.66 | 4.40 | 8 | 16 | 32 | 64 | 128 | 192 | 0.14 | 0.29 | 0.90 | 1.46 | 3.25 | 3.44 | 0.11 | 0.11 | 0.24 | 0.23 | 0.32 | 0.28 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 46.10 | 92.27 | 181.01 | 351.86 | 647.32 | 979.72 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) /cluster/intel/oneapi/2024.0.0/compiler/2024.0/bin/compiler/clang --intel -I /home/eoseret/qaas_runs_CPU_9468/172-289-8348/intel/HACCmk/build/HACCmk/CoMD/src-openmp -I /home/eosere... | 1 | 0 | 1 | -0 | 0.98 | 0 | 0.95 | 0 | 0.88 | 0 | 0.89 | 0 |
○Loop 5 - Step10_orig.c:19-35 - exec |  | 97.04 | 94.69 | 88.94 | 80.03 | 67.68 | 55.92 | 97.04 | 94.69 | 88.94 | 80.03 | 67.68 | 55.92 | 93.50 | 46.76 | 24.05 | 12.47 | 6.92 | 4.98 | 93.50 | 46.76 | 24.05 | 12.47 | 6.92 | 4.98 | 93.49 | 46.71 | 23.81 | 12.25 | 6.66 | 4.40 | 93.49 | 46.71 | 23.81 | 12.25 | 6.66 | 4.40 | 8 | 16 | 32 | 64 | 128 | 192 | 0.14 | 0.29 | 0.89 | 1.45 | 3.25 | 3.44 | 0.11 | 0.11 | 0.24 | 0.23 | 0.32 | 0.28 |  |  |  |  |  |  | 46.11 | 92.29 | 181.03 | 351.88 | 647.33 | 979.82 |  | 1 | 0 | 1 | 0 | 0.98 | 1.63 | 0.95 | 3.68 | 0.88 | 8.28 | 0.89 | 6.4 |
○bool _INTERNAL021345c1::__kmp_wait_template<kmp_flag_64<false, true>, true, false, true>(kmp_info*, kmp_flag_64<false, true>*, void*) | libiomp5.so | 2.39 | 4.62 | 10.09 | 18.70 | 30.55 | 42.07 | 2.39 | 4.62 | 10.09 | 18.70 | 30.55 | 42.07 | 2.74 | 2.55 | 3.13 | 3.29 | 3.71 | 3.75 | 2.74 | 2.55 | 3.13 | 3.29 | 3.71 | 3.75 | 2.30 | 2.28 | 2.70 | 2.86 | 3.00 | 3.31 | 2.30 | 2.28 | 2.70 | 2.86 | 3.00 | 3.31 | 7 | 15 | 31 | 63 | 127 | 191 | 0.11 | 0.18 | 0.72 | 1.33 | 2.97 | 3.33 | 0.11 | 0.09 | 0.19 | 0.20 | 0.28 | 0.25 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |  | 1 | 0 | 0.5 | 0 | 0.21 | 0 | 0.1 | 0 | 0.05 | 0 | 0.03 | 0 |
►main | exec | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.34 | 0.34 | 0.33 | 0.27 | 0.23 | 0.19 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2.59 | 2.69 | 2.78 | 2.64 | 2.84 | 2.81 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.32 | 0.17 | 0.09 | 0.04 | 0.02 | 0.01 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 1.55 | 2.77 | 5.31 | 12.12 | 21.04 | 30.60 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) /cluster/intel/oneapi/2024.0.0/compiler/2024.0/bin/compiler/clang --intel -I /home/eoseret/qaas_runs_CPU_9468/172-289-8348/intel/HACCmk/build/HACCmk/CoMD/src-openmp -I /home/eosere... | 1 | 0 | 0.96 | 0 | 0.93 | 0 | 0.97 | 0 | 0.9 | 0 | 0.9 | 0 |
►Loop 2 - main.c:77-169 - exec [...] |  | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.34 | 0.34 | 0.33 | 0.27 | 0.23 | 0.19 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2.59 | 2.69 | 2.78 | 2.64 | 2.84 | 2.81 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.32 | 0.17 | 0.09 | 0.04 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |  |  |  |  |  |  | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |  |  |  |  |  |  |  |  |  |  |  |  |  |
○Loop 0 - main.c:111-116 - exec |  | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |  |  |  |  |  |  | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |  |  |  |  |  |  |  |  |  |  |  |  |  |
○Loop 3 - main.c:111-116 - exec |  | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 1 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |  |  |  |  |  |  | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |  |  |  |  |  |  |  |  |  |  |  |  |  |
○Loop 1 - main.c:111-116 - exec |  | 0.34 | 0.34 | 0.33 | 0.27 | 0.23 | 0.19 | 0.34 | 0.34 | 0.33 | 0.27 | 0.23 | 0.19 | 2.59 | 2.69 | 2.78 | 2.64 | 2.83 | 2.81 | 2.59 | 2.69 | 2.78 | 2.64 | 2.83 | 2.81 | 0.32 | 0.17 | 0.09 | 0.04 | 0.02 | 0.01 | 0.32 | 0.17 | 0.09 | 0.04 | 0.02 | 0.01 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |  |  |  |  |  |  | 1.55 | 2.77 | 5.31 | 12.12 | 21.07 | 30.51 |  | 1 | 0 | 0.96 | 0.01 | 0.93 | 0.02 | 0.97 | 0.01 | 0.9 | 0.02 | 0.9 | 0.02 |
○kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libiomp5.so | 0.06 | 0.11 | 0.24 | 0.43 | 0.71 | 0.96 | 0.06 | 0.11 | 0.24 | 0.43 | 0.71 | 0.96 | 0.11 | 0.08 | 0.10 | 0.11 | 0.12 | 0.15 | 0.11 | 0.08 | 0.10 | 0.11 | 0.12 | 0.15 | 0.06 | 0.05 | 0.06 | 0.07 | 0.07 | 0.08 | 0.06 | 0.05 | 0.06 | 0.07 | 0.07 | 0.08 | 8 | 16 | 32 | 64 | 128 | 192 | 0.03 | 0.04 | 0.07 | 0.13 | 0.19 | 0.26 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |  | 1 | 0 | 0.51 | 0 | 0.22 | 0 | 0.1 | 0 | 0.05 | 0 | 0.03 | 0 |
○_ZN17_INTERNAL021345c126__kmp_hyper_barrier_gatherE12barrier_typeP8kmp_infoiiPFvPvS3_ES3_..0 | libiomp5.so | 0.05 | 0.07 | 0.19 | 0.24 | 0.38 | 0.27 | 0.05 | 0.07 | 0.19 | 0.24 | 0.38 | 0.27 | 0.31 | 0.18 | 0.72 | 0.67 | 0.95 | 0.94 | 0.31 | 0.18 | 0.72 | 0.67 | 0.95 | 0.94 | 0.05 | 0.04 | 0.05 | 0.04 | 0.04 | 0.02 | 0.05 | 0.04 | 0.05 | 0.04 | 0.04 | 0.02 | 2 | 5 | 14 | 24 | 36 | 44 | 0.16 | 0.14 | 0.73 | 1.06 | 2.32 | 2.53 | 0.16 | 0.07 | 0.20 | 0.16 | 0.23 | 0.20 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 |  | 1 | 0 | 0.71 | 0 | 0.24 | 0 | 0.17 | 0 | 0.08 | 0 | 0.1 | 0 |
►main.extracted.8 | exec | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.03 | 0.04 | 0.04 | 0.06 | 0.06 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.05 | 0.05 | 0.02 | 0.02 | 0.02 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 8 | 16 | 32 | 64 | 128 | 188 | 0.01 | 0.02 | 0.02 | 0.04 | 0.06 | 0.08 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 19.13 | 30.78 | 78.77 | 81.34 | 153.25 | 150.85 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) /cluster/intel/oneapi/2024.0.0/compiler/2024.0/bin/compiler/clang --intel -I /home/eoseret/qaas_runs_CPU_9468/172-289-8348/intel/HACCmk/build/HACCmk/CoMD/src-openmp -I /home/eosere... | 1 | 0 | 0.79 | 0 | 0.77 | 0 | 0.44 | 0 | 0.33 | 0 | 0.21 | 0 |
○Loop 4 - main.c:139-146 - exec |  | 0.03 | 0.04 | 0.04 | 0.06 | 0.06 | 0.08 | 0.03 | 0.04 | 0.04 | 0.06 | 0.06 | 0.08 | 0.05 | 0.05 | 0.02 | 0.02 | 0.02 | 0.03 | 0.05 | 0.05 | 0.02 | 0.02 | 0.02 | 0.03 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 8 | 16 | 32 | 64 | 128 | 188 | 0.01 | 0.02 | 0.02 | 0.04 | 0.06 | 0.08 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 |  |  |  |  |  |  | 19.13 | 30.78 | 78.53 | 81.50 | 152.41 | 152.17 |  | 1 | 0 | 0.79 | 0.01 | 0.77 | 0.01 | 0.45 | 0.03 | 0.33 | 0.04 | 0.21 | 0.06 |
○__GI___sched_yield | libc.so.6 | 0.02 | 0.05 | 0.09 | 0.17 | 0.27 | 0.37 | 0.02 | 0.05 | 0.09 | 0.17 | 0.27 | 0.37 | 0.04 | 0.05 | 0.04 | 0.05 | 0.06 | 0.07 | 0.04 | 0.05 | 0.04 | 0.05 | 0.06 | 0.07 | 0.02 | 0.02 | 0.02 | 0.03 | 0.03 | 0.03 | 0.02 | 0.02 | 0.02 | 0.03 | 0.03 | 0.03 | 8 | 15 | 32 | 63 | 125 | 190 | 0.01 | 0.03 | 0.04 | 0.08 | 0.13 | 0.16 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |  | 1 | 0 | 0.36 | 0 | 0.18 | 0 | 0.09 | 0 | 0.04 | 0 | 0.03 | 0 |
►__intel_avx_rep_memset | exec | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.09 | 0.13 | 0.10 | 0.11 | 0.12 | 0.09 | 0.09 | 0.13 | 0.10 | 0.11 | 0.12 | 0.09 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | 8.32 | 10.20 | 35.21 | 62.66 | 90.32 | 175.88 |  | 1 | 0 | 0.67 | 0 | 0.85 | 0 | 0.81 | 0 | 0.71 | 0 | 0.93 | 0 |
○Loop 9 - - exec |  | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |  |  |  |  |  |  | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |  |  |  |  |  |  |  |  |  |  |  |  |  |
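
The Efficiency columns appear consistent with a strong-scaling efficiency measured against the 1x8 run, i.e. (reference wall time × reference threads) / (wall time × threads). The report does not state the tool's exact definition, so treat the formula as an assumption. The sketch below applies it to the "Exclusive Time w.r.t. Wall Time" values of Loop 5 (Step10_orig.c:19-35); the function name `parallel_efficiency` is illustrative only.

```python
# Thread counts and "Exclusive Time w.r.t. Wall Time" (s) for
# "Loop 5 - Step10_orig.c:19-35", copied from the table above.
threads = [8, 16, 32, 64, 128, 192]
wall_s = [93.49, 46.71, 23.81, 12.25, 6.66, 4.40]


def parallel_efficiency(threads, wall_s):
    """Strong-scaling efficiency relative to the first (reference) run.

    Assumed definition: (T_ref * n_ref) / (T_n * n).
    A value of 1.0 means the time shrank in proportion to the added threads.
    """
    ref_n, ref_t = threads[0], wall_s[0]
    return [(ref_t * ref_n) / (t * n) for n, t in zip(threads, wall_s)]


eff = parallel_efficiency(threads, wall_s)
for n, e in zip(threads, eff):
    print(f"1x{n}: efficiency = {e:.2f}")
```

Rounded to two decimals, the results (1.00, 1.00, 0.98, 0.95, 0.88, 0.89) agree with the Efficiency values reported for this loop, which supports the assumed definition for this row at least.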