Run 1x8 | Number processes: 1Number nodes: 1Run Command: <executable> MPI Command: mpirun -n <number_processes>Dataset: Run Directory: /home/eoseret/qaas_runs_CPU_9468/172-289-8348/intel/HACCmk/run/oneview_runs/multicore/icx_3/oneview_run_1722906572OMP_PROC_BIND: spreadOMP_NUM_THREADS: 8I_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threads |
---|---|
Run 1x16 | Number processes: 1OMP_NUM_THREADS: 16OMP_PROC_BIND: spreadI_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threads |
Run 1x32 | Number processes: 1OMP_NUM_THREADS: 32OMP_PROC_BIND: spreadI_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threads |
Run 1x64 | Number processes: 1OMP_NUM_THREADS: 64OMP_PROC_BIND: spreadI_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threads |
Run 1x128 | Number processes: 1OMP_NUM_THREADS: 128OMP_PROC_BIND: spreadI_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threads |
Run 1x192 | Number processes: 1OMP_NUM_THREADS: 192OMP_PROC_BIND: spreadI_MPI_PIN_DOMAIN: auto:scatterOMP_PLACES: threads |
Loop id | Source Location | Source Function | Level | Exclusive coverage 1x8 (%) | Exclusive coverage 1x16 (%) | Exclusive coverage 1x32 (%) | Exclusive coverage 1x64 (%) | Exclusive coverage 1x128 (%) | Exclusive coverage 1x192 (%) | Inclusive coverage 1x8 (%) | Inclusive coverage 1x16 (%) | Inclusive coverage 1x32 (%) | Inclusive coverage 1x64 (%) | Inclusive coverage 1x128 (%) | Inclusive coverage 1x192 (%) | Max Exclusive Time Over Threads 1x8 (s) | Max Exclusive Time Over Threads 1x16 (s) | Max Exclusive Time Over Threads 1x32 (s) | Max Exclusive Time Over Threads 1x64 (s) | Max Exclusive Time Over Threads 1x128 (s) | Max Exclusive Time Over Threads 1x192 (s) | Max Inclusive Time Over Threads 1x8 (s) | Max Inclusive Time Over Threads 1x16 (s) | Max Inclusive Time Over Threads 1x32 (s) | Max Inclusive Time Over Threads 1x64 (s) | Max Inclusive Time Over Threads 1x128 (s) | Max Inclusive Time Over Threads 1x192 (s) | Exclusive Time w.r.t. Wall Time 1x8 (s) | Exclusive Time w.r.t. Wall Time 1x16 (s) | Exclusive Time w.r.t. Wall Time 1x32 (s) | Exclusive Time w.r.t. Wall Time 1x64 (s) | Exclusive Time w.r.t. Wall Time 1x128 (s) | Exclusive Time w.r.t. Wall Time 1x192 (s) | Inclusive Time w.r.t. Wall Time 1x8 (s) | Inclusive Time w.r.t. Wall Time 1x16 (s) | Inclusive Time w.r.t. Wall Time 1x32 (s) | Inclusive Time w.r.t. Wall Time 1x64 (s) | Inclusive Time w.r.t. Wall Time 1x128 (s) | Inclusive Time w.r.t. Wall Time 1x192 (s) | Nb Threads 1x8 | Nb Threads 1x16 | Nb Threads 1x32 | Nb Threads 1x64 | Nb Threads 1x128 | Nb Threads 1x192 | GFLOPS 1x8 | GFLOPS 1x16 | GFLOPS 1x32 | GFLOPS 1x64 | GFLOPS 1x128 | GFLOPS 1x192 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing 1x8 | Speedup If Perfect Load Balancing 1x16 | Speedup If Perfect Load Balancing 1x32 | Speedup If Perfect Load Balancing 1x64 | Speedup If Perfect Load Balancing 1x128 | Speedup If Perfect Load Balancing 1x192 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | (1x8) Efficiency | (1x8) Potential Speed-Up (%) | (1x16) Efficiency | (1x16) Potential Speed-Up (%) | (1x32) Efficiency | (1x32) Potential Speed-Up (%) | (1x64) Efficiency | (1x64) Potential Speed-Up (%) | (1x128) Efficiency | (1x128) Potential Speed-Up (%) | (1x192) Efficiency | (1x192) Potential Speed-Up (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
5 | exec - Step10_orig.c:19-35 | Step10_orig | Single | 97.04 | 94.69 | 88.94 | 80.03 | 67.68 | 55.92 | 97.04 | 94.69 | 88.94 | 80.03 | 67.68 | 55.92 | 93.50 | 46.76 | 24.05 | 12.47 | 6.92 | 4.98 | 93.50 | 46.76 | 24.05 | 12.47 | 6.92 | 4.98 | 93.49 | 46.71 | 23.81 | 12.25 | 6.66 | 4.40 | 93.49 | 46.71 | 23.81 | 12.25 | 6.66 | 4.40 | 8 | 16 | 32 | 64 | 128 | 192 | 46.11 | 92.29 | 181.03 | 351.88 | 647.33 | 979.82 | 100 | 45.63 | 1 | 1 | 1 | 1 | 1 | 1.02 | 1.03 | 1.06 | 1.15 | 0 | 4 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0.98 | 1.63 | 0.95 | 3.68 | 0.88 | 8.28 | 0.89 | 6.4 |
1 | exec - main.c:111-116 | main | Innermost | 0.34 | 0.34 | 0.33 | 0.27 | 0.23 | 0.19 | 0.34 | 0.34 | 0.33 | 0.27 | 0.23 | 0.19 | 2.59 | 2.69 | 2.78 | 2.64 | 2.83 | 2.81 | 2.59 | 2.69 | 2.78 | 2.64 | 2.83 | 2.81 | 0.32 | 0.17 | 0.09 | 0.04 | 0.02 | 0.01 | 0.32 | 0.17 | 0.09 | 0.04 | 0.02 | 0.01 | 1 | 1 | 1 | 1 | 1 | 1 | 1.55 | 2.77 | 5.31 | 12.12 | 21.07 | 30.51 | 92.86 | 46.88 | 1 | 1 | 1.08 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0.96 | 0.01 | 0.93 | 0.02 | 0.97 | 0.01 | 0.9 | 0.02 | 0.9 | 0.02 |
4 | exec - main.c:139-146 | main.extracted.8 | Single | 0.03 | 0.04 | 0.04 | 0.06 | 0.06 | 0.08 | 0.03 | 0.04 | 0.04 | 0.06 | 0.06 | 0.08 | 0.05 | 0.05 | 0.02 | 0.02 | 0.02 | 0.03 | 0.05 | 0.05 | 0.02 | 0.02 | 0.02 | 0.03 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 8 | 16 | 32 | 64 | 128 | 188 | 19.13 | 30.78 | 78.53 | 81.50 | 152.41 | 152.17 | 0 | 6.82 | 1.07 | 1.06 | 9.14 | 1.6 | 2.54 | 1.97 | 2.88 | 4.27 | 4.78 | 2 | 0 | 0 | 1 | 0 | 1 | 0 | 0.79 | 0.01 | 0.77 | 0.01 | 0.45 | 0.03 | 0.33 | 0.04 | 0.21 | 0.06 |