| Name | Module | Max Thread Time / Walltime orig_0 (%) | Coverage orig_0 (%) | Coverage Excluding Loops orig_0 (%) | Max Inclusive Time Over Threads orig_0 (s) | Max Exclusive Time Over Threads orig_0 (s) | Inclusive Time w.r.t. Wall Time orig_0 (s) | Exclusive Time w.r.t. Wall Time orig_0 (s) | Nb Threads orig_0 | Deviation (coverage) orig_0 | Deviation (walltime) orig_0 | Categories orig_0 | GFLOPS orig_0 | Compilation Options |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ►cg_calc_ur(int, int, int, double, double*, double*, double const*, double*, double const*) [clone .omp_outlined] | exec | 43.78 | 43.29 | 0.02 | 116.34 | 2.75 | 114.96 | 0.04 | 64 | 0.37 | 0.97 | Exe (%): 100.00 | 35.42 | Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ... |
| ○Loop 18 - cg.cpp:105-105 - exec | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | 0.00 | |||
| ►Loop 20 - cg.cpp:107-113 - exec | 0.00 | 43.27 | 0.00 | 116.31 | 0.00 | 114.92 | 0.00 | 0 | 0.00 | 0.00 | 0.00 | |||
| ○Loop 19 - cg.cpp:108-113 - exec | 43.76 | 43.27 | 43.27 | 116.31 | 116.31 | 114.92 | 114.92 | 64 | 0.37 | 0.98 | 35.42 | |||
| ►cg_calc_w(int, int, int, double*, double const*, double*, double const*, double const*) [clone .omp_outlined] | exec | 33.75 | 32.87 | 0.01 | 89.70 | 1.94 | 87.30 | 0.03 | 64 | 0.36 | 0.96 | Exe (%): 100.00 | 116.29 | Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ... |
| ○Loop 15 - cg.cpp:83-83 - exec | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | 0.00 | |||
| ►Loop 17 - cg.cpp:85-90 - exec | 0.07 | 32.86 | 0.05 | 89.75 | 0.19 | 87.27 | 0.13 | 64 | 0.01 | 0.03 | 46.36 | |||
| ○Loop 16 - cg.cpp:86-90 - exec | 33.70 | 32.81 | 32.81 | 89.56 | 89.56 | 87.14 | 87.14 | 64 | 0.36 | 0.96 | 116.43 | |||
| ►cg_calc_p(int, int, int, double, double*, double const*) [clone .omp_outlined] | exec | 20.78 | 20.52 | 0.01 | 55.23 | 1.05 | 54.48 | 0.02 | 64 | 0.15 | 0.39 | Exe (%): 100.00 | 24.88 | Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ... |
| ►Loop 22 - cg.cpp:127-131 - exec | 0.04 | 20.51 | 0.02 | 55.22 | 0.10 | 54.47 | 0.06 | 61 | 0.01 | 0.02 | 55.36 | |||
| ○Loop 21 - cg.cpp:128-131 - exec | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | 0.00 | |||
| ○Loop 23 - cg.cpp:128-131 - exec | 20.74 | 20.49 | 20.49 | 55.12 | 55.12 | 54.41 | 54.41 | 64 | 0.15 | 0.39 | 24.85 | |||
| ○kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libomp.so | 2.65 | 1.88 | 1.88 | 7.04 | 320.36 | 5.01 | 5.01 | 64 | 0.42 | 1.11 | OMP (%): 100.00 | 0.00 | |
| ○arch_local_irq_enable | kernel | 0.69 | 0.41 | 0.41 | 1.82 | 69.80 | 1.09 | 1.09 | 64 | 0.12 | 0.32 | OMP (%): 99.96 System (%): 0.04 MPI (%): 0.01 | 0.00 | |
| ○kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libomp.so | 0.35 | 0.21 | 0.21 | 0.93 | 36.26 | 0.57 | 0.57 | 64 | 0.05 | 0.14 | OMP (%): 100.00 | 0.00 | |
| ○el0_svc_common.constprop.0 | kernel | 0.25 | 0.16 | 0.16 | 0.68 | 26.95 | 0.42 | 0.42 | 64 | 0.04 | 0.12 | OMP (%): 100.00 | 0.00 | |
| ○__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*) | libomp.so | 0.16 | 0.08 | 0.08 | 0.42 | 13.70 | 0.21 | 0.21 | 64 | 0.04 | 0.11 | OMP (%): 100.00 | 0.00 | |
| ○__GI___sched_yield | libc.so.6 | 0.10 | 0.06 | 0.06 | 0.27 | 9.47 | 0.15 | 0.15 | 64 | 0.02 | 0.05 | OMP (%): 100.00 | 0.00 | |
| ►update_left(int, int, int, int, double*, bool) [clone .omp_outlined] | exec | 0.06 | 0.04 | 0.01 | 0.17 | 1.12 | 0.11 | 0.02 | 64 | 0.01 | 0.03 | Exe (%): 100.00 | 0.00 | Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ... |
| ►Loop 71 - local_halos.cpp:10-15 - exec | 0.00 | 0.03 | 0.00 | 0.14 | 0.00 | 0.09 | 0.00 | 0 | 0.00 | 0.00 | 0.00 | |||
| ○Loop 70 - local_halos.cpp:13-15 - exec | 0.05 | 0.03 | 0.03 | 0.14 | 0.14 | 0.09 | 0.09 | 64 | 0.01 | 0.02 | 0.00 | |||
| ○unknown_function | [vdso] | 0.05 | 0.03 | 0.03 | 0.14 | 4.94 | 0.08 | 0.08 | 64 | 0.01 | 0.03 | OMP (%): 100.00 | 0.00 | |
| ○__schedule | kernel | 0.05 | 0.03 | 0.03 | 0.14 | 4.50 | 0.07 | 0.07 | 63 | 0.01 | 0.03 | OMP (%): 100.00 | 0.00 | |
| ○__kmp_hyper_barrier_gather(barrier_type, kmp_info*, int, int, void (*)(void*, void*), void*) | libomp.so | 0.05 | 0.02 | 0.02 | 0.13 | 4.08 | 0.06 | 0.06 | 61 | 0.01 | 0.02 | OMP (%): 100.00 | 0.06 | |
| ○__kmpc_reduce_nowait | libomp.so | 0.05 | 0.02 | 0.02 | 0.14 | 3.71 | 0.06 | 0.06 | 60 | 0.01 | 0.03 | OMP (%): 100.00 | 0.00 | |
| ►update_right(int, int, int, int, double*, bool) [clone .omp_outlined] | exec | 0.04 | 0.02 | 0.01 | 0.09 | 1.29 | 0.06 | 0.02 | 61 | 0.01 | 0.02 | Exe (%): 100.00 | 0.00 | Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ... |
| ►Loop 73 - local_halos.cpp:25-30 - exec | 0.00 | 0.01 | 0.00 | 0.07 | 0.00 | 0.04 | 0.00 | 0 | 0.00 | 0.00 | 0.00 | |||
| ○Loop 72 - local_halos.cpp:28-30 - exec | 0.03 | 0.01 | 0.01 | 0.07 | 0.07 | 0.04 | 0.04 | 53 | 0.00 | 0.01 | 0.00 | |||
| ○schedule_debug.constprop.0 | kernel | 0.04 | 0.02 | 0.02 | 0.10 | 3.62 | 0.06 | 0.06 | 58 | 0.01 | 0.02 | OMP (%): 100.00 | 0.00 | |
| ○__kmp_launch_thread | libomp.so | 0.11 | 0.02 | 0.02 | 0.28 | 3.49 | 0.05 | 0.05 | 55 | 0.02 | 0.05 | OMP (%): 100.00 | 0.00 | |
| ○down_read_trylock | kernel | 0.03 | 0.02 | 0.02 | 0.08 | 3.22 | 0.05 | 0.05 | 62 | 0.00 | 0.01 | System (%): 100.00 | 0.00 | |
| ○__aarch64_ldadd8_acq_rel | libomp.so | 0.15 | 0.02 | 0.02 | 0.40 | 3.18 | 0.05 | 0.05 | 23 | 0.04 | 0.10 | OMP (%): 100.00 | 0.00 | |
| ►cg_init(int, int, int, int, double, double, double*, double const*, double const*, double*, double*, double*, double*, double*, double*) [clone .omp_outlined.6] | exec | 0.02 | 0.02 | 0.00 | 0.05 | 0.00 | 0.05 | 0.00 | 63 | 0.00 | 0.00 | Exe (%): 100.00 | 57.99 | Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ... |
| ○Loop 12 - cg.cpp:59-59 - exec | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | 0.00 | |||
| ►Loop 14 - cg.cpp:61-68 - exec | 0.00 | 0.02 | 0.00 | 0.05 | 0.00 | 0.05 | 0.00 | 0 | 0.00 | 0.00 | 0.00 | |||
| ○Loop 13 - cg.cpp:62-68 - exec | 0.02 | 0.02 | 0.02 | 0.05 | 0.05 | 0.05 | 0.05 | 63 | 0.00 | 0.00 | 57.96 | |||
| ○unknown_function | exec | 0.04 | 0.02 | 0.02 | 0.11 | 2.62 | 0.04 | 0.04 | 50 | 0.01 | 0.02 | Exe (%): 100.00 | 1.50 | |
| ○__kmp_barrier | libomp.so | 0.03 | 0.01 | 0.01 | 0.09 | 2.52 | 0.04 | 0.04 | 52 | 0.01 | 0.02 | OMP (%): 100.00 | 0.00 | |
| ►update_bottom(int, int, int, int, double*, bool) [clone .omp_outlined] | exec | 0.02 | 0.01 | 0.01 | 0.06 | 1.62 | 0.03 | 0.03 | 50 | 0.00 | 0.01 | Exe (%): 100.00 | 0.00 | Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ... |
| ○Loop 78 - local_halos.cpp:60-62 - exec | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | 0.00 | |||
| ○Loop 79 - local_halos.cpp:60-62 - exec | 0.02 | 0.00 | 0.00 | 0.05 | 0.05 | 0.01 | 0.01 | 12 | 0.00 | 0.01 | 0.00 |