Name | Module | Coverage run_0 (%) | Max Time Over Threads run_0 (s) | Time w.r.t. Wall Time run_0 (s) | Nb Threads run_0 | Deviation (coverage) run_0 | Deviation (walltime) run_0 | Categories run_0 | GFLOPS run_0 | Compilation Options |
---|---|---|---|---|---|---|---|---|---|---|
►_ZN16miniqmcreference19MultiBsplineEvalRef10evaluate_vIdEEvPKN11qmcplusplus14bspline_traitsIT_Lj3EE10SplineTypeES4_S4_S4_PS4_m | exec | 28.41 | 66.76 | 59.18 | 192 | 1.78 | 5.70 | Exe (%): 100.00 | 17.31 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
►Loop 844 - MultiBsplineRef.hpp:63-71 - exec | | 0.01 | 0.04 | 0.01 | 190 | 0.00 | 0.01 | | 39.25 | |
○Loop 843 - MultiBsplineRef.hpp:68-70 - exec | | 7.12 | 16.83 | 14.84 | 192 | 0.47 | 1.46 | | 17.24 | |
○Loop 840 - MultiBsplineRef.hpp:68-70 - exec | | 7.12 | 17.11 | 14.83 | 192 | 0.47 | 1.46 | | 17.23 | |
○Loop 842 - MultiBsplineRef.hpp:68-70 - exec | | 7.07 | 16.85 | 14.72 | 192 | 0.45 | 1.43 | | 17.36 | |
○Loop 841 - MultiBsplineRef.hpp:68-70 - exec | | 7.06 | 17.02 | 14.72 | 192 | 0.45 | 1.42 | | 17.37 | |
○Loop 849 - MultiBsplineRef.hpp:68-70 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 852 - MultiBsplineRef.hpp:68-70 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 848 - MultiBsplineRef.hpp:68-70 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 851 - MultiBsplineRef.hpp:68-71 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 847 - MultiBsplineRef.hpp:68-70 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 850 - MultiBsplineRef.hpp:68-70 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 846 - MultiBsplineRef.hpp:68-70 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 845 - MultiBsplineRef.hpp:68-70 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○mkl_blas_def_dgemm_pst | libmkl_def.so.2 | 23.95 | 52.85 | 49.91 | 192 | 1.04 | 1.56 | Math (%): 100.00 | 236.80 | |
○mkl_blas_def_dgemm_kernel_zen | libmkl_def.so.2 | 15.01 | 31.75 | 31.27 | 192 | 0.72 | 1.49 | Math (%): 100.00 | 771.17 | |
○mkl_blas_def_dgemm_copybn_bdz | libmkl_def.so.2 | 7.79 | 17.4 | 16.22 | 192 | 0.37 | 0.73 | Math (%): 100.00 | 0.01 | |
►_ZN16miniqmcreference19MultiBsplineEvalRef12evaluate_vghIdEEvPKN11qmcplusplus14bspline_traitsIT_Lj3EE10SplineTypeES4_S4_S4_PS4_S9_S9_m | exec | 6.8 | 16.08 | 14.16 | 192 | 0.32 | 1.13 | Exe (%): 100.00 | 180.66 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
○Loop 855 - MultiBsplineRef.hpp:276-286 - exec | | 0.11 | 0.42 | 0.22 | 192 | 0.03 | 0.05 | | 333.71 | |
○Loop 854 - MultiBsplineRef.hpp:276-286 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 856 - MultiBsplineRef.hpp:276-286 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
►Loop 857 - MultiBsplineRef.hpp:226-262 - exec [...] | | 0 | 0.03 | 0.01 | 183 | 0.00 | 0.01 | | 54.38 | |
►Loop 858 - MultiBsplineRef.hpp:226-262 - exec [...] | | 0.01 | 0.06 | 0.03 | 192 | 0.01 | 0.01 | | 104.54 | |
○Loop 860 - MultiBsplineRef.hpp:242-262 - exec | | 6.66 | 15.82 | 13.87 | 192 | 0.32 | 1.13 | | 178.77 | |
○Loop 861 - MultiBsplineRef.hpp:242-262 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 859 - MultiBsplineRef.hpp:242-262 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
►_ZN11qmcplusplus27SoaDistanceTableABOMPTargetIdLj3ELi40EE8evaluateERNS_11ParticleSetE | exec | 4.53 | 10.04 | 9.43 | 192 | 0.16 | 0.57 | Exe (%): 100.00 | 113.81 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D _MPICC_H -D restrict=__restrict__ -I /beegfs/hac... |
►Loop 2033 - SoaDistanceTableABOMPTarget.h:214-228 - exec [...] | | 0.01 | 0.06 | 0.02 | 192 | 0.01 | 0.01 | | 114.06 | |
►Loop 2034 - SoaDistanceTableABOMPTarget.h:215-228 - exec [...] | | 0.02 | 0.13 | 0.04 | 192 | 0.01 | 0.02 | | 169.22 | |
○Loop 2036 - ParticleBConds3DSoa.h:280-298 - exec [...] | | 4.48 | 9.98 | 9.34 | 192 | 0.17 | 0.58 | | 113.92 | |
○Loop 2037 - ParticleBConds3DSoa.h:280-298 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 2035 - SoaDistanceTableABOMPTarget.h:228-228 - exec [...] | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 2040 - SoaDistanceTableABOMPTarget.h:194-196 - exec | | 0 | 0.01 | 0 | 19 | 0.00 | 0.00 | | 0.00 | |
○Loop 2039 - SoaDistanceTableABOMPTarget.h:194-196 - exec | | 0 | 0.01 | 0 | 76 | 0.00 | 0.00 | | 0.00 | |
○Loop 2038 - SoaDistanceTableABOMPTarget.h:194-196 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
►_ZN11qmcplusplus27SoaDistanceTableAAOMPTargetIdLj3ELi40EE6updateEi | exec | 1.77 | 4.53 | 3.69 | 192 | 0.19 | 0.43 | Exe (%): 100.00 | 26.88 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D _MPICC_H -D restrict=__restrict__ -I /beegfs/hac... |
○Loop 1791 - SoaDistanceTableAAOMPTarget.h:440-442 - exec [...] | | 1.77 | 4.53 | 3.68 | 192 | 0.19 | 0.43 | | 26.94 | |
○unknown_function | Unknown module | 1.51 | 3.47 | 3.14 | 194 | 4.58 | 0.36 | Others (%): 99.95 Math (%): 0.05 | 67.61 | |
►_ZN16miniqmcreference17TwoBodyJastrowRefIN11qmcplusplus14BsplineFunctorIdEEE14evaluateRatiosERNS1_18VirtualParticleSetERSt6vectorIdSaIdEE | exec | 1.38 | 3.13 | 2.87 | 192 | 0.09 | 0.21 | Exe (%): 100.00 | 171.08 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
►Loop 322 - TwoBodyJastrowRef.h:107-132 - exec [...] | | 0.01 | 0.06 | 0.03 | 192 | 0.01 | 0.01 | | 156.34 | |
►Loop 323 - BsplineFunctor.h:236-260 - exec [...] | | 0.03 | 0.13 | 0.07 | 192 | 0.01 | 0.02 | | 175.31 | |
○Loop 326 - BsplineFunctor.h:236-241 - exec | | 1.3 | 2.93 | 2.71 | 192 | 0.09 | 0.21 | | 168.55 | |
○Loop 324 - BsplineFunctor.h:246-260 - exec [...] | | 0.03 | 0.11 | 0.06 | 192 | 0.01 | 0.02 | | 221.29 | |
○Loop 325 - BsplineFunctor.h:246-260 - exec | | 0 | 0.02 | 0.01 | 192 | 0.00 | 0.01 | | 398.63 | |
○__GI___sched_yield | libc.so.6 | 1.35 | 2.97 | 2.81 | 192 | 0.17 | 0.29 | System (%): 100.00 | 0.00 | |
○mkl_blas_def_xdgemv | libmkl_def.so.2 | 1.07 | 2.92 | 2.22 | 192 | 0.14 | 0.31 | Math (%): 100.00 | 519.21 | |
○mkl_blas_def_dgemm_copyan_bdz | libmkl_def.so.2 | 0.75 | 1.66 | 1.57 | 192 | 0.05 | 0.09 | Math (%): 100.00 | 269.03 | |
►_ZNK11qmcplusplus10DTD_BCondsIdLj3ELi40EE16computeDistancesINS_10TinyVectorIdLj3EEENS_18VectorSoAContainerIdLj3ENS_10MallocatorIdLm64EEEEES8_EEvRKT_RKT0_PdRT1_iii | exec | 0.67 | 1.71 | 1.4 | 192 | 0.07 | 0.16 | Exe (%): 100.00 | 384.76 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D _MPICC_H -D restrict=__restrict__ -I... |
○Loop 1310 - ParticleBConds3DSoa.h:234-255 - exec | | 0.67 | 1.7 | 1.39 | 192 | 0.07 | 0.16 | | 386.83 | |
○Loop 1309 - ParticleBConds3DSoa.h:234-255 - exec | | 0 | 0 | 0 | 33 | 0.00 | 0.00 | | 0.00 | |
○Loop 1308 - ParticleBConds3DSoa.h:234-255 - exec | | 0 | 0 | 0 | 38 | 0.00 | 0.00 | | 0.00 | |
►_ZN16miniqmcreference17TwoBodyJastrowRefIN11qmcplusplus14BsplineFunctorIdEEE10acceptMoveERNS1_11ParticleSetEi | exec | 0.49 | 1.34 | 1.03 | 192 | 0.05 | 0.12 | Exe (%): 100.00 | 85.73 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
○Loop 316 - TwoBodyJastrowRef.h:324-331 - exec | | 0.14 | 0.42 | 0.28 | 192 | 0.02 | 0.05 | | 113.61 | |
○Loop 311 - TwoBodyJastrowRef.h:342-347 - exec | | 0.12 | 0.33 | 0.24 | 192 | 0.02 | 0.04 | | 76.77 | |
○Loop 313 - TwoBodyJastrowRef.h:342-347 - exec | | 0.12 | 0.34 | 0.24 | 192 | 0.02 | 0.04 | | 76.48 | |
○Loop 309 - TwoBodyJastrowRef.h:342-347 - exec | | 0.12 | 0.33 | 0.25 | 192 | 0.02 | 0.04 | | 72.97 | |
○Loop 314 - TwoBodyJastrowRef.h:342-347 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 308 - TwoBodyJastrowRef.h:342-347 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 315 - TwoBodyJastrowRef.h:324-331 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 310 - TwoBodyJastrowRef.h:342-347 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 306 - TwoBodyJastrowRef.h:342-347 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 319 - TwoBodyJastrowRef.h:269-274 - exec [...] | | 0 | 0.01 | 0 | 144 | 0.00 | 0.00 | | 0.00 | |
○Loop 318 - TwoBodyJastrowRef.h:269-274 - exec [...] | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 312 - TwoBodyJastrowRef.h:342-347 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 307 - TwoBodyJastrowRef.h:342-347 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 317 - TwoBodyJastrowRef.h:324-331 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
►_ZN16miniqmcreference19DiracDeterminantRefIN11qmcplusplus13DelayedUpdateIddEEE10evaluateGLERNS1_11ParticleSetERNS1_14ParticleAttribINS1_10TinyVectorIdLj3EEESaIS9_EEERNS7_IdSaIdEEEb | exec | 0.48 | 1.6 | 1 | 192 | 0.09 | 0.20 | Exe (%): 100.00 | 38.89 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
►Loop 1060 - inner_product.hpp:82-155 - exec [...] | | 0 | 0.01 | 0 | 105 | 0.00 | 0.00 | | 0.00 | |
○Loop 1059 - inner_product.hpp:155-155 - exec [...] | | 0.3 | 1.03 | 0.63 | 192 | 0.06 | 0.13 | | 53.14 | |
○Loop 1058 - inner_product.hpp:82-83 - exec | | 0.18 | 0.66 | 0.37 | 192 | 0.04 | 0.09 | | 14.35 | |
○Loop 1064 - inner_product.hpp:82-83 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 1063 - inner_product.hpp:82-83 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 1062 - inner_product.hpp:155-155 - exec [...] | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 1061 - inner_product.hpp:155-155 - exec [...] | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 1057 - DiracDeterminantRef.cpp:173-173 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
►_ZN16miniqmcreference17einspline_spo_refIdE8evaluateERKN11qmcplusplus11ParticleSetEiRNS2_6VectorIdSaIdEEERNS6_INS2_10TinyVectorIdLj3EEESaISB_EEES9_ | exec | 0.46 | 1.19 | 0.96 | 192 | 0.05 | 0.11 | Exe (%): 100.00 | 225.77 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
►Loop 830 - einspline_spo_ref.hpp:219-227 - exec [...] | | 0 | 0.01 | 0 | 34 | 0.00 | 0.00 | | 0.00 | |
○Loop 831 - einspline_spo_ref.hpp:223-227 - exec [...] | | 0.46 | 1.18 | 0.96 | 192 | 0.05 | 0.11 | | 225.70 | |
►_ZN11qmcplusplus6SPOSet17evaluateDetRatiosERKNS_18VirtualParticleSetERNS_6VectorIdSaIdEEERKS6_RSt6vectorIdS5_E | exec | 0.4 | 1.02 | 0.83 | 192 | 0.07 | 0.12 | Exe (%): 100.00 | 25.32 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
►Loop 832 - inner_product.hpp:82-83 - exec [...] | | 0.04 | 0.15 | 0.08 | 192 | 0.01 | 0.03 | | 30.72 | |
○Loop 833 - inner_product.hpp:82-83 - exec | | 0.36 | 0.92 | 0.74 | 192 | 0.06 | 0.11 | | 24.93 | |
○Loop 835 - inner_product.hpp:82-83 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 834 - inner_product.hpp:82-83 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○__kmp_hardware_timestamp | libomp.so | 0.39 | 2.48 | 0.8 | 192 | 0.17 | 0.31 | OMP (%): 100.00 | 0.00 | |
○_ZL27__kmp_hyper_barrier_release12barrier_typeP8kmp_infoiiiPv | libomp.so | 0.29 | 0.71 | 0.61 | 190 | 0.07 | 0.12 | OMP (%): 100.00 | 0.00 | |
►_ZNK11qmcplusplus14BsplineFunctorIdE11evaluateVGLEiiiPKdPdS4_S4_S4_Pi | exec | 0.27 | 0.7 | 0.56 | 192 | 0.03 | 0.06 | Exe (%): 100.00 | 402.23 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
○Loop 265 - BsplineFunctor.h:291-298 - exec | | 0.22 | 0.56 | 0.45 | 192 | 0.02 | 0.05 | | 378.42 | |
○Loop 262 - BsplineFunctor.h:302-336 - exec [...] | | 0.02 | 0.07 | 0.04 | 192 | 0.01 | 0.01 | | 925.70 | |
○Loop 263 - BsplineFunctor.h:302-336 - exec | | 0 | 0.02 | 0 | 192 | 0.00 | 0.00 | | 0.00 | |
○Loop 264 - BsplineFunctor.h:302-336 - exec [...] | | 0 | 0.02 | 0 | 192 | 0.00 | 0.00 | | 0.00 | |
►_ZN16miniqmcreference19DiracDeterminantRefIN11qmcplusplus13DelayedUpdateIddEEE8evalGradERNS1_11ParticleSetEi | exec | 0.23 | 0.6 | 0.47 | 192 | 0.03 | 0.06 | Exe (%): 100.00 | 74.80 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
○Loop 1045 - inner_product.hpp:155-155 - exec [...] | | 0.22 | 0.59 | 0.46 | 192 | 0.03 | 0.06 | | 73.79 | |
○Loop 1047 - inner_product.hpp:155-155 - exec [...] | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 1046 - inner_product.hpp:155-155 - exec [...] | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
►_ZN16miniqmcreference17TwoBodyJastrowRefIN11qmcplusplus14BsplineFunctorIdEEE9ratioGradERNS1_11ParticleSetEiRNS1_10TinyVectorIdLj3EEE | exec | 0.18 | 0.5 | 0.37 | 192 | 0.03 | 0.06 | Exe (%): 100.00 | 92.39 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
○Loop 294 - TwoBodyJastrowRef.h:155-156 - exec | | 0.05 | 0.17 | 0.11 | 192 | 0.01 | 0.02 | | 82.65 | |
○Loop 293 - TwoBodyJastrowRef.h:155-156 - exec | | 0.05 | 0.17 | 0.11 | 192 | 0.01 | 0.03 | | 77.98 | |
○Loop 295 - TwoBodyJastrowRef.h:155-156 - exec | | 0.05 | 0.18 | 0.11 | 192 | 0.01 | 0.03 | | 83.02 | |
○Loop 303 - stl_numeric.h:141-141 - exec | | 0.01 | 0.04 | 0.02 | 192 | 0.00 | 0.01 | | 282.82 | |
○Loop 305 - TwoBodyJastrowRef.h:269-274 - exec [...] | | 0 | 0.04 | 0.01 | 188 | 0.00 | 0.01 | | 60.50 | |
○Loop 301 - TwoBodyJastrowRef.h:155-156 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 302 - stl_numeric.h:140-141 - exec [...] | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 300 - TwoBodyJastrowRef.h:155-156 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 296 - TwoBodyJastrowRef.h:155-156 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 298 - TwoBodyJastrowRef.h:155-156 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 299 - TwoBodyJastrowRef.h:155-156 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 297 - TwoBodyJastrowRef.h:155-156 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 304 - stl_numeric.h:141-141 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
►_ZN11qmcplusplus11DiracMatrixIddE16invert_transposeERKNS_6MatrixIdSaIdEEERS4_RdS8_ | exec | 0.14 | 0.75 | 0.29 | 192 | 0.07 | 0.13 | Exe (%): 100.00 | 24.93 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
○Loop 1019 - DiracMatrix.h:31-35 - exec [...] | | 0 | 0 | 0 | 12 | 0.00 | 0.00 | | 0.00 | |
○Loop 1020 - DiracMatrix.h:112-113 - exec | | 0 | 0 | 0 | 13 | 0.00 | 0.00 | | 0.00 | |
►Loop 1022 - inner_product.hpp:210-212 - exec | | 0 | 0 | 0 | 1 | 0.00 | 0.00 | | 0.00 | |
○Loop 1023 - inner_product.hpp:211-212 - exec | | 0.14 | 0.75 | 0.29 | 192 | 0.07 | 0.13 | | 24.83 | |
○Loop 1021 - inner_product.hpp:211-212 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 1024 - inner_product.hpp:211-212 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○mkl_serv_lock | libmkl_core.so.2 | 0.14 | 0.47 | 0.29 | 190 | 0.02 | 0.04 | Math (%): 100.00 | 0.00 | |
○mkl_blas_def_dtrsm_i | libmkl_def.so.2 | 0.1 | 0.28 | 0.22 | 192 | 0.02 | 0.03 | Math (%): 100.00 | 1970.23 | |
○_ZL26__kmp_hyper_barrier_gather12barrier_typeP8kmp_infoiiPFvPvS2_ES2_ | libomp.so | 0.1 | 1.88 | 0.21 | 92 | 0.17 | 0.31 | OMP (%): 100.00 | 0.00 | |
►_ZN11qmcplusplus10NonLocalPPIdE8evaluateERKNS_11ParticleSetERNS_12WaveFunctionE | exec | 0.09 | 0.24 | 0.19 | 192 | 0.02 | 0.03 | Exe (%): 100.00 | 48.03 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D _MPICC_H -D restrict=__restrict__ -I /beegfs/hac... |
○Loop 61 - OhmmsVector.h:144-210 - exec [...] | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
►Loop 62 - NonLocalPP.hpp:122-135 - exec [...] | | 0 | 0.04 | 0.01 | 187 | 0.00 | 0.01 | | 51.50 | |
►Loop 63 - NonLocalPP.hpp:126-135 - exec [...] | | 0.08 | 0.22 | 0.16 | 192 | 0.02 | 0.03 | | 51.35 | |
○Loop 64 - NonLocalPP.hpp:131-132 - exec [...] | | 0.01 | 0.04 | 0.01 | 190 | 0.00 | 0.01 | | 39.38 | |
○Loop 65 - stl_algobase.h:911-912 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
►_ZN16miniqmcreference19DiracDeterminantRefIN11qmcplusplus13DelayedUpdateIddEEE11evaluateLogERNS1_11ParticleSetERNS1_14ParticleAttribINS1_10TinyVectorIdLj3EEESaIS9_EEERNS7_IdSaIdEEE | exec | 0.08 | 0.2 | 0.16 | 192 | 0.02 | 0.04 | Exe (%): 100.00 | 48.02 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
►Loop 1040 - inner_product.hpp:82-155 - exec [...] | | 0 | 0 | 0 | 26 | 0.00 | 0.00 | | 0.00 | |
○Loop 1038 - inner_product.hpp:155-155 - exec [...] | | 0.06 | 0.18 | 0.13 | 192 | 0.02 | 0.03 | | 50.64 | |
○Loop 1039 - inner_product.hpp:82-83 - exec | | 0.02 | 0.06 | 0.04 | 192 | 0.01 | 0.01 | | 27.00 | |
○Loop 1042 - inner_product.hpp:82-83 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 1041 - inner_product.hpp:82-83 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 1043 - inner_product.hpp:155-155 - exec [...] | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 1044 - inner_product.hpp:155-155 - exec [...] | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○mkl_blas_def_dgemm_mscale | libmkl_def.so.2 | 0.08 | 0.24 | 0.18 | 192 | 0.01 | 0.03 | Math (%): 100.00 | 50.04 | |
►_ZN16miniqmcreference17OneBodyJastrowRefIN11qmcplusplus14BsplineFunctorIdEEE14evaluateRatiosERNS1_18VirtualParticleSetERSt6vectorIdSaIdEE | exec | 0.08 | 0.23 | 0.16 | 192 | 0.02 | 0.03 | Exe (%): 100.00 | 20.29 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
►Loop 227 - OneBodyJastrowRef.h:134-155 - exec [...] | | 0.01 | 0.05 | 0.02 | 192 | 0.01 | 0.01 | | 23.75 | |
►Loop 229 - BsplineFunctor.h:233-260 - exec [...] | | 0.01 | 0.03 | 0.01 | 180 | 0.00 | 0.01 | | 17.63 | |
○Loop 232 - BsplineFunctor.h:236-241 - exec | | 0.06 | 0.18 | 0.12 | 192 | 0.02 | 0.03 | | 20.86 | |
○Loop 230 - BsplineFunctor.h:246-260 - exec [...] | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 231 - BsplineFunctor.h:246-260 - exec | | 0 | 0.01 | 0 | 116 | 0.00 | 0.00 | | 0.00 | |
○Loop 228 - OneBodyJastrowRef.h:151-155 - exec [...] | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○__kmp_api_omp_get_level | libomp.so | 0.06 | 0.18 | 0.12 | 192 | 0.02 | 0.03 | OMP (%): 100.00 | 13.29 | |
○mkl_blas_def_dgemm_copyat_bdz | libmkl_def.so.2 | 0.06 | 0.19 | 0.12 | 192 | 0.01 | 0.02 | Math (%): 100.00 | 164.15 | |
►_ZN16miniqmcreference19DiracDeterminantRefIN11qmcplusplus13DelayedUpdateIddEEE9ratioGradERNS1_11ParticleSetEiRNS1_10TinyVectorIdLj3EEE | exec | 0.06 | 0.18 | 0.12 | 192 | 0.01 | 0.02 | Exe (%): 100.00 | 303.80 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
○Loop 1048 - inner_product.hpp:82-83 - exec | | 0.03 | 0.13 | 0.07 | 192 | 0.01 | 0.02 | | 56.61 | |
○Loop 1049 - inner_product.hpp:155-155 - exec [...] | | 0.02 | 0.08 | 0.04 | 192 | 0.01 | 0.01 | | 747.26 | |
○Loop 1053 - inner_product.hpp:82-83 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 1051 - inner_product.hpp:155-155 - exec [...] | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 1050 - inner_product.hpp:155-155 - exec [...] | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 1052 - inner_product.hpp:82-83 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
►_ZN16miniqmcreference17einspline_spo_refIdE10evaluate_vERKN11qmcplusplus11ParticleSetEi | exec | 0.06 | 0.22 | 0.12 | 192 | 0.02 | 0.03 | Exe (%): 100.00 | 8.59 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
○Loop 839 - einspline_spo_ref.hpp:175-176 - exec [...] | | 0.02 | 0.07 | 0.03 | 192 | 0.01 | 0.01 | | 13.50 | |
○mkl_blas_def_dtrsml2x2_lln | libmkl_def.so.2 | 0.06 | 0.17 | 0.13 | 192 | 0.01 | 0.02 | Math (%): 100.00 | 1499.33 | |
○mkl_blas_def_dtrmml2x2_lun | libmkl_def.so.2 | 0.06 | 0.17 | 0.13 | 192 | 0.01 | 0.02 | Math (%): 100.00 | 1487.83 | |
○mkl_lapack_xdlaswp | libmkl_core.so.2 | 0.05 | 0.13 | 0.1 | 192 | 0.01 | 0.01 | Math (%): 100.00 | 0.24 | |
○mkl_serv_trylock | libmkl_core.so.2 | 0.04 | 0.19 | 0.09 | 190 | 0.01 | 0.02 | Math (%): 100.00 | 0.00 | |
►_ZN16miniqmcreference17TwoBodyJastrowRefIN11qmcplusplus14BsplineFunctorIdEEE9recomputeERNS1_11ParticleSetE | exec | 0.04 | 0.13 | 0.09 | 192 | 0.01 | 0.02 | Exe (%): 100.00 | 177.02 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
►Loop 328 - TwoBodyJastrowRef.h:268-398 - exec [...] | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
►Loop 329 - TwoBodyJastrowRef.h:268-398 - exec [...] | | 0 | 0.01 | 0 | 144 | 0.00 | 0.00 | | 0.00 | |
○Loop 345 - TwoBodyJastrowRef.h:381-382 - exec | | 0.01 | 0.05 | 0.02 | 192 | 0.01 | 0.01 | | 45.19 | |
○Loop 343 - TwoBodyJastrowRef.h:381-382 - exec | | 0.01 | 0.04 | 0.02 | 192 | 0.01 | 0.01 | | 49.44 | |
○Loop 344 - TwoBodyJastrowRef.h:381-382 - exec | | 0.01 | 0.05 | 0.02 | 192 | 0.01 | 0.01 | | 50.63 | |
○Loop 346 - TwoBodyJastrowRef.h:381-382 - exec | | 0 | 0 | 0 | 5 | 0.00 | 0.00 | | 0.00 | |
○Loop 356 - stl_numeric.h:141-141 - exec | | 0 | 0 | 0 | 7 | 0.00 | 0.00 | | 0.00 | |
○Loop 338 - TwoBodyJastrowRef.h:397-398 - exec | | 0 | 0.02 | 0.01 | 192 | 0.00 | 0.00 | | 218.38 | |
○Loop 351 - TwoBodyJastrowRef.h:381-382 - exec | | 0 | 0 | 0 | 12 | 0.00 | 0.00 | | 0.00 | |
○Loop 353 - TwoBodyJastrowRef.h:375-376 - exec | | 0 | 0 | 0 | 18 | 0.00 | 0.00 | | 0.00 | |
○Loop 340 - TwoBodyJastrowRef.h:388-391 - exec | | 0 | 0.03 | 0.01 | 192 | 0.00 | 0.01 | | 424.76 | |
○Loop 335 - TwoBodyJastrowRef.h:397-398 - exec | | 0 | 0 | 0 | 12 | 0.00 | 0.00 | | 0.00 | |
○Loop 332 - TwoBodyJastrowRef.h:397-398 - exec | | 0 | 0 | 0 | 5 | 0.00 | 0.00 | | 0.00 | |
○Loop 333 - TwoBodyJastrowRef.h:397-398 - exec | | 0 | 0 | 0 | 6 | 0.00 | 0.00 | | 0.00 | |
○Loop 339 - TwoBodyJastrowRef.h:397-398 - exec | | 0 | 0 | 0 | 12 | 0.00 | 0.00 | | 0.00 | |
○Loop 342 - TwoBodyJastrowRef.h:375-376 - exec | | 0 | 0.01 | 0 | 192 | 0.00 | 0.00 | | 0.00 | |
○Loop 330 - TwoBodyJastrowRef.h:388-391 - exec | | 0 | 0 | 0 | 7 | 0.00 | 0.00 | | 0.00 | |
○Loop 334 - TwoBodyJastrowRef.h:397-398 - exec | | 0 | 0.02 | 0.01 | 192 | 0.00 | 0.00 | | 222.13 | |
○Loop 331 - TwoBodyJastrowRef.h:397-398 - exec | | 0 | 0 | 0 | 2 | 0.00 | 0.00 | | 0.00 | |
○Loop 349 - TwoBodyJastrowRef.h:381-382 - exec | | 0 | 0 | 0 | 11 | 0.00 | 0.00 | | 0.00 | |
○Loop 341 - TwoBodyJastrowRef.h:388-391 - exec | | 0 | 0 | 0 | 5 | 0.00 | 0.00 | | 0.00 | |
○Loop 357 - TwoBodyJastrowRef.h:269-274 - exec [...] | | 0 | 0 | 0 | 53 | 0.00 | 0.00 | | 0.00 | |
○Loop 352 - TwoBodyJastrowRef.h:375-376 - exec | | 0 | 0 | 0 | 8 | 0.00 | 0.00 | | 0.00 | |
○Loop 350 - TwoBodyJastrowRef.h:381-382 - exec | | 0 | 0 | 0 | 5 | 0.00 | 0.00 | | 0.00 | |
○Loop 347 - TwoBodyJastrowRef.h:381-382 - exec | | 0 | 0 | 0 | 8 | 0.00 | 0.00 | | 0.00 | |
○Loop 336 - TwoBodyJastrowRef.h:397-398 - exec | | 0 | 0.02 | 0 | 192 | 0.00 | 0.00 | | 0.00 | |
○Loop 355 - stl_numeric.h:141-141 - exec | | 0 | 0.01 | 0 | 189 | 0.00 | 0.00 | | 0.00 | |
○Loop 354 - stl_numeric.h:140-141 - exec [...] | | 0 | 0 | 0 | 2 | 0.00 | 0.00 | | 0.00 | |
○Loop 348 - TwoBodyJastrowRef.h:381-382 - exec | | 0 | 0 | 0 | 2 | 0.00 | 0.00 | | 0.00 | |
○Loop 337 - TwoBodyJastrowRef.h:397-398 - exec | | 0 | 0 | 0 | 10 | 0.00 | 0.00 | | 0.00 | |
►_ZN16miniqmcreference17OneBodyJastrowRefIN11qmcplusplus14BsplineFunctorIdEEE9ratioGradERNS1_11ParticleSetEiRNS1_10TinyVectorIdLj3EEE | exec | 0.03 | 0.1 | 0.05 | 192 | 0.01 | 0.02 | Exe (%): 100.00 | 65.08 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
○Loop 194 - OneBodyJastrowRef.h:192-193 - exec | | 0.01 | 0.05 | 0.01 | 192 | 0.00 | 0.01 | | 69.88 | |
○Loop 195 - OneBodyJastrowRef.h:192-193 - exec | | 0.01 | 0.03 | 0.01 | 191 | 0.00 | 0.01 | | 71.38 | |
○Loop 193 - OneBodyJastrowRef.h:192-193 - exec | | 0.01 | 0.05 | 0.02 | 192 | 0.00 | 0.01 | | 48.56 | |
○Loop 199 - OneBodyJastrowRef.h:192-193 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 202 - OneBodyJastrowRef.h:186-187 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 197 - OneBodyJastrowRef.h:192-193 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 198 - OneBodyJastrowRef.h:192-193 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 196 - OneBodyJastrowRef.h:192-193 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 190 - stl_numeric.h:141-141 - exec | | 0 | 0 | 0 | 62 | 0.00 | 0.00 | | 0.00 | |
○Loop 192 - OneBodyJastrowRef.h:186-187 - exec | | 0 | 0.01 | 0 | 110 | 0.00 | 0.00 | | 0.00 | |
○Loop 203 - OneBodyJastrowRef.h:186-187 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 201 - OneBodyJastrowRef.h:192-193 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 200 - OneBodyJastrowRef.h:192-193 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 189 - stl_numeric.h:140-141 - exec [...] | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 191 - stl_numeric.h:141-141 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○mkl_blas_def_xdswap | libmkl_def.so.2 | 0.03 | 0.09 | 0.07 | 192 | 0.01 | 0.02 | Math (%): 100.00 | 105.32 | |
○f64xsubf128 | libm.so.6 | 0.03 | 0.11 | 0.07 | 192 | 0.01 | 0.02 | Math (%): 100.00 | 281.32 | |
►_ZN11qmcplusplus9TimerTypeINSt6chrono3_V212system_clockEE5startEv | exec | 0.03 | 0.12 | 0.06 | 192 | 0.01 | 0.02 | Exe (%): 100.00 | 16.69 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D restrict=__restrict__ -I /beegfs/hackathon/users/eoseret/qaas_runs/170-855-3059/intel/min... |
○Loop 1524 - NewTimer.cpp:53-54 - exec | | 0 | 0.01 | 0 | 54 | 0.00 | 0.00 | | 0.00 | |
○mkl_serv_trylock@plt | libmkl_core.so.2 | 0.03 | 0.09 | 0.06 | 190 | 0.01 | 0.02 | Math (%): 100.00 | 0.00 | |
►_ZN11qmcplusplus9TimerTypeINSt6chrono3_V212system_clockEE4stopEv | exec | 0.02 | 0.23 | 0.05 | 192 | 0.01 | 0.02 | Exe (%): 100.00 | 33.83 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D restrict=__restrict__ -I /beegfs/hackathon/users/eoseret/qaas_runs/170-855-3059/intel/min... |
○Loop 1525 - stl_tree.h:780-1905 - exec [...] | | 0 | 0.04 | 0 | 2 | 0.01 | 0.01 | | 0.00 | |
○Loop 1526 - stl_tree.h:780-1905 - exec [...] | | 0 | 0.09 | 0 | 2 | 0.00 | 0.01 | | 0.00 | |
○Loop 1527 - NewTimer.cpp:99-100 - exec | | 0 | 0 | 0 | 105 | 0.00 | 0.00 | | 0.00 | |
►_ZN11qmcplusplus12WaveFunction14evaluateRatiosERNS_18VirtualParticleSetERSt6vectorIdSaIdEE | exec | 0.02 | 0.07 | 0.04 | 192 | 0.01 | 0.01 | Exe (%): 100.00 | 30.94 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
►Loop 112 - WaveFunction.cpp:269-274 - exec [...] | | 0.01 | 0.04 | 0.01 | 191 | 0.00 | 0.01 | | 62.38 | |
○Loop 115 - WaveFunction.cpp:273-274 - exec | | 0 | 0.02 | 0 | 183 | 0.00 | 0.00 | | 0.00 | |
○Loop 113 - WaveFunction.cpp:273-274 - exec | | 0 | 0 | 0 | 15 | 0.00 | 0.00 | | 0.00 | |
○Loop 114 - WaveFunction.cpp:273-274 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○__tls_get_addr | ld-linux-x86-64.so.2 | 0.02 | 0.09 | 0.04 | 192 | 0.01 | 0.02 | System (%): 99.44 OMP (%): 0.56 | 24.56 | |
►_ZN16miniqmcreference17einspline_spo_refIdE8evaluateERKN11qmcplusplus11ParticleSetEiRNS2_6VectorIdSaIdEEE | exec | 0.02 | 0.1 | 0.04 | 192 | 0.01 | 0.02 | Exe (%): 100.00 | 9.38 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
○Loop 829 - einspline_spo_ref.hpp:183-187 - exec [...] | | 0.01 | 0.06 | 0.02 | 187 | 0.01 | 0.01 | | 2.81 | |
○_ZNK10__cxxabiv121__vmi_class_type_info12__do_dyncastElNS_17__class_type_info10__sub_kindEPKS1_PKvS4_S6_RNS1_16__dyncast_resultE | libstdc++.so.6.0.29 | 0.01 | 0.04 | 0.02 | 192 | 0.00 | 0.01 | Others (%): 100.00 | 122.81 | |
○mkl_serv_thread_yield@plt | libmkl_core.so.2 | 0.01 | 0.04 | 0.01 | 171 | 0.00 | 0.01 | Math (%): 100.00 | 0.00 | |
►_ZN11qmcplusplus11ParticleSet6updateEb | exec | 0.01 | 0.06 | 0.03 | 192 | 0.01 | 0.01 | Exe (%): 100.00 | 2.92 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D _MPICC_H -D restrict=__restrict__ -I... |
○Loop 1143 - ParticleSet.cpp:242-243 - exec [...] | | 0.01 | 0.04 | 0.02 | 186 | 0.00 | 0.01 | | 0.31 | |
○__dynamic_cast | libstdc++.so.6.0.29 | 0.01 | 0.05 | 0.03 | 192 | 0.01 | 0.01 | Others (%): 100.00 | 55.79 | |
►_ZN11qmcplusplus6VectorIdSaIdEE6resizeEmd | exec | 0.01 | 0.04 | 0.02 | 192 | 0.00 | 0.01 | Exe (%): 100.00 | 143.75 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
○Loop 237 - stl_algobase.h:924-924 - exec | | 0.01 | 0.04 | 0.02 | 192 | 0.00 | 0.01 | | 143.75 | |
○Loop 239 - stl_algobase.h:924-924 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 235 - stl_algobase.h:923-924 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 236 - stl_algobase.h:924-924 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 234 - stl_algobase.h:924-924 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○Loop 238 - stl_algobase.h:923-924 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○mkl_lapack_dgetri | libmkl_core.so.2 | 0.01 | 0.04 | 0.01 | 192 | 0.00 | 0.01 | Math (%): 100.00 | 175.38 | |
○mkl_blas_def_xdaxpy | libmkl_def.so.2 | 0.01 | 0.07 | 0.01 | 192 | 0.01 | 0.01 | Math (%): 100.00 | 270.38 | |
►_ZN11qmcplusplus12WaveFunction9ratioGradERNS_11ParticleSetEiRNS_10TinyVectorIdLj3EEE | exec | 0.01 | 0.03 | 0.01 | 192 | 0.00 | 0.01 | Exe (%): 100.00 | 170.50 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
○Loop 108 - WaveFunction.cpp:198-201 - exec [...] | 0 | 0.03 | 0.01 | 190 | 0.00 | 0.01 | 103.13 | |||
○_ZN16miniqmcreference19DiracDeterminantRefIN11qmcplusplus13DelayedUpdateIddEEE14evaluateRatiosERNS1_18VirtualParticleSetERSt6vectorIdSaIdEE | exec | 0.01 | 0.05 | 0.02 | 184 | 0.00 | 0.01 | Exe (%): 100.00 | 6.06 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
►_ZN11qmcplusplus6VectorINS_10TinyVectorIdLj3EEESaIS2_EE6resizeEmS2_ | exec | 0.01 | 0.05 | 0.03 | 192 | 0.01 | 0.01 | Exe (%): 100.00 | 239.92 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
○Loop 241 - stl_algobase.h:911-912 - exec | 0.01 | 0.05 | 0.03 | 192 | 0.01 | 0.01 | 239.92 | |||
○Loop 240 - stl_algobase.h:911-912 - exec | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | |||
○__kmp_get_ancestor_thread_num.part.60 | libomp.so | 0.01 | 0.05 | 0.01 | 188 | 0.00 | 0.01 | OMP (%): 100.00 | 30.63 | |
►_ZN16miniqmcreference17OneBodyJastrowRefIN11qmcplusplus14BsplineFunctorIdEEE9computeU3ERNS1_11ParticleSetEiPKd | exec | 0.01 | 0.04 | 0.01 | 182 | 0.00 | 0.01 | Exe (%): 100.00 | 28.88 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
○Loop 260 - OneBodyJastrowRef.h:231-237 - exec [...] | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | |||
○Loop 261 - OneBodyJastrowRef.h:214-219 - exec [...] | 0 | 0.04 | 0.01 | 179 | 0.00 | 0.01 | 27.13 | |||
►_ZN16miniqmcreference17einspline_spo_refIdE12evaluate_vghERKN11qmcplusplus11ParticleSetEi | exec | 0.01 | 0.06 | 0.02 | 188 | 0.00 | 0.01 | Exe (%): 100.00 | 5.50 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
○Loop 853 - einspline_spo_ref.hpp:206-208 - exec [...] | 0 | 0.03 | 0.01 | 144 | 0.00 | 0.00 | 10.50 | |||
○update_get_addr | ld-linux-x86-64.so.2 | 0.01 | 0.04 | 0.02 | 192 | 0.01 | 0.01 | System (%): 91.54 OMP (%): 8.46 | 47.00 | |
►_ZN16miniqmcreference17OneBodyJastrowRefIN11qmcplusplus14BsplineFunctorIdEEE9recomputeERNS1_11ParticleSetE | exec | 0.01 | 0.03 | 0.02 | 192 | 0.00 | 0.01 | Exe (%): 100.00 | 42.38 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D HAVE_MKL -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D OPENMP_NO_COMPLEX -D _MPICC_H -D res... |
►Loop 269 - OneBodyJastrowRef.h:109-194 - exec [...] | 0 | 0.01 | 0 | 44 | 0.00 | 0.00 | 0.00 | |||
○Loop 283 - stl_numeric.h:141-141 - exec | 0 | 0 | 0 | 52 | 0.00 | 0.00 | 0.00 | |||
○Loop 274 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | |||
○Loop 277 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | |||
○Loop 278 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | |||
○Loop 272 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0.02 | 0.01 | 172 | 0.00 | 0.00 | 18.50 | |||
○Loop 279 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | |||
○Loop 282 - stl_numeric.h:140-141 - exec [...] | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | |||
○Loop 275 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | |||
○Loop 280 - OneBodyJastrowRef.h:186-187 - exec | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | |||
○Loop 273 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0.01 | 0 | 135 | 0.00 | 0.00 | 0.00 | |||
○Loop 276 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | |||
○Loop 271 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0.02 | 0.01 | 186 | 0.00 | 0.01 | 37.38 | |||
○Loop 284 - stl_numeric.h:141-141 - exec | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | |||
○Loop 281 - OneBodyJastrowRef.h:186-187 - exec | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | |||
○Loop 270 - OneBodyJastrowRef.h:186-187 - exec | 0 | 0 | 0 | 55 | 0.00 | 0.00 | 0.00 | |||
○__kmp_get_global_thread_id_reg | libomp.so | 0.01 | 0.06 | 0.03 | 192 | 0.01 | 0.01 | OMP (%): 100.00 | 47.54 | |
○mkl_serv_thread_yield | libmkl_core.so.2 | 0.01 | 0.06 | 0.03 | 189 | 0.01 | 0.01 | Math (%): 100.00 | 0.00 | |
○mm_account_ptr_by_tid | libmkl_core.so.2 | 0.01 | 0.05 | 0.03 | 183 | 0.01 | 0.01 | Math (%): 100.00 | 0.75 | |
○mkl_blas_def_xdgemm_bdz | libmkl_def.so.2 | 0.01 | 0.04 | 0.01 | 192 | 0.00 | 0.01 | Math (%): 100.00 | 170.88 | |
►.omp_outlined..64 | exec | 0.01 | 0.06 | 0.02 | 189 | 0.00 | 0.01 | Exe (%): 100.00 | 23.19 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 --driver-mode=g++ -D ADD_ -D H5_USE_16_API -D HAVE_CONFIG_H -D MPICH_SKIP_MPICXX -D OMPI_SKIP_MPICXX -D _MPICC_H -D restrict=__restrict__ -I /beegfs/hac... |
►Loop 30 - new_allocator.h:111-145 - exec [...] | 0 | 0 | 0 | 3 | 0.00 | 0.00 | 0.00 | |||
►Loop 31 - miniqmc.cpp:425-461 - exec [...] | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | |||
○Loop 32 - miniqmc.cpp:429-458 - exec [...] | 0.01 | 0.06 | 0.02 | 186 | 0.00 | 0.01 | 16.94 | |||
►Loop 33 - StdRandom.h:102-103 - exec [...] | 0 | 0 | 0 | 27 | 0.00 | 0.00 | 0.00 | |||
►Loop 34 - random.tcc:401-3367 - exec [...] | 0 | 0 | 0 | 38 | 0.00 | 0.00 | 0.00 | |||
○Loop 36 - random.tcc:411-414 - exec | 0 | 0 | 0 | 28 | 0.00 | 0.00 | 0.00 | |||
○Loop 35 - random.tcc:403-455 - exec [...] | 0 | 0 | 0 | 7 | 0.00 | 0.00 | 0.00 | |||
○Loop 38 - stl_algobase.h:911-912 - exec | 0 | 0 | 0 | 7 | 0.00 | 0.00 | 0.00 | |||
○Loop 37 - stl_algobase.h:911-912 - exec | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | |||
○_dl_update_slotinfo | ld-linux-x86-64.so.2 | 0.01 | 0.07 | 0.03 | 192 | 0.01 | 0.01 | System (%): 100.00 | 89.79 | |