| ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
| ○Loop 34845 | libarmpl_lp64.so | | dgemm_sve_big | Innermost | 26.05 | 25.50 | 16.80 | NA | NA | NA | NA | NA | NA | NA | 1404.87 | NA | NA | NA | NA | NA |
| ○Loop 34844 | libarmpl_lp64.so | | dgemm_sve_big | InBetween | 21.98 | 21.47 | 14.15 | NA | NA | NA | NA | NA | NA | NA | 208.91 | NA | NA | NA | NA | NA |
| ○Loop 1794 | exec | SoaDistanceTableAAOMPTarget.h:440-442,TinyVector.h:145-145,TinyVector.h:182-182,OhmmsVector.h:223-223,VectorSoAContainer.h:244-244,VectorSoAContainer.h:263-263 | qmcplusplus::SoaDistanceTableAAOMPTarget::update(int) | Single | 13.09 | 12.52 | 8.25 | 1.10 | 1.00 | 4.00 | 3.67 | 1 | 0.00 | 25.00 | 0.72 | 3.67 | 3.33 | 3.67 | 0.92 | 1.00 |
| ○Loop 818 | exec | MultiBsplineRef.hpp:242-262 | void miniqmcreference::MultiBsplineEvalRef::evaluate_vgh(qmcplusplus::bspline_traits::SplineType const*, double, double, double, double*, double*, double*, unsigned long) | Innermost | 12.39 | 11.76 | 7.75 | 1.00 | 1.00 | 1.00 | 1.14 | 1 | 100.00 | 100.00 | 394.46 | 12.00 | 12.00 | 12.00 | 12.00 | 10.50 |
| ○Loop 37166 | libarmpl_lp64.so | | n_interleave_kernel_d8 | Innermost | 10.34 | 9.98 | 6.57 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 812 | exec | MultiBsplineRef.hpp:68-71 | void miniqmcreference::MultiBsplineEvalRef::evaluate_v(qmcplusplus::bspline_traits::SplineType const*, double, double, double, double*, unsigned long) | Innermost | 9.56 | 9.03 | 5.95 | 1.00 | 1.00 | 1.00 | 1.50 | 1 | 100.00 | 100.00 | 86.51 | 3.00 | 3.00 | 3.00 | 3.00 | 2.00 |
| ○Loop 809 | exec | MultiBsplineRef.hpp:68-71 | void miniqmcreference::MultiBsplineEvalRef::evaluate_v(qmcplusplus::bspline_traits::SplineType const*, double, double, double, double*, unsigned long) | Innermost | 9.40 | 8.95 | 5.90 | 1.00 | 1.00 | 1.00 | 1.50 | 1 | 100.00 | 100.00 | 87.29 | 3.00 | 3.00 | 3.00 | 3.00 | 2.00 |
| ○Loop 810 | exec | MultiBsplineRef.hpp:68-71 | void miniqmcreference::MultiBsplineEvalRef::evaluate_v(qmcplusplus::bspline_traits::SplineType const*, double, double, double, double*, unsigned long) | Innermost | 9.23 | 8.89 | 5.86 | 1.00 | 1.00 | 1.00 | 1.50 | 1 | 100.00 | 100.00 | 87.83 | 3.00 | 3.00 | 3.00 | 3.00 | 2.00 |
| ○Loop 811 | exec | MultiBsplineRef.hpp:68-71 | void miniqmcreference::MultiBsplineEvalRef::evaluate_v(qmcplusplus::bspline_traits::SplineType const*, double, double, double, double*, unsigned long) | Innermost | 9.27 | 8.87 | 5.84 | 1.00 | 1.00 | 1.00 | 1.50 | 1 | 100.00 | 100.00 | 87.93 | 3.00 | 3.00 | 3.00 | 3.00 | 2.00 |
| ○Loop 2055 | exec | ParticleBConds3DSoa.h:280-298 | qmcplusplus::SoaDistanceTableABOMPTarget::evaluate(qmcplusplus::ParticleSet&) | Innermost | 6.65 | 6.29 | 4.14 | 1.00 | 1.00 | 1.00 | 1.21 - 1.21 | 1 | 94.00 | 100.00 | 380.26 | 17.00 | 17.00 | 17.00 | 17.00 | 14.00 - 14.08 |
| ○Loop 1262 | exec | ParticleBConds3DSoa.h:235-255 | void qmcplusplus::DTD_BConds::computeDistances, qmcplusplus::VectorSoAContainer >, qmcplusplus::VectorSoAContainer > >(qmcplusplus::TinyVector const&, qmcplusplus::VectorSoAContainer > const&, double*, qmcplusplus::VectorSoAContainer >&, int, int, int) const | Single | 2.47 | 2.23 | 1.47 | 1.00 | 1.00 | 1.00 | 1.86 - 1.85 | 1 | 89.04 | 89.90 | 552.75 | 26.00 | 26.00 | 26.00 | 26.00 | 14.00 - 14.08 |
| ○Loop 303984 | libarmpl_lp64.so | | void armpl::clag::gemv_a_cntg_first(long, long, double, double const*, long, long, double const*, long, double, double*, long) | Innermost | 2.40 | 2.21 | 1.45 | NA | NA | NA | NA | NA | NA | NA | 100.72 | NA | NA | NA | NA | NA |
| ○Loop 324 | exec | BsplineFunctor.h:236-241 | miniqmcreference::TwoBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | InBetween | 2.45 | 2.07 | 1.36 | 3.00 | 1.00 | 6.79 | 3.50 | 1 | 0.00 | 21.15 | 31.79 | 3.50 | 1.17 | 3.50 | 0.52 | 1.00 |
| ○Loop 34842 | libarmpl_lp64.so | | dgemm_sve_big | Innermost | 2.21 | 2.02 | 1.33 | NA | NA | NA | NA | NA | NA | NA | 1220.70 | NA | NA | NA | NA | NA |
| ○Loop 303664 | libarmpl_lp64.so | | void armpl::clag::gemv_a_strd_first_impl, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double>(long, long, double, double const*, long, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double const*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, double*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>) [clone .isra.0] | Innermost | 1.42 | 1.21 | 0.80 | NA | NA | NA | NA | NA | NA | NA | 117.38 | NA | NA | NA | NA | NA |
| ○Loop 304781 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 1.36 | 1.15 | 0.75 | NA | NA | NA | NA | NA | NA | NA | 0.24 | NA | NA | NA | NA | NA |
| ○Loop 303980 | libarmpl_lp64.so | | void armpl::clag::gemv_a_cntg_first(long, long, double, double const*, long, long, double const*, long, double, double*, long) | Innermost | 1.08 | 0.94 | 0.62 | NA | NA | NA | NA | NA | NA | NA | 67.30 | NA | NA | NA | NA | NA |
| ○Loop 304835 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 1.10 | 0.93 | 0.61 | NA | NA | NA | NA | NA | NA | NA | 0.20 | NA | NA | NA | NA | NA |
| ○Loop 272 | exec | BsplineFunctor.h:291-298 | qmcplusplus::BsplineFunctor::evaluateVGL(int, int, int, double const*, double*, double*, double*, double*, int*) const | Single | 0.65 | 0.56 | 0.37 | 1.25 | 1.00 | 5.00 | 1.25 | 2 | 0.00 | 22.92 | 44.45 | 1.25 | 1.00 | 1.25 | 0.25 | 1.00 |
| ○Loop 918 | exec | OperatorTags.h:63-63,OperatorTags.h:94-94,inner_product.hpp:155-155 | miniqmcreference::DiracDeterminantRef >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&, bool) | Innermost | 0.65 | 0.55 | 0.36 | 1.00 | 1.75 | 1.47 | 1.00 | 1 | 81.82 | 59.09 | 65.24 | 2.00 | 2.00 | 1.14 | 1.36 | 2.00 |
| ○Loop 910 | exec | OperatorTags.h:63-63,OperatorTags.h:94-94,inner_product.hpp:155-155 | miniqmcreference::DiracDeterminantRef >::evalGrad(qmcplusplus::ParticleSet&, int) | Single | 0.69 | 0.52 | 0.35 | 1.00 | 1.75 | 1.47 | 1.00 | 1 | 81.82 | 59.09 | 68.45 | 2.00 | 2.00 | 1.14 | 1.36 | 2.00 |
| ○Loop 801 | exec | TinyVector.h:145-145,einspline_spo_ref.hpp:223-227,OhmmsVector.h:223-223,VectorSoAContainer.h:231-231,VectorSoAContainer.h:271-271,stl_vector.h:1131-1131,stl_algobase.h:238-238 | miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&, qmcplusplus::Vector, std::allocator > >&, qmcplusplus::Vector >&) | Innermost | 0.59 | 0.48 | 0.32 | 2.33 | 1.00 | 5.25 | 7.00 | 1 | 9.09 | 26.14 | 0.00 | 7.00 | 3.00 | 7.00 | 1.33 | 1.00 |
| ○Loop 802 | exec | inner_product.hpp:82-83 | qmcplusplus::SPOSet::evaluateDetRatios(qmcplusplus::VirtualParticleSet const&, qmcplusplus::Vector >&, qmcplusplus::Vector > const&, std::vector >&) | Innermost | 0.62 | 0.48 | 0.31 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 85.71 | 89.29 | 99.58 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 303666 | libarmpl_lp64.so | | void armpl::clag::gemv_a_strd_first_impl, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double>(long, long, double, double const*, long, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double const*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, double*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>) [clone .isra.0] | Single | 0.53 | 0.42 | 0.28 | NA | NA | NA | NA | NA | NA | NA | 125.83 | NA | NA | NA | NA | NA |
| ○Loop 314 | exec | TwoBodyJastrowRef.h:342-347 | miniqmcreference::TwoBodyJastrowRef >::acceptMove(qmcplusplus::ParticleSet&, int) | Single | 0.42 | 0.33 | 0.22 | 1.00 | 1.00 | 1.00 | 1.50 | 1 | 95.24 | 96.43 | 92.38 | 6.00 | 6.00 | 6.00 | 6.00 | 4.00 |
| ○Loop 313 | exec | TwoBodyJastrowRef.h:342-347 | miniqmcreference::TwoBodyJastrowRef >::acceptMove(qmcplusplus::ParticleSet&, int) | Single | 0.44 | 0.32 | 0.21 | 1.00 | 1.00 | 1.00 | 1.50 | 1 | 95.24 | 96.43 | 92.21 | 6.00 | 6.00 | 6.00 | 6.00 | 4.00 |
| ○Loop 315 | exec | TwoBodyJastrowRef.h:342-347 | miniqmcreference::TwoBodyJastrowRef >::acceptMove(qmcplusplus::ParticleSet&, int) | Single | 0.39 | 0.32 | 0.21 | 1.00 | 1.00 | 1.00 | 1.50 | 1 | 95.24 | 96.43 | 93.21 | 6.00 | 6.00 | 6.00 | 6.00 | 4.00 |
| ○Loop 919 | exec | inner_product.hpp:82-83 | miniqmcreference::DiracDeterminantRef >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&, bool) | Innermost | 0.39 | 0.30 | 0.20 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 85.71 | 89.29 | 40.10 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 316 | exec | TwoBodyJastrowRef.h:324-331 | miniqmcreference::TwoBodyJastrowRef >::acceptMove(qmcplusplus::ParticleSet&, int) | Single | 0.35 | 0.27 | 0.18 | 1.00 | 1.00 | 1.00 | 1.43 | 1 | 97.14 | 97.86 | 203.48 | 10.00 | 10.00 | 10.00 | 10.00 | 7.00 |
| ○Loop 42542 | libarmpl_lp64.so | | dswap_ | Single | 0.32 | 0.25 | 0.16 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 303660 | libarmpl_lp64.so | | void armpl::clag::gemv_a_strd_first_impl, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double>(long, long, double, double const*, long, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double const*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, double*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>) [clone .isra.0] | Innermost | 0.32 | 0.25 | 0.16 | NA | NA | NA | NA | NA | NA | NA | 102.63 | NA | NA | NA | NA | NA |
| ○Loop 816 | exec | MultiBsplineRef.hpp:276-286 | void miniqmcreference::MultiBsplineEvalRef::evaluate_vgh(qmcplusplus::bspline_traits::SplineType const*, double, double, double, double*, double*, double*, unsigned long) | Single | 0.32 | 0.23 | 0.15 | 1.00 | 1.00 | 1.00 | 2.00 | 1 | 100.00 | 100.00 | 288.86 | 9.00 | 9.00 | 9.00 | 9.00 | 4.50 |
| ○Loop 304604 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<16ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 0.31 | 0.19 | 0.13 | NA | NA | NA | NA | NA | NA | NA | 0.30 | NA | NA | NA | NA | NA |
| ○Loop 300 | exec | TwoBodyJastrowRef.h:155-156 | miniqmcreference::TwoBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.25 | 0.18 | 0.12 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 85.71 | 89.29 | 136.16 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 301 | exec | TwoBodyJastrowRef.h:155-156 | miniqmcreference::TwoBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.22 | 0.17 | 0.11 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 100.00 | 142.60 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 302 | exec | TwoBodyJastrowRef.h:155-156 | miniqmcreference::TwoBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.22 | 0.16 | 0.11 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 85.71 | 89.29 | 148.52 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 63 | exec | NonLocalPP.hpp:126-126,NonLocalPP.hpp:129-135,OhmmsVector.h:229-229,ParticleSet.h:277-277,stl_vector.h:993-993,stl_vector.h:1131-1131,stl_vector.h:1150-1150,unique_ptr.h:193-193 | qmcplusplus::NonLocalPP::evaluate(qmcplusplus::ParticleSet const&, qmcplusplus::WaveFunction&) | Outermost | 0.20 | 0.12 | 0.08 | 9.00 | 1.00 | 5.82 | 24.00 | 1 | 0.00 | 22.79 | 9.17 | 6.00 | 0.67 | 6.00 | 1.03 | 0.25 |
| ○Loop 892 | exec | inner_product.hpp:211-212 | qmcplusplus::DiracMatrix::invert_transpose(qmcplusplus::Matrix > const&, qmcplusplus::Matrix >&, double&, double&) | Innermost | 0.12 | 0.10 | 0.06 | 1.00 | 1.00 | 4.00 | 2.67 | 1 | 0.00 | 25.00 | 0.00 | 2.67 | 2.67 | 2.67 | 0.67 | 1.00 |
| ○Loop 912 | exec | inner_product.hpp:82-83 | miniqmcreference::DiracDeterminantRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.14 | 0.10 | 0.06 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 85.71 | 89.29 | 125.38 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 904 | exec | OperatorTags.h:63-63,OperatorTags.h:94-94,inner_product.hpp:155-155 | miniqmcreference::DiracDeterminantRef >::evaluateLog(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&) | Innermost | 0.13 | 0.09 | 0.06 | 1.00 | 1.75 | 1.47 | 1.00 | 1 | 81.82 | 59.09 | 80.05 | 2.00 | 2.00 | 1.14 | 1.36 | 2.00 |
| ○Loop 913 | exec | OperatorTags.h:63-63,OperatorTags.h:94-94,inner_product.hpp:155-155 | miniqmcreference::DiracDeterminantRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.13 | 0.09 | 0.06 | 1.00 | 1.75 | 1.47 | 1.00 | 1 | 81.82 | 59.09 | 417.67 | 2.00 | 2.00 | 1.14 | 1.36 | 2.00 |
| ○Loop 271 | exec | stl_vector.h:1150-1150,BsplineFunctor.h:303-336 | qmcplusplus::BsplineFunctor::evaluateVGL(int, int, int, double const*, double*, double*, double*, double*, int*) const | Single | 0.11 | 0.06 | 0.04 | 1.00 | 1.00 | 1.01 | 1.19 | 1 | 91.14 | 98.10 | 573.80 | 25.50 | 25.50 | 25.50 | 25.13 | 21.50 |
| ○Loop 304833 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | InBetween | 0.09 | 0.06 | 0.04 | NA | NA | NA | NA | NA | NA | NA | 0.18 | NA | NA | NA | NA | NA |
| ○Loop 277 | exec | BsplineFunctor.h:236-241 | qmcplusplus::BsplineFunctor::evaluateV(int, int, int, double const*, double*) const | Single | 0.10 | 0.06 | 0.04 | 1.80 | 1.00 | 6.00 | 6.00 | 1 | 0.00 | 20.83 | 6.45 | 1.50 | 0.83 | 1.50 | 0.25 | 0.25 |
| ○Loop 326 | exec | stl_vector.h:1150-1150,BsplineFunctor.h:246-260 | miniqmcreference::TwoBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | Innermost | 0.09 | 0.05 | 0.03 | 1.00 | 1.00 | 1.03 | 1.21 | 1 | 75.00 | 93.75 | 797.91 | 14.50 | 14.50 | 14.50 | 14.13 | 12.00 |
| ○Loop 34841 | libarmpl_lp64.so | | dgemm_sve_big | InBetween | 0.09 | 0.04 | 0.03 | NA | NA | NA | NA | NA | NA | NA | 1099.40 | NA | NA | NA | NA | NA |
| ○Loop 803 | exec | OhmmsVector.h:178-178,SPOSet.h:83-86,ParticleSet.h:277-277,stl_vector.h:1131-1131,inner_product.hpp:82-82 | qmcplusplus::SPOSet::evaluateDetRatios(qmcplusplus::VirtualParticleSet const&, qmcplusplus::Vector >&, qmcplusplus::Vector > const&, std::vector >&) | InBetween | 0.08 | 0.04 | 0.02 | 1.67 | 1.00 | 5.27 | 2.17 | 1 | 0.00 | 23.33 | 41.15 | 3.63 | 2.17 | 3.63 | 0.69 | 1.67 |
| ○Loop 2053 | exec | SoaDistanceTableABOMPTarget.h:215-215,SoaDistanceTableABOMPTarget.h:222-222,SoaDistanceTableABOMPTarget.h:228-228,ParticleBConds3DSoa.h:298-298 | qmcplusplus::SoaDistanceTableABOMPTarget::evaluate(qmcplusplus::ParticleSet&) | InBetween | 0.06 | 0.03 | 0.02 | 3.15 | 1.00 | 2.43 | 3.15 | 20 | 2.59 | 24.03 | 164.19 | 26.75 | 8.50 | 26.75 | 11.00 | 8.50 |
| ○Loop 323 | exec | BsplineFunctor.h:238-241 | miniqmcreference::TwoBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | Outermost | 0.07 | 0.03 | 0.02 | 1.20 | 1.00 | 4.00 | 4.00 | 1 | 0.00 | 25.00 | 72.77 | 1.00 | 0.83 | 1.00 | 0.25 | 0.25 |
| ○Loop 334699 | libarmpl_lp64.so | | void armpl::clag::lu_unblocked_direct_kernel(long, long, double*, long, int*, int&) | Innermost | 0.06 | 0.03 | 0.02 | NA | NA | NA | NA | NA | NA | NA | 281.30 | NA | NA | NA | NA | NA |
| ○Loop 304754 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | InBetween | 0.05 | 0.03 | 0.02 | NA | NA | NA | NA | NA | NA | NA | 6.61 | NA | NA | NA | NA | NA |
| ○Loop 224 | exec | OneBodyJastrowRef.h:192-193 | miniqmcreference::OneBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.05 | 0.03 | 0.02 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 85.71 | 89.29 | 72.76 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 307 | exec | stl_numeric.h:140-141 | miniqmcreference::TwoBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.06 | 0.02 | 0.02 | 1.00 | 1.00 | 1.12 | 1.00 | 1 | 80.00 | 85.00 | 415.32 | 2.00 | 2.00 | 2.00 | 1.79 | 2.00 |
| ○Loop 905 | exec | inner_product.hpp:82-83 | miniqmcreference::DiracDeterminantRef >::evaluateLog(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&) | Innermost | 0.04 | 0.02 | 0.02 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 85.71 | 89.29 | 88.36 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 2052 | exec | SoaDistanceTableABOMPTarget.h:214-214 | qmcplusplus::SoaDistanceTableABOMPTarget::evaluate(qmcplusplus::ParticleSet&) | Outermost | 0.05 | 0.02 | 0.01 | 1.21 | 1.00 | 1.21 | 34.00 | 20 | 58.82 | 63.53 | 64.51 | 17.00 | 14.00 | 17.00 | 14.00 | 0.50 |
| ○Loop 245 | exec | OneBodyJastrowRef.h:134-135,OhmmsVector.h:223-223,OhmmsVector.h:249-249,stl_vector.h:993-993,stl_vector.h:1131-1131,stl_vector.h:1150-1150 | miniqmcreference::OneBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | Single | 0.04 | 0.02 | 0.01 | 1.42 | 1.70 | 6.18 | 1.42 | 1 | 0.00 | 22.50 | 2.12 | 2.83 | 2.00 | 1.67 | 0.46 | 2.00 |
| ○Loop 226 | exec | OneBodyJastrowRef.h:192-193 | miniqmcreference::OneBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.04 | 0.02 | 0.01 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 85.71 | 89.29 | 70.98 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 225 | exec | OneBodyJastrowRef.h:192-193 | miniqmcreference::OneBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.04 | 0.02 | 0.01 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 100.00 | 79.47 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 322 | exec | OhmmsVector.h:223-223,OhmmsVector.h:229-229,TwoBodyJastrowRef.h:108-108,TwoBodyJastrowRef.h:126-127,ParticleSet.h:313-313,BsplineFunctor.h:233-233,BsplineFunctor.h:236-236 | miniqmcreference::TwoBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | InBetween | 0.04 | 0.02 | 0.01 | 3.13 | 1.00 | 1.14 | 6.25 | 1 | 0.00 | 43.75 | 12.80 | 3.13 | 1.00 | 3.13 | 2.75 | 0.50 |
| ○Loop 33845 | libarmpl_lp64.so | | daxpby_sve_kernel | Single | 0.05 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 186.28 | NA | NA | NA | NA | NA |
| ○Loop 325 | exec | TwoBodyJastrowRef.h:132-132,BsplineFunctor.h:236-241,BsplineFunctor.h:246-246 | miniqmcreference::TwoBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | InBetween | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 34.30 | NA | NA | NA | NA | NA |
| ○Loop 339 | exec | TwoBodyJastrowRef.h:381-382 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.06 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 93.20 | NA | NA | NA | NA | NA |
| ○Loop 337 | exec | TwoBodyJastrowRef.h:381-382 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 101.05 | NA | NA | NA | NA | NA |
| ○Loop 366398 | libarmpl_lp64.so | | void armpl::clag::(anonymous namespace)::trsm_kernel(double const*, long, long, double*, long, long, long, long) | Single | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 68.74 | NA | NA | NA | NA | NA |
| ○Loop 114 | exec | OperatorTags.h:94-94,WaveFunction.cpp:185-188,refwrap.h:351-351,NewTimer.h:242-242,NewTimer.h:249-249,stl_vector.h:993-993,stl_vector.h:1131-1131 | qmcplusplus::WaveFunction::evalGrad(qmcplusplus::ParticleSet&, int) | Single | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 1.71 | NA | NA | NA | NA | NA |
| ○Loop 338 | exec | TwoBodyJastrowRef.h:381-382 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 103.63 | NA | NA | NA | NA | NA |
| ○Loop 799 | exec | stl_algo.h:709-709,einspline_spo_ref.hpp:183-187,stl_algobase.h:238-238,stl_algobase.h:413-413,stl_algobase.h:450-452 | miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&) | Single | 0.05 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 28 | exec | TinyVector.h:62-62,refwrap.h:351-351,stl_vector.h:1131-1131,miniqmc.cpp:416-416,miniqmc.cpp:429-429,miniqmc.cpp:432-434,miniqmc.cpp:437-437,miniqmc.cpp:440-443,miniqmc.cpp:446-446,miniqmc.cpp:449-454,miniqmc.cpp:457-458 | main.omp_outlined.62 | Outermost | 0.05 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 0.28 | NA | NA | NA | NA | NA |
| ○Loop 334 | exec | TwoBodyJastrowRef.h:388-391 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 209.18 | NA | NA | NA | NA | NA |
| ○Loop 304782 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | InBetween | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 249 | exec | stl_algobase.h:951-952 | qmcplusplus::Vector >::resize(unsigned long, double) | Single | 0.03 | 0.01 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 813 | exec | MultiBsplineRef.hpp:63-63,MultiBsplineRef.hpp:66-68 | void miniqmcreference::MultiBsplineEvalRef::evaluate_v(qmcplusplus::bspline_traits::SplineType const*, double, double, double, double*, unsigned long) | Outermost | 0.03 | 0.01 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 55.14 | NA | NA | NA | NA | NA |
| ○Loop 1113 | exec | ParticleSet.cpp:242-243,stl_vector.h:993-993,unique_ptr.h:193-193 | qmcplusplus::ParticleSet::update(bool) | Single | 0.04 | 0.01 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 303661 | libarmpl_lp64.so | | void armpl::clag::gemv_a_strd_first_impl, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double>(long, long, double, double const*, long, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double const*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, double*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>) [clone .isra.0] | Single | 0.04 | 0.01 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 59.14 | NA | NA | NA | NA | NA |
| ○Loop 253 | exec | stl_algobase.h:939-940 | qmcplusplus::Vector, std::allocator > >::resize(unsigned long, qmcplusplus::TinyVector) | Single | 0.04 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 808 | exec | einspline_spo_ref.hpp:175-176,stl_vector.h:1131-1131,stl_vector.h:1263-1263 | miniqmcreference::einspline_spo_ref::evaluate_v(qmcplusplus::ParticleSet const&, int) | Single | 0.04 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 7.94 | NA | NA | NA | NA | NA |
| ○Loop 814 | exec | einspline_spo_ref.hpp:206-208,VectorSoAContainer.h:265-265,stl_vector.h:1131-1131,stl_vector.h:1263-1263 | miniqmcreference::einspline_spo_ref::evaluate_vgh(qmcplusplus::ParticleSet const&, int) | Single | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 9.12 | NA | NA | NA | NA | NA |
| ○Loop 304547 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<16ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 1.22 | NA | NA | NA | NA | NA |
| ○Loop 119 | exec | WaveFunction.cpp:269-269,WaveFunction.cpp:272-273,refwrap.h:351-351,NewTimer.h:242-242,NewTimer.h:249-249,stl_vector.h:993-993,stl_vector.h:1131-1131 | qmcplusplus::WaveFunction::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | InBetween | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 115 | exec | WaveFunction.cpp:198-201,refwrap.h:351-351,NewTimer.h:242-242,NewTimer.h:249-249,stl_vector.h:993-993,stl_vector.h:1131-1131 | qmcplusplus::WaveFunction::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.04 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.36 | NA | NA | NA | NA | NA |
| ○Loop 64 | exec | OperatorTags.h:43-43,OperatorTags.h:63-63,TinyVector.h:144-145,NonLocalPP.hpp:131-132,OhmmsVector.h:223-223,OhmmsVector.h:229-229,VectorSoAContainer.h:231-231,stl_vector.h:993-993,stl_vector.h:1131-1131 | qmcplusplus::NonLocalPP::evaluate(qmcplusplus::ParticleSet const&, qmcplusplus::WaveFunction&) | Innermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 1.11 | NA | NA | NA | NA | NA |
| ○Loop 304492 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<16ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 351 | exec | TwoBodyJastrowRef.h:397-398 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 113.87 | NA | NA | NA | NA | NA |
| ○Loop 354 | exec | TwoBodyJastrowRef.h:397-398 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 137.36 | NA | NA | NA | NA | NA |
| ○Loop 321 | exec | TwoBodyJastrowRef.h:107-108,TwoBodyJastrowRef.h:126-127,refwrap.h:351-351,optional:469-469,optional:991-991,stl_vector.h:993-993,stl_vector.h:1131-1131 | miniqmcreference::TwoBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | Innermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 15.46 | NA | NA | NA | NA | NA |
| ○Loop 1478 | exec | stl_tree.h:1947-1947,NewTimer.h:119-119 | qmcplusplus::TimerType::stop() | Innermost | 0.14 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 1.10 | NA | NA | NA | NA | NA |
| ○Loop 304756 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | InBetween | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 59.71 | NA | NA | NA | NA | NA |
| ○Loop 270 | exec | OneBodyJastrowRef.h:214-219,OhmmsVector.h:223-223,shared_ptr_base.h:1667-1667,ParticleSet.h:313-313,stl_vector.h:1131-1131,stl_vector.h:1263-1263 | miniqmcreference::OneBodyJastrowRef >::computeU3(qmcplusplus::ParticleSet&, int, double const*) | Single | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 1.27 | NA | NA | NA | NA | NA |
| ○Loop 348 | exec | TwoBodyJastrowRef.h:397-398 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 135.86 | NA | NA | NA | NA | NA |
| ○Loop 819 | exec | MultiBsplineRef.hpp:226-227,MultiBsplineRef.hpp:234-239,MultiBsplineRef.hpp:242-242 | void miniqmcreference::MultiBsplineEvalRef::evaluate_vgh(qmcplusplus::bspline_traits::SplineType const*, double, double, double, double*, double*, double*, unsigned long) | Outermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 159.68 | NA | NA | NA | NA | NA |
| ○Loop 304018 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<11ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 110.16 | NA | NA | NA | NA | NA |
| ○Loop 37170 | libarmpl_lp64.so | | n_interleave_kernel_d8 | Outermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 1476 | exec | stl_tree.h:1947-1947,NewTimer.h:119-119 | qmcplusplus::TimerType::stop() | Innermost | 0.07 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 2.12 | NA | NA | NA | NA | NA |
| ○Loop 922 | exec | OperatorTags.h:94-94,OhmmsVector.h:223-223,inner_product.hpp:82-82,inner_product.hpp:155-155,DiracDeterminantRef.cpp:173-173,DiracDeterminantRef.cpp:178-178 | miniqmcreference::DiracDeterminantRef >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&, bool) | InBetween | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 9.92 | NA | NA | NA | NA | NA |
| ○Loop 303983 | libarmpl_lp64.so | | void armpl::clag::gemv_a_cntg_first(long, long, double, double const*, long, long, double const*, long, double, double*, long) | Outermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 58.98 | NA | NA | NA | NA | NA |
| ○Loop 308 | exec | OhmmsVector.h:223-223,TwoBodyJastrowRef.h:269-274,shared_ptr_base.h:1667-1667,ParticleSet.h:313-313,stl_vector.h:1131-1131,stl_vector.h:1263-1263 | miniqmcreference::TwoBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 9.07 | NA | NA | NA | NA | NA |
| ○Loop 35071 | libarmpl_lp64.so | | ddot_kernel | Single | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 29.49 | NA | NA | NA | NA | NA |
| ○Loop 34827 | libarmpl_lp64.so | | dgemm_reference_ | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 2.27 | NA | NA | NA | NA | NA |
| ○Loop 33846 | libarmpl_lp64.so | | daxpby_sve_kernel | Single | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 63.52 | NA | NA | NA | NA | NA |
| ○Loop 1120 | exec | ParticleSet.cpp:343-344,stl_vector.h:993-993,unique_ptr.h:193-193 | qmcplusplus::ParticleSet::computeNewPosDistTables(int, qmcplusplus::TinyVector const&, bool) | Single | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 12.70 | NA | NA | NA | NA | NA |
| ○Loop 336 | exec | TwoBodyJastrowRef.h:375-376 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 174.67 | NA | NA | NA | NA | NA |
| ○Loop 37168 | libarmpl_lp64.so | | n_interleave_kernel_d8 | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 276 | exec | stl_vector.h:1150-1150,BsplineFunctor.h:246-260 | qmcplusplus::BsplineFunctor::evaluateV(int, int, int, double const*, double*) const | Single | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 304605 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<16ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | InBetween | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 304493 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<16ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | InBetween | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 304032 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<10ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Outermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 5.29 | NA | NA | NA | NA | NA |
| ○Loop 304028 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<10ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Outermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 282 | exec | OneBodyJastrowRef.h:192-193 | miniqmcreference::OneBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 31.76 | NA | NA | NA | NA | NA |
| ○Loop 303659 | libarmpl_lp64.so | | void armpl::clag::gemv_a_strd_first_impl, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double>(long, long, double, double const*, long, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double const*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, double*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>) [clone .isra.0] | Outermost | 0.01 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 42.34 | NA | NA | NA | NA | NA |