| ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
| ○Loop 34845 | libarmpl_lp64.so | | dgemm_sve_big | Innermost | 25.72 | 25.41 | 16.75 | NA | NA | NA | NA | NA | NA | NA | 1408.48 | NA | NA | NA | NA | NA |
| ○Loop 34844 | libarmpl_lp64.so | | dgemm_sve_big | InBetween | 22.11 | 21.49 | 14.16 | NA | NA | NA | NA | NA | NA | NA | 209.81 | NA | NA | NA | NA | NA |
| ○Loop 1794 | exec | SoaDistanceTableAAOMPTarget.h:440-442,TinyVector.h:145-145,TinyVector.h:182-182,OhmmsVector.h:223-223,VectorSoAContainer.h:244-244,VectorSoAContainer.h:263-263 | qmcplusplus::SoaDistanceTableAAOMPTarget::update(int) | Single | 12.89 | 12.50 | 8.24 | 1.10 | 1.00 | 4.00 | 3.67 | 1 | 0.00 | 25.00 | 0.72 | 3.67 | 3.33 | 3.67 | 0.92 | 1.00 |
| ○Loop 818 | exec | MultiBsplineRef.hpp:242-262 | void miniqmcreference::MultiBsplineEvalRef::evaluate_vgh(qmcplusplus::bspline_traits::SplineType const*, double, double, double, double*, double*, double*, unsigned long) | Innermost | 12.22 | 11.69 | 7.70 | 1.00 | 1.00 | 1.00 | 1.14 | 1 | 100.00 | 100.00 | 396.69 | 12.00 | 12.00 | 12.00 | 12.00 | 10.50 |
| ○Loop 37166 | libarmpl_lp64.so | | n_interleave_kernel_d8 | Innermost | 10.53 | 10.01 | 6.60 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 812 | exec | MultiBsplineRef.hpp:68-71 | void miniqmcreference::MultiBsplineEvalRef::evaluate_v(qmcplusplus::bspline_traits::SplineType const*, double, double, double, double*, unsigned long) | Innermost | 9.50 | 9.06 | 5.97 | 1.00 | 1.00 | 1.00 | 1.50 | 1 | 100.00 | 100.00 | 86.15 | 3.00 | 3.00 | 3.00 | 3.00 | 2.00 |
| ○Loop 809 | exec | MultiBsplineRef.hpp:68-71 | void miniqmcreference::MultiBsplineEvalRef::evaluate_v(qmcplusplus::bspline_traits::SplineType const*, double, double, double, double*, unsigned long) | Innermost | 9.43 | 9.00 | 5.93 | 1.00 | 1.00 | 1.00 | 1.50 | 1 | 100.00 | 100.00 | 86.69 | 3.00 | 3.00 | 3.00 | 3.00 | 2.00 |
| ○Loop 810 | exec | MultiBsplineRef.hpp:68-71 | void miniqmcreference::MultiBsplineEvalRef::evaluate_v(qmcplusplus::bspline_traits::SplineType const*, double, double, double, double*, unsigned long) | Innermost | 9.40 | 8.88 | 5.85 | 1.00 | 1.00 | 1.00 | 1.50 | 1 | 100.00 | 100.00 | 87.88 | 3.00 | 3.00 | 3.00 | 3.00 | 2.00 |
| ○Loop 811 | exec | MultiBsplineRef.hpp:68-71 | void miniqmcreference::MultiBsplineEvalRef::evaluate_v(qmcplusplus::bspline_traits::SplineType const*, double, double, double, double*, unsigned long) | Innermost | 9.46 | 8.88 | 5.85 | 1.00 | 1.00 | 1.00 | 1.50 | 1 | 100.00 | 100.00 | 88.11 | 3.00 | 3.00 | 3.00 | 3.00 | 2.00 |
| ○Loop 2055 | exec | ParticleBConds3DSoa.h:280-298 | qmcplusplus::SoaDistanceTableABOMPTarget::evaluate(qmcplusplus::ParticleSet&) | Innermost | 6.68 | 6.24 | 4.11 | 1.00 | 1.00 | 1.00 | 1.21 - 1.21 | 1 | 94.00 | 100.00 | 383.05 | 17.00 | 17.00 | 17.00 | 17.00 | 14.00 - 14.08 |
| ○Loop 1262 | exec | ParticleBConds3DSoa.h:235-255 | void qmcplusplus::DTD_BConds::computeDistances, qmcplusplus::VectorSoAContainer >, qmcplusplus::VectorSoAContainer > >(qmcplusplus::TinyVector const&, qmcplusplus::VectorSoAContainer > const&, double*, qmcplusplus::VectorSoAContainer >&, int, int, int) const | Single | 2.48 | 2.23 | 1.47 | 1.00 | 1.00 | 1.00 | 1.86 - 1.85 | 1 | 89.04 | 89.90 | 552.13 | 26.00 | 26.00 | 26.00 | 26.00 | 14.00 - 14.08 |
| ○Loop 303984 | libarmpl_lp64.so | | void armpl::clag::gemv_a_cntg_first(long, long, double, double const*, long, long, double const*, long, double, double*, long) | Innermost | 2.40 | 2.18 | 1.44 | NA | NA | NA | NA | NA | NA | NA | 102.47 | NA | NA | NA | NA | NA |
| ○Loop 324 | exec | BsplineFunctor.h:236-241 | miniqmcreference::TwoBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | InBetween | 2.25 | 2.07 | 1.37 | 3.00 | 1.00 | 6.79 | 3.50 | 1 | 0.00 | 21.15 | 31.77 | 3.50 | 1.17 | 3.50 | 0.52 | 1.00 |
| ○Loop 34842 | libarmpl_lp64.so | | dgemm_sve_big | Innermost | 2.16 | 2.03 | 1.34 | NA | NA | NA | NA | NA | NA | NA | 1214.60 | NA | NA | NA | NA | NA |
| ○Loop 303664 | libarmpl_lp64.so | | void armpl::clag::gemv_a_strd_first_impl, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double>(long, long, double, double const*, long, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double const*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, double*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>) [clone .isra.0] | Innermost | 1.38 | 1.21 | 0.80 | NA | NA | NA | NA | NA | NA | NA | 117.55 | NA | NA | NA | NA | NA |
| ○Loop 304781 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 1.32 | 1.13 | 0.75 | NA | NA | NA | NA | NA | NA | NA | 0.25 | NA | NA | NA | NA | NA |
| ○Loop 303980 | libarmpl_lp64.so | | void armpl::clag::gemv_a_cntg_first(long, long, double, double const*, long, long, double const*, long, double, double*, long) | Innermost | 1.09 | 0.94 | 0.62 | NA | NA | NA | NA | NA | NA | NA | 66.88 | NA | NA | NA | NA | NA |
| ○Loop 304835 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 1.05 | 0.92 | 0.60 | NA | NA | NA | NA | NA | NA | NA | 0.17 | NA | NA | NA | NA | NA |
| ○Loop 918 | exec | OperatorTags.h:63-63,OperatorTags.h:94-94,inner_product.hpp:155-155 | miniqmcreference::DiracDeterminantRef >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&, bool) | Innermost | 0.67 | 0.55 | 0.36 | 1.00 | 1.75 | 1.47 | 1.00 | 1 | 81.82 | 59.09 | 65.35 | 2.00 | 2.00 | 1.14 | 1.36 | 2.00 |
| ○Loop 272 | exec | BsplineFunctor.h:291-298 | qmcplusplus::BsplineFunctor::evaluateVGL(int, int, int, double const*, double*, double*, double*, double*, int*) const | Single | 0.69 | 0.55 | 0.36 | 1.25 | 1.00 | 5.00 | 1.25 | 2 | 0.00 | 22.92 | 44.71 | 1.25 | 1.00 | 1.25 | 0.25 | 1.00 |
| ○Loop 910 | exec | OperatorTags.h:63-63,OperatorTags.h:94-94,inner_product.hpp:155-155 | miniqmcreference::DiracDeterminantRef >::evalGrad(qmcplusplus::ParticleSet&, int) | Single | 0.73 | 0.53 | 0.35 | 1.00 | 1.75 | 1.47 | 1.00 | 1 | 81.82 | 59.09 | 68.11 | 2.00 | 2.00 | 1.14 | 1.36 | 2.00 |
| ○Loop 802 | exec | inner_product.hpp:82-83 | qmcplusplus::SPOSet::evaluateDetRatios(qmcplusplus::VirtualParticleSet const&, qmcplusplus::Vector >&, qmcplusplus::Vector > const&, std::vector >&) | Innermost | 0.58 | 0.48 | 0.32 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 85.71 | 89.29 | 97.96 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 801 | exec | TinyVector.h:145-145,einspline_spo_ref.hpp:223-227,OhmmsVector.h:223-223,VectorSoAContainer.h:231-231,VectorSoAContainer.h:271-271,stl_vector.h:1131-1131,stl_algobase.h:238-238 | miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&, qmcplusplus::Vector, std::allocator > >&, qmcplusplus::Vector >&) | Innermost | 0.57 | 0.47 | 0.31 | 2.33 | 1.00 | 5.25 | 7.00 | 1 | 9.09 | 26.14 | 0.00 | 7.00 | 3.00 | 7.00 | 1.33 | 1.00 |
| ○Loop 303666 | libarmpl_lp64.so | | void armpl::clag::gemv_a_strd_first_impl, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double>(long, long, double, double const*, long, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double const*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, double*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>) [clone .isra.0] | Single | 0.52 | 0.42 | 0.28 | NA | NA | NA | NA | NA | NA | NA | 124.77 | NA | NA | NA | NA | NA |
| ○Loop 314 | exec | TwoBodyJastrowRef.h:342-347 | miniqmcreference::TwoBodyJastrowRef >::acceptMove(qmcplusplus::ParticleSet&, int) | Single | 0.40 | 0.33 | 0.22 | 1.00 | 1.00 | 1.00 | 1.50 | 1 | 95.24 | 96.43 | 92.19 | 6.00 | 6.00 | 6.00 | 6.00 | 4.00 |
| ○Loop 315 | exec | TwoBodyJastrowRef.h:342-347 | miniqmcreference::TwoBodyJastrowRef >::acceptMove(qmcplusplus::ParticleSet&, int) | Single | 0.40 | 0.33 | 0.22 | 1.00 | 1.00 | 1.00 | 1.50 | 1 | 95.24 | 96.43 | 91.47 | 6.00 | 6.00 | 6.00 | 6.00 | 4.00 |
| ○Loop 313 | exec | TwoBodyJastrowRef.h:342-347 | miniqmcreference::TwoBodyJastrowRef >::acceptMove(qmcplusplus::ParticleSet&, int) | Single | 0.42 | 0.32 | 0.21 | 1.00 | 1.00 | 1.00 | 1.50 | 1 | 95.24 | 96.43 | 94.87 | 6.00 | 6.00 | 6.00 | 6.00 | 4.00 |
| ○Loop 919 | exec | inner_product.hpp:82-83 | miniqmcreference::DiracDeterminantRef >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&, bool) | Innermost | 0.39 | 0.30 | 0.20 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 85.71 | 89.29 | 40.01 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 316 | exec | TwoBodyJastrowRef.h:324-331 | miniqmcreference::TwoBodyJastrowRef >::acceptMove(qmcplusplus::ParticleSet&, int) | Single | 0.39 | 0.27 | 0.18 | 1.00 | 1.00 | 1.00 | 1.43 | 1 | 97.14 | 97.86 | 199.69 | 10.00 | 10.00 | 10.00 | 10.00 | 7.00 |
| ○Loop 303660 | libarmpl_lp64.so | | void armpl::clag::gemv_a_strd_first_impl, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double>(long, long, double, double const*, long, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double const*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, double*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>) [clone .isra.0] | Innermost | 0.34 | 0.25 | 0.16 | NA | NA | NA | NA | NA | NA | NA | 102.85 | NA | NA | NA | NA | NA |
| ○Loop 42542 | libarmpl_lp64.so | | dswap_ | Single | 0.34 | 0.25 | 0.16 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 816 | exec | MultiBsplineRef.hpp:276-286 | void miniqmcreference::MultiBsplineEvalRef::evaluate_vgh(qmcplusplus::bspline_traits::SplineType const*, double, double, double, double*, double*, double*, unsigned long) | Single | 0.31 | 0.23 | 0.15 | 1.00 | 1.00 | 1.00 | 2.00 | 1 | 100.00 | 100.00 | 280.42 | 9.00 | 9.00 | 9.00 | 9.00 | 4.50 |
| ○Loop 304604 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<16ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 0.25 | 0.19 | 0.13 | NA | NA | NA | NA | NA | NA | NA | 0.35 | NA | NA | NA | NA | NA |
| ○Loop 300 | exec | TwoBodyJastrowRef.h:155-156 | miniqmcreference::TwoBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.25 | 0.18 | 0.12 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 85.71 | 89.29 | 136.36 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 301 | exec | TwoBodyJastrowRef.h:155-156 | miniqmcreference::TwoBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.25 | 0.17 | 0.11 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 100.00 | 143.35 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 302 | exec | TwoBodyJastrowRef.h:155-156 | miniqmcreference::TwoBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.26 | 0.15 | 0.10 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 85.71 | 89.29 | 156.47 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 63 | exec | NonLocalPP.hpp:126-126,NonLocalPP.hpp:129-135,OhmmsVector.h:229-229,ParticleSet.h:277-277,stl_vector.h:993-993,stl_vector.h:1131-1131,stl_vector.h:1150-1150,unique_ptr.h:193-193 | qmcplusplus::NonLocalPP::evaluate(qmcplusplus::ParticleSet const&, qmcplusplus::WaveFunction&) | Outermost | 0.19 | 0.13 | 0.08 | 9.00 | 1.00 | 5.82 | 24.00 | 1 | 0.00 | 22.79 | 8.14 | 6.00 | 0.67 | 6.00 | 1.03 | 0.25 |
| ○Loop 912 | exec | inner_product.hpp:82-83 | miniqmcreference::DiracDeterminantRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.15 | 0.09 | 0.06 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 85.71 | 89.29 | 128.67 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 892 | exec | inner_product.hpp:211-212 | qmcplusplus::DiracMatrix::invert_transpose(qmcplusplus::Matrix > const&, qmcplusplus::Matrix >&, double&, double&) | Innermost | 0.11 | 0.09 | 0.06 | 1.00 | 1.00 | 4.00 | 2.67 | 1 | 0.00 | 25.00 | 0.00 | 2.67 | 2.67 | 2.67 | 0.67 | 1.00 |
| ○Loop 904 | exec | OperatorTags.h:63-63,OperatorTags.h:94-94,inner_product.hpp:155-155 | miniqmcreference::DiracDeterminantRef >::evaluateLog(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&) | Innermost | 0.14 | 0.09 | 0.06 | 1.00 | 1.75 | 1.47 | 1.00 | 1 | 81.82 | 59.09 | 84.49 | 2.00 | 2.00 | 1.14 | 1.36 | 2.00 |
| ○Loop 913 | exec | OperatorTags.h:63-63,OperatorTags.h:94-94,inner_product.hpp:155-155 | miniqmcreference::DiracDeterminantRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.13 | 0.08 | 0.06 | 1.00 | 1.75 | 1.47 | 1.00 | 1 | 81.82 | 59.09 | 426.30 | 2.00 | 2.00 | 1.14 | 1.36 | 2.00 |
| ○Loop 271 | exec | stl_vector.h:1150-1150,BsplineFunctor.h:303-336 | qmcplusplus::BsplineFunctor::evaluateVGL(int, int, int, double const*, double*, double*, double*, double*, int*) const | Single | 0.12 | 0.06 | 0.04 | 1.00 | 1.00 | 1.01 | 1.19 | 1 | 91.14 | 98.10 | 607.75 | 25.50 | 25.50 | 25.50 | 25.13 | 21.50 |
| ○Loop 304833 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | InBetween | 0.11 | 0.06 | 0.04 | NA | NA | NA | NA | NA | NA | NA | 0.28 | NA | NA | NA | NA | NA |
| ○Loop 326 | exec | stl_vector.h:1150-1150,BsplineFunctor.h:246-260 | miniqmcreference::TwoBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | Innermost | 0.10 | 0.06 | 0.04 | 1.00 | 1.00 | 1.03 | 1.21 | 1 | 75.00 | 93.75 | 690.67 | 14.50 | 14.50 | 14.50 | 14.13 | 12.00 |
| ○Loop 277 | exec | BsplineFunctor.h:236-241 | qmcplusplus::BsplineFunctor::evaluateV(int, int, int, double const*, double*) const | Single | 0.08 | 0.05 | 0.04 | 1.80 | 1.00 | 6.00 | 6.00 | 1 | 0.00 | 20.83 | 6.66 | 1.50 | 0.83 | 1.50 | 0.25 | 0.25 |
| ○Loop 34841 | libarmpl_lp64.so | | dgemm_sve_big | InBetween | 0.07 | 0.04 | 0.02 | NA | NA | NA | NA | NA | NA | NA | 1245.29 | NA | NA | NA | NA | NA |
| ○Loop 803 | exec | OhmmsVector.h:178-178,SPOSet.h:83-86,ParticleSet.h:277-277,stl_vector.h:1131-1131,inner_product.hpp:82-82 | qmcplusplus::SPOSet::evaluateDetRatios(qmcplusplus::VirtualParticleSet const&, qmcplusplus::Vector >&, qmcplusplus::Vector > const&, std::vector >&) | InBetween | 0.07 | 0.04 | 0.02 | 1.67 | 1.00 | 5.27 | 2.17 | 1 | 0.00 | 23.33 | 41.99 | 3.63 | 2.17 | 3.63 | 0.69 | 1.67 |
| ○Loop 323 | exec | BsplineFunctor.h:238-241 | miniqmcreference::TwoBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | Outermost | 0.06 | 0.03 | 0.02 | 1.20 | 1.00 | 4.00 | 4.00 | 1 | 0.00 | 25.00 | 75.25 | 1.00 | 0.83 | 1.00 | 0.25 | 0.25 |
| ○Loop 2053 | exec | SoaDistanceTableABOMPTarget.h:215-215,SoaDistanceTableABOMPTarget.h:222-222,SoaDistanceTableABOMPTarget.h:228-228,ParticleBConds3DSoa.h:298-298 | qmcplusplus::SoaDistanceTableABOMPTarget::evaluate(qmcplusplus::ParticleSet&) | InBetween | 0.06 | 0.03 | 0.02 | 3.15 | 1.00 | 2.43 | 3.15 | 20 | 2.59 | 24.03 | 185.51 | 26.75 | 8.50 | 26.75 | 11.00 | 8.50 |
| ○Loop 224 | exec | OneBodyJastrowRef.h:192-193 | miniqmcreference::OneBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.06 | 0.03 | 0.02 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 85.71 | 89.29 | 74.28 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 304754 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | InBetween | 0.06 | 0.03 | 0.02 | NA | NA | NA | NA | NA | NA | NA | 8.68 | NA | NA | NA | NA | NA |
| ○Loop 334699 | libarmpl_lp64.so | | void armpl::clag::lu_unblocked_direct_kernel(long, long, double*, long, int*, int&) | Innermost | 0.06 | 0.03 | 0.02 | NA | NA | NA | NA | NA | NA | NA | 291.17 | NA | NA | NA | NA | NA |
| ○Loop 307 | exec | stl_numeric.h:140-141 | miniqmcreference::TwoBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.05 | 0.02 | 0.02 | 1.00 | 1.00 | 1.12 | 1.00 | 1 | 80.00 | 85.00 | 374.94 | 2.00 | 2.00 | 2.00 | 1.79 | 2.00 |
| ○Loop 905 | exec | inner_product.hpp:82-83 | miniqmcreference::DiracDeterminantRef >::evaluateLog(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&) | Innermost | 0.06 | 0.02 | 0.01 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 85.71 | 89.29 | 84.62 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 2052 | exec | SoaDistanceTableABOMPTarget.h:214-214 | qmcplusplus::SoaDistanceTableABOMPTarget::evaluate(qmcplusplus::ParticleSet&) | Outermost | 0.05 | 0.02 | 0.01 | 1.21 | 1.00 | 1.21 | 34.00 | 20 | 58.82 | 63.53 | 67.24 | 17.00 | 14.00 | 17.00 | 14.00 | 0.50 |
| ○Loop 325 | exec | TwoBodyJastrowRef.h:132-132,BsplineFunctor.h:236-241,BsplineFunctor.h:246-246 | miniqmcreference::TwoBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | InBetween | 0.05 | 0.02 | 0.01 | 1.03 | 1.00 | 1.21 | 9.07 | 1 | 63.77 | 60.51 | 29.89 | 11.33 | 11.00 | 11.33 | 9.38 | 1.25 |
| ○Loop 245 | exec | OneBodyJastrowRef.h:134-135,OhmmsVector.h:223-223,OhmmsVector.h:249-249,stl_vector.h:993-993,stl_vector.h:1131-1131,stl_vector.h:1150-1150 | miniqmcreference::OneBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | Single | 0.05 | 0.02 | 0.01 | 1.42 | 1.70 | 6.18 | 1.42 | 1 | 0.00 | 22.50 | 2.13 | 2.83 | 2.00 | 1.67 | 0.46 | 2.00 |
| ○Loop 33845 | libarmpl_lp64.so | | daxpby_sve_kernel | Single | 0.05 | 0.02 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 169.83 | NA | NA | NA | NA | NA |
| ○Loop 813 | exec | MultiBsplineRef.hpp:63-63,MultiBsplineRef.hpp:66-68 | void miniqmcreference::MultiBsplineEvalRef::evaluate_v(qmcplusplus::bspline_traits::SplineType const*, double, double, double, double*, unsigned long) | Outermost | 0.05 | 0.02 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 44.93 | NA | NA | NA | NA | NA |
| ○Loop 226 | exec | OneBodyJastrowRef.h:192-193 | miniqmcreference::OneBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 70.00 | NA | NA | NA | NA | NA |
| ○Loop 225 | exec | OneBodyJastrowRef.h:192-193 | miniqmcreference::OneBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.06 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 77.74 | NA | NA | NA | NA | NA |
| ○Loop 337 | exec | TwoBodyJastrowRef.h:381-382 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 88.02 | NA | NA | NA | NA | NA |
| ○Loop 114 | exec | OperatorTags.h:94-94,WaveFunction.cpp:185-188,refwrap.h:351-351,NewTimer.h:242-242,NewTimer.h:249-249,stl_vector.h:993-993,stl_vector.h:1131-1131 | qmcplusplus::WaveFunction::evalGrad(qmcplusplus::ParticleSet&, int) | Single | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 1.16 | NA | NA | NA | NA | NA |
| ○Loop 28 | exec | TinyVector.h:62-62,refwrap.h:351-351,stl_vector.h:1131-1131,miniqmc.cpp:416-416,miniqmc.cpp:429-429,miniqmc.cpp:432-434,miniqmc.cpp:437-437,miniqmc.cpp:440-443,miniqmc.cpp:446-446,miniqmc.cpp:449-454,miniqmc.cpp:457-458 | main.omp_outlined.62 | Outermost | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 0.54 | NA | NA | NA | NA | NA |
| ○Loop 304782 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | InBetween | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 0.24 | NA | NA | NA | NA | NA |
| ○Loop 322 | exec | OhmmsVector.h:223-223,OhmmsVector.h:229-229,TwoBodyJastrowRef.h:108-108,TwoBodyJastrowRef.h:126-127,ParticleSet.h:313-313,BsplineFunctor.h:233-233,BsplineFunctor.h:236-236 | miniqmcreference::TwoBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | InBetween | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 11.74 | NA | NA | NA | NA | NA |
| ○Loop 338 | exec | TwoBodyJastrowRef.h:381-382 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.06 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 96.72 | NA | NA | NA | NA | NA |
| ○Loop 366398 | libarmpl_lp64.so | | void armpl::clag::(anonymous namespace)::trsm_kernel(double const*, long, long, double*, long, long, long, long) | Single | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 75.90 | NA | NA | NA | NA | NA |
| ○Loop 799 | exec | stl_algo.h:709-709,einspline_spo_ref.hpp:183-187,stl_algobase.h:238-238,stl_algobase.h:413-413,stl_algobase.h:450-452 | miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&) | Single | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 339 | exec | TwoBodyJastrowRef.h:381-382 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.07 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 87.34 | NA | NA | NA | NA | NA |
| ○Loop 334 | exec | TwoBodyJastrowRef.h:388-391 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.03 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 234.45 | NA | NA | NA | NA | NA |
| ○Loop 303661 | libarmpl_lp64.so | | void armpl::clag::gemv_a_strd_first_impl, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double>(long, long, double, double const*, long, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double const*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, double*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>) [clone .isra.0] | Single | 0.04 | 0.01 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 67.75 | NA | NA | NA | NA | NA |
| ○Loop 253 | exec | stl_algobase.h:939-940 | qmcplusplus::Vector, std::allocator > >::resize(unsigned long, qmcplusplus::TinyVector) | Single | 0.03 | 0.01 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 1113 | exec | ParticleSet.cpp:242-243,stl_vector.h:993-993,unique_ptr.h:193-193 | qmcplusplus::ParticleSet::update(bool) | Single | 0.03 | 0.01 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 814 | exec | einspline_spo_ref.hpp:206-208,VectorSoAContainer.h:265-265,stl_vector.h:1131-1131,stl_vector.h:1263-1263 | miniqmcreference::einspline_spo_ref::evaluate_vgh(qmcplusplus::ParticleSet const&, int) | Single | 0.04 | 0.01 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 5.82 | NA | NA | NA | NA | NA |
| ○Loop 64 | exec | OperatorTags.h:43-43,OperatorTags.h:63-63,TinyVector.h:144-145,NonLocalPP.hpp:131-132,OhmmsVector.h:223-223,OhmmsVector.h:229-229,VectorSoAContainer.h:231-231,stl_vector.h:993-993,stl_vector.h:1131-1131 | qmcplusplus::NonLocalPP::evaluate(qmcplusplus::ParticleSet const&, qmcplusplus::WaveFunction&) | Innermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 1.01 | NA | NA | NA | NA | NA |
| ○Loop 249 | exec | stl_algobase.h:951-952 | qmcplusplus::Vector >::resize(unsigned long, double) | Single | 0.04 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 808 | exec | einspline_spo_ref.hpp:175-176,stl_vector.h:1131-1131,stl_vector.h:1263-1263 | miniqmcreference::einspline_spo_ref::evaluate_v(qmcplusplus::ParticleSet const&, int) | Single | 0.04 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 3.97 | NA | NA | NA | NA | NA |
| ○Loop 119 | exec | WaveFunction.cpp:269-269,WaveFunction.cpp:272-273,refwrap.h:351-351,NewTimer.h:242-242,NewTimer.h:249-249,stl_vector.h:993-993,stl_vector.h:1131-1131 | qmcplusplus::WaveFunction::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | InBetween | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 304547 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<16ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 115 | exec | WaveFunction.cpp:198-201,refwrap.h:351-351,NewTimer.h:242-242,NewTimer.h:249-249,stl_vector.h:993-993,stl_vector.h:1131-1131 | qmcplusplus::WaveFunction::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 321 | exec | TwoBodyJastrowRef.h:107-108,TwoBodyJastrowRef.h:126-127,refwrap.h:351-351,optional:469-469,optional:991-991,stl_vector.h:993-993,stl_vector.h:1131-1131 | miniqmcreference::TwoBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | Innermost | 0.04 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 12.49 | NA | NA | NA | NA | NA |
| ○Loop 304756 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | InBetween | 0.04 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 41.56 | NA | NA | NA | NA | NA |
| ○Loop 1478 | exec | stl_tree.h:1947-1947,NewTimer.h:119-119 | qmcplusplus::TimerType::stop() | Innermost | 0.17 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 819 | exec | MultiBsplineRef.hpp:226-227,MultiBsplineRef.hpp:234-239,MultiBsplineRef.hpp:242-242 | void miniqmcreference::MultiBsplineEvalRef::evaluate_vgh(qmcplusplus::bspline_traits::SplineType const*, double, double, double, double*, double*, double*, unsigned long) | Outermost | 0.04 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 185.10 | NA | NA | NA | NA | NA |
| ○Loop 351 | exec | TwoBodyJastrowRef.h:397-398 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 83.43 | NA | NA | NA | NA | NA |
| ○Loop 354 | exec | TwoBodyJastrowRef.h:397-398 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 115.48 | NA | NA | NA | NA | NA |
| ○Loop 304018 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<11ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 127.83 | NA | NA | NA | NA | NA |
| ○Loop 270 | exec | OneBodyJastrowRef.h:214-219,OhmmsVector.h:223-223,shared_ptr_base.h:1667-1667,ParticleSet.h:313-313,stl_vector.h:1131-1131,stl_vector.h:1263-1263 | miniqmcreference::OneBodyJastrowRef >::computeU3(qmcplusplus::ParticleSet&, int, double const*) | Single | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.88 | NA | NA | NA | NA | NA |
| ○Loop 304492 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<16ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 348 | exec | TwoBodyJastrowRef.h:397-398 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 109.26 | NA | NA | NA | NA | NA |
| ○Loop 1476 | exec | stl_tree.h:1947-1947,NewTimer.h:119-119 | qmcplusplus::TimerType::stop() | Innermost | 0.08 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 35071 | libarmpl_lp64.so | | ddot_kernel | Single | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 35.45 | NA | NA | NA | NA | NA |
| ○Loop 37170 | libarmpl_lp64.so | | n_interleave_kernel_d8 | Outermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 304032 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<10ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Outermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 336 | exec | TwoBodyJastrowRef.h:375-376 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 199.79 | NA | NA | NA | NA | NA |
| ○Loop 37169 | libarmpl_lp64.so | | n_interleave_kernel_d8 | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 304493 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<16ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | InBetween | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 34827 | libarmpl_lp64.so | | dgemm_reference_ | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 37167 | libarmpl_lp64.so | | n_interleave_kernel_d8 | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 303659 | libarmpl_lp64.so | | void armpl::clag::gemv_a_strd_first_impl, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double>(long, long, double, double const*, long, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double const*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, double*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>) [clone .isra.0] | Outermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 7.95 | NA | NA | NA | NA | NA |
| ○Loop 303979 | libarmpl_lp64.so | | void armpl::clag::gemv_a_cntg_first(long, long, double, double const*, long, long, double const*, long, double, double*, long) | Outermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 15.89 | NA | NA | NA | NA | NA |
| ○Loop 37168 | libarmpl_lp64.so | | n_interleave_kernel_d8 | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |