| ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
| ○Loop 914 | exec | MultiBsplineRef.hpp:68-71 | miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&) | Innermost | 36.76 | 35.78 | 23.34 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 100.00 | 97.87 | 6.00 | 6.00 | 6.00 | 6.00 | 6.00 |
| ○Loop 34845 | libarmpl_lp64.so | | dgemm_sve_big | Innermost | 25.71 | 24.98 | 16.29 | NA | NA | NA | NA | NA | NA | NA | 1433.88 | NA | NA | NA | NA | NA |
| ○Loop 34844 | libarmpl_lp64.so | | dgemm_sve_big | InBetween | 21.51 | 21.13 | 13.78 | NA | NA | NA | NA | NA | NA | NA | 212.52 | NA | NA | NA | NA | NA |
| ○Loop 946 | exec | MultiBsplineRef.hpp:242-262 | miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&, qmcplusplus::Vector, std::allocator > >&, qmcplusplus::Vector >&) | Innermost | 15.37 | 15.12 | 9.86 | 1.09 | 1.09 | 4.00 | 1.40 | 1 | 0.00 | 25.00 | 317.70 | 8.75 | 8.00 | 8.00 | 2.19 | 6.25 |
| ○Loop 2426 | exec | SoaDistanceTableAAOMPTarget.h:440-442,TinyVector.h:182-182,VectorSoAContainer.h:244-244,VectorSoAContainer.h:263-263 | qmcplusplus::SoaDistanceTableAAOMPTarget::update(int) | Single | 12.65 | 12.13 | 7.91 | 1.38 | 1.00 | 4.00 | 7.33 | 1 | 0.00 | 25.00 | 0.75 | 7.33 | 5.33 | 7.33 | 1.83 | 1.00 |
| ○Loop 37166 | libarmpl_lp64.so | | n_interleave_kernel_d8 | Innermost | 10.11 | 9.73 | 6.35 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 3036 | exec | SoaDistanceTableABOMPTarget.h:228-228,ParticleBConds3DSoa.h:280-298 | qmcplusplus::SoaDistanceTableABOMPTarget::evaluate(qmcplusplus::ParticleSet&) | Innermost | 8.32 | 7.83 | 5.11 | 2.92 | 2.50 | 4.00 | 1.25 | 1 | 0.00 | 25.00 | 306.21 | 8.75 | 3.00 | 3.50 | 2.19 | 7.00 |
| ○Loop 303984 | libarmpl_lp64.so | | void armpl::clag::gemv_a_cntg_first(long, long, double, double const*, long, long, double const*, long, double, double*, long) | Innermost | 2.37 | 2.20 | 1.43 | NA | NA | NA | NA | NA | NA | NA | 101.15 | NA | NA | NA | NA | NA |
| ○Loop 295 | exec | BsplineFunctor.h:236-241 | miniqmcreference::TwoBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | Innermost | 2.32 | 2.07 | 1.35 | 1.59 | 1.00 | 4.25 | 17.00 | 6561 | 0.00 | 20.19 | 34.25 | 8.50 | 5.33 | 8.50 | 2.00 | 0.50 |
| ○Loop 34842 | libarmpl_lp64.so | | dgemm_sve_big | Innermost | 2.12 | 2.00 | 1.30 | NA | NA | NA | NA | NA | NA | NA | 1233.32 | NA | NA | NA | NA | NA |
| ○Loop 303664 | libarmpl_lp64.so | | void armpl::clag::gemv_a_strd_first_impl, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double>(long, long, double, double const*, long, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double const*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, double*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>) [clone .isra.0] | Innermost | 1.41 | 1.21 | 0.79 | NA | NA | NA | NA | NA | NA | NA | 118.81 | NA | NA | NA | NA | NA |
| ○Loop 304781 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 1.24 | 1.12 | 0.73 | NA | NA | NA | NA | NA | NA | NA | 0.38 | NA | NA | NA | NA | NA |
| ○Loop 530 | exec | TwoBodyJastrowRef.h:342-347 | miniqmcreference::TwoBodyJastrowRef >::acceptMove(qmcplusplus::ParticleSet&, int) | Innermost | 1.12 | 0.95 | 0.62 | 1.00 | 1.00 | 1.00 | 1.50 | 1 | 100.00 | 100.00 | 112.50 | 3.00 | 3.00 | 3.00 | 3.00 | 2.00 |
| ○Loop 303980 | libarmpl_lp64.so | | void armpl::clag::gemv_a_cntg_first(long, long, double, double const*, long, long, double const*, long, double, double*, long) | Innermost | 1.13 | 0.93 | 0.61 | NA | NA | NA | NA | NA | NA | NA | 67.91 | NA | NA | NA | NA | NA |
| ○Loop 2564 | exec | ParticleBConds3DSoa.h:237-255 | qmcplusplus::SoaDistanceTableAAOMPTarget::move(qmcplusplus::ParticleSet const&, qmcplusplus::TinyVector const&, int, bool) | Single | 1.08 | 0.91 | 0.59 | 1.00 | 1.00 | 1.00 | 1.21 - 1.21 | 1 | 65.57 | 77.87 | 599.68 | 17.00 | 17.00 | 17.00 | 17.00 | 14.00 - 14.08 |
| ○Loop 2563 | exec | ParticleBConds3DSoa.h:237-255 | qmcplusplus::SoaDistanceTableAAOMPTarget::move(qmcplusplus::ParticleSet const&, qmcplusplus::TinyVector const&, int, bool) | Single | 1.10 | 0.88 | 0.58 | 1.00 | 1.00 | 1.00 | 1.21 - 1.21 | 1 | 65.57 | 77.87 | 613.20 | 17.00 | 17.00 | 17.00 | 17.00 | 14.00 - 14.08 |
| ○Loop 304835 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 1.01 | 0.88 | 0.57 | NA | NA | NA | NA | NA | NA | NA | 0.21 | NA | NA | NA | NA | NA |
| ○Loop 944 | exec | MultiBsplineRef.hpp:276-286 | miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&, qmcplusplus::Vector, std::allocator > >&, qmcplusplus::Vector >&) | Innermost | 0.75 | 0.63 | 0.41 | 1.00 | 1.00 | 4.00 | 2.67 | 1 | 0.00 | 25.00 | 104.99 | 12.00 | 12.00 | 12.00 | 3.00 | 4.50 |
| ○Loop 1107 | exec | OperatorTags.h:63-63,OperatorTags.h:94-94,inner_product.hpp:155-155 | miniqmcreference::DiracDeterminantRef >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&, bool) | Innermost | 0.66 | 0.52 | 0.34 | 1.00 | 1.41 | 1.45 | 1.00 | 1 | 100.00 | 60.87 | 69.19 | 4.00 | 4.00 | 2.83 | 2.76 | 4.00 |
| ○Loop 229 | exec | TwoBodyJastrowRef.h:155-156 | miniqmcreference::TwoBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Innermost | 0.69 | 0.49 | 0.32 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 100.00 | 147.29 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 1103 | exec | OperatorTags.h:63-63,OperatorTags.h:94-94 | miniqmcreference::DiracDeterminantRef >::evalGrad(qmcplusplus::ParticleSet&, int) | Single | 0.58 | 0.47 | 0.31 | 1.00 | 1.71 | 1.42 | 1.00 | 1 | 100.00 | 63.64 | 76.54 | 4.00 | 4.00 | 2.33 | 2.81 | 4.00 |
| ○Loop 1102 | exec | inner_product.hpp:82-83 | miniqmcreference::DiracDeterminantRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | Innermost | 0.59 | 0.45 | 0.29 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 100.00 | 107.44 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 303666 | libarmpl_lp64.so | | void armpl::clag::gemv_a_strd_first_impl, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double>(long, long, double, double const*, long, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double const*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, double*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>) [clone .isra.0] | Single | 0.51 | 0.41 | 0.27 | NA | NA | NA | NA | NA | NA | NA | 126.70 | NA | NA | NA | NA | NA |
| ○Loop 1108 | exec | inner_product.hpp:82-83 | miniqmcreference::DiracDeterminantRef >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&, bool) | Innermost | 0.39 | 0.30 | 0.20 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 100.00 | 39.57 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 233 | exec | BsplineFunctor.h:291-298 | miniqmcreference::TwoBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Innermost | 0.37 | 0.30 | 0.20 | 1.92 | 1.00 | 5.13 | 13.67 | 6561 | 0.00 | 18.88 | 54.28 | 10.25 | 5.33 | 10.25 | 2.00 | 0.75 |
| ○Loop 527 | exec | stl_vector.h:1128-1128,TwoBodyJastrowRef.h:324-331 | miniqmcreference::TwoBodyJastrowRef >::acceptMove(qmcplusplus::ParticleSet&, int) | Single | 0.41 | 0.30 | 0.19 | 1.00 | 1.00 | 4.00 | 1.67 | 1 | 0.00 | 25.00 | 182.47 | 13.33 | 13.33 | 13.33 | 3.33 | 8.00 |
| ○Loop 303660 | libarmpl_lp64.so | | void armpl::clag::gemv_a_strd_first_impl, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double>(long, long, double, double const*, long, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double const*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, double*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>) [clone .isra.0] | Innermost | 0.30 | 0.24 | 0.16 | NA | NA | NA | NA | NA | NA | NA | 105.78 | NA | NA | NA | NA | NA |
| ○Loop 42542 | libarmpl_lp64.so | | dswap_ | Single | 0.27 | 0.24 | 0.15 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 943 | exec | TinyVector.h:145-145,einspline_spo_ref.hpp:223-227 | miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&, qmcplusplus::Vector, std::allocator > >&, qmcplusplus::Vector >&) | Innermost | 0.30 | 0.20 | 0.13 | 1.00 | 1.00 | 3.60 | 12.00 | 1 | 11.11 | 27.78 | 0.00 | 12.00 | 12.00 | 12.00 | 3.33 | 1.00 |
| ○Loop 304604 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<16ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 0.26 | 0.19 | 0.12 | NA | NA | NA | NA | NA | NA | NA | 0.24 | NA | NA | NA | NA | NA |
| ○Loop 525 | exec | BsplineFunctor.h:291-298 | miniqmcreference::TwoBodyJastrowRef >::computeU3(qmcplusplus::ParticleSet const&, int, double const*, double*, double*, double*, bool) | Innermost | 0.24 | 0.18 | 0.11 | 1.92 | 1.00 | 5.13 | 13.67 | 6561 | 0.00 | 19.32 | 48.76 | 10.25 | 5.33 | 10.25 | 2.00 | 0.75 |
| ○Loop 58 | exec | OperatorTags.h:43-43,OperatorTags.h:63-63,TinyVector.h:145-145,NonLocalPP.hpp:126-126,NonLocalPP.hpp:129-135,stl_vector.h:993-993,stl_vector.h:1131-1131,OhmmsVector.h:229-229,VectorSoAContainer.h:231-231,ParticleSet.h:277-277,unique_ptr.h:193-193 | main._omp_fn.1 | InBetween | 0.19 | 0.14 | 0.09 | 2.04 | 1.63 | 1.39 | 6.33 | 6 | 11.43 | 28.93 | 8.05 | 9.50 | 4.67 | 5.83 | 6.81 | 1.50 |
| ○Loop 2818 | exec | ParticleBConds3DSoa.h:237-255 | qmcplusplus::SoaDistanceTableABOMPTarget::move(qmcplusplus::ParticleSet const&, qmcplusplus::TinyVector const&, int, bool) | Single | 0.19 | 0.11 | 0.07 | 1.00 | 1.00 | 1.00 | 1.21 - 1.21 | 1 | 65.57 | 77.87 | 423.79 | 17.00 | 17.00 | 17.00 | 17.00 | 14.00 - 14.08 |
| ○Loop 2531 | exec | ParticleBConds3DSoa.h:237-255 | qmcplusplus::SoaDistanceTableAAOMPTarget::evaluate(qmcplusplus::ParticleSet&) | Innermost | 0.14 | 0.09 | 0.06 | 1.00 | 1.00 | 1.00 | 1.21 - 1.21 | 1 | 65.57 | 77.87 | 579.93 | 17.00 | 17.00 | 17.00 | 17.00 | 14.00 - 14.08 |
| ○Loop 1097 | exec | inner_product.hpp:82-83 | miniqmcreference::DiracDeterminantRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.16 | 0.09 | 0.06 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 100.00 | 130.59 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 1126 | exec | inner_product.hpp:211-212 | miniqmcreference::DiracDeterminantRef >::evaluateLog(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&) | Innermost | 0.10 | 0.09 | 0.06 | 1.00 | 1.00 | 3.50 | 1.17 | 1 | 14.29 | 28.57 | 0.00 | 4.67 | 4.67 | 4.67 | 1.33 | 4.00 |
| ○Loop 2819 | exec | ParticleBConds3DSoa.h:237-255 | qmcplusplus::SoaDistanceTableABOMPTarget::move(qmcplusplus::ParticleSet const&, qmcplusplus::TinyVector const&, int, bool) | Single | 0.14 | 0.08 | 0.05 | 1.00 | 1.00 | 1.00 | 1.21 - 1.21 | 1 | 65.57 | 77.87 | 582.06 | 17.00 | 17.00 | 17.00 | 17.00 | 14.00 - 14.08 |
| ○Loop 1118 | exec | OperatorTags.h:63-63,OperatorTags.h:94-94,inner_product.hpp:155-155 | miniqmcreference::DiracDeterminantRef >::evaluateLog(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&) | Innermost | 0.10 | 0.07 | 0.05 | 1.00 | 1.41 | 1.45 | 1.00 | 1 | 100.00 | 60.87 | 98.10 | 4.00 | 4.00 | 2.83 | 2.76 | 4.00 |
| ○Loop 752 | exec | OneBodyJastrowRef.h:192-193 | miniqmcreference::OneBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Innermost | 0.13 | 0.07 | 0.04 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 100.00 | 100.12 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 1096 | exec | OperatorTags.h:63-63,OperatorTags.h:94-94 | miniqmcreference::DiracDeterminantRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.12 | 0.06 | 0.04 | 1.00 | 1.41 | 1.47 | 1.00 | 1 | 100.00 | 60.87 | 586.53 | 4.00 | 4.00 | 2.83 | 2.71 | 4.00 |
| ○Loop 304833 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | InBetween | 0.14 | 0.06 | 0.04 | NA | NA | NA | NA | NA | NA | NA | 0.29 | NA | NA | NA | NA | NA |
| ○Loop 202 | exec | BsplineFunctor.h:236-241 | miniqmcreference::OneBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | Innermost | 0.09 | 0.06 | 0.04 | 1.59 | 1.00 | 4.25 | 17.00 | 6561 | 0.00 | 20.19 | 14.13 | 8.50 | 5.33 | 8.50 | 2.00 | 0.50 |
| ○Loop 296 | exec | stl_vector.h:1147-1147,BsplineFunctor.h:246-260 | miniqmcreference::TwoBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | Innermost | 0.09 | 0.05 | 0.03 | 1.00 | 1.00 | 1.00 | 1.60 | 1 | 75.56 | 100.00 | 783.89 | 40.00 | 40.00 | 40.00 | 40.00 | 25.00 |
| ○Loop 545 | exec | TwoBodyJastrowRef.h:381-382 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.08 | 0.04 | 0.03 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 100.00 | 156.99 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 34841 | libarmpl_lp64.so | | dgemm_sve_big | InBetween | 0.08 | 0.04 | 0.03 | NA | NA | NA | NA | NA | NA | NA | 1159.96 | NA | NA | NA | NA | NA |
| ○Loop 234 | exec | stl_vector.h:1147-1147,BsplineFunctor.h:305-336 | miniqmcreference::TwoBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Innermost | 0.07 | 0.04 | 0.02 | 1.00 | 1.00 | 1.01 | 1.35 | 1 | 86.70 | 98.18 | 564.79 | 69.00 | 69.00 | 69.00 | 68.03 | 51.00 |
| ○Loop 1100 | exec | OhmmsVector.h:178-178,OhmmsVector.h:248-249,SPOSet.h:83-86,inner_product.hpp:82-83 | miniqmcreference::DiracDeterminantRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | Outermost | 0.06 | 0.03 | 0.02 | 2.38 | 1.00 | 1.83 | 3.57 | 4 | 7.19 | 30.19 | 31.49 | 4.46 | 1.88 | 4.46 | 2.44 | 1.25 |
| ○Loop 293 | exec | refwrap.h:351-351,stl_vector.h:993-993,OhmmsVector.h:223-223,OhmmsVector.h:249-249,TwoBodyJastrowRef.h:107-108,TwoBodyJastrowRef.h:125-127,BsplineFunctor.h:231-231,BsplineFunctor.h:246-246,optional:991-991 | miniqmcreference::TwoBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | Outermost | 0.07 | 0.03 | 0.02 | 5.00 | 1.59 | 1.58 | 5.83 | 1000012 | 0.00 | 26.65 | 6.78 | 5.83 | 1.17 | 3.67 | 3.69 | 1.00 |
| ○Loop 232 | exec | stl_algobase.h:238-238,TwoBodyJastrowRef.h:269-271,ParticleSet.h:313-313,ParticleSet.h:316-316,BsplineFunctor.h:276-276,BsplineFunctor.h:283-284,BsplineFunctor.h:291-298,BsplineFunctor.h:336-336 | miniqmcreference::TwoBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Outermost | 0.06 | 0.03 | 0.02 | 1.10 | 1.33 | 2.05 | 75.00 | 1000012 | 23.89 | 41.04 | 29.27 | 37.50 | 34.00 | 28.13 | 18.31 | 0.50 |
| ○Loop 334699 | libarmpl_lp64.so | | void armpl::clag::lu_unblocked_direct_kernel(long, long, double*, long, int*, int&) | Innermost | 0.05 | 0.02 | 0.02 | NA | NA | NA | NA | NA | NA | NA | 291.52 | NA | NA | NA | NA | NA |
| ○Loop 541 | exec | TwoBodyJastrowRef.h:397-398 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.05 | 0.02 | 0.02 | 1.00 | 1.00 | 1.00 | 2.00 | 1 | 100.00 | 100.00 | 234.87 | 2.00 | 2.00 | 2.00 | 2.00 | 1.00 |
| ○Loop 304754 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | InBetween | 0.05 | 0.02 | 0.02 | NA | NA | NA | NA | NA | NA | NA | 4.46 | NA | NA | NA | NA | NA |
| ○Loop 550 | exec | BsplineFunctor.h:291-298 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.05 | 0.02 | 0.01 | 1.92 | 1.00 | 5.13 | 13.67 | 6561 | 0.00 | 18.88 | 58.53 | 10.25 | 5.33 | 10.25 | 2.00 | 0.75 |
| ○Loop 199 | exec | OneBodyJastrowRef.h:134-135,OneBodyJastrowRef.h:140-141,OneBodyJastrowRef.h:151-151,OneBodyJastrowRef.h:154-155,stl_vector.h:993-993,stl_vector.h:1128-1128,stl_vector.h:1131-1131,OhmmsVector.h:223-223,OhmmsVector.h:229-229,OhmmsVector.h:249-249,BsplineFunctor.h:166-173,BsplineFunctor.h:179-182,BsplineFunctor.h:246-246 | miniqmcreference::OneBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | Outermost | 0.04 | 0.02 | 0.01 | 1.87 | 1.35 | 1.73 | 1.87 | 1000012 | 11.48 | 29.92 | 5.58 | 12.13 | 6.50 | 9.00 | 7.02 | 6.50 |
| ○Loop 231 | exec | stl_numeric.h:140-141 | miniqmcreference::TwoBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.05 | 0.02 | 0.01 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 100.00 | 367.54 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 1117 | exec | inner_product.hpp:82-83 | miniqmcreference::DiracDeterminantRef >::evaluateLog(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&) | Innermost | 0.05 | 0.02 | 0.01 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 100.00 | 86.50 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| ○Loop 294 | exec | TwoBodyJastrowRef.h:127-129,TwoBodyJastrowRef.h:132-132,ParticleSet.h:313-313,ParticleSet.h:316-316,BsplineFunctor.h:232-232,BsplineFunctor.h:236-241,BsplineFunctor.h:246-246,BsplineFunctor.h:259-260 | miniqmcreference::TwoBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | InBetween | 0.05 | 0.02 | 0.01 | 1.28 | 1.01 | 1.23 | 8.79 | 1000012 | 4.23 | 23.28 | 38.55 | 15.38 | 12.00 | 15.28 | 12.50 | 1.75 |
| ○Loop 33845 | libarmpl_lp64.so | | daxpby_sve_kernel | Single | 0.05 | 0.02 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 185.26 | NA | NA | NA | NA | NA |
| ○Loop 913 | exec | stl_algobase.h:951-952,MultiBsplineEvalHelper.hpp:53-53,MultiBsplineEvalHelper.hpp:60-63,einspline_spo_ref.hpp:175-176,stl_vector.h:1263-1263,MultiBsplineRef.hpp:42-44,MultiBsplineRef.hpp:47-49,MultiBsplineRef.hpp:57-58,MultiBsplineRef.hpp:66-68,MultiBsplineData.hpp:54-57 | miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&) | Outermost | 0.04 | 0.02 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 55.40 | NA | NA | NA | NA | NA |
| ○Loop 526 | exec | stl_vector.h:1147-1147,BsplineFunctor.h:305-336 | miniqmcreference::TwoBodyJastrowRef >::computeU3(qmcplusplus::ParticleSet const&, int, double const*, double*, double*, double*, bool) | Innermost | 0.05 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 418.15 | NA | NA | NA | NA | NA |
| ○Loop 3034 | exec | OMPTargetMath.hpp:38-38,SoaDistanceTableABOMPTarget.h:215-215,SoaDistanceTableABOMPTarget.h:218-218,SoaDistanceTableABOMPTarget.h:222-222,SoaDistanceTableABOMPTarget.h:228-228,ParticleBConds3DSoa.h:284-286,ParticleBConds3DSoa.h:294-296 | qmcplusplus::SoaDistanceTableABOMPTarget::evaluate(qmcplusplus::ParticleSet&) | InBetween | 0.05 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 126.81 | NA | NA | NA | NA | NA |
| ○Loop 139 | exec | OperatorTags.h:94-94,WaveFunction.cpp:185-188,refwrap.h:351-351,stl_vector.h:993-993,stl_vector.h:1128-1128,NewTimer.h:242-242,NewTimer.h:249-249 | qmcplusplus::WaveFunction::evalGrad(qmcplusplus::ParticleSet&, int) | Single | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 0.52 | NA | NA | NA | NA | NA |
| ○Loop 915 | exec | MultiBsplineRef.hpp:64-66 | miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&) | InBetween | 0.05 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 196.53 | NA | NA | NA | NA | NA |
| ○Loop 366398 | libarmpl_lp64.so | | void armpl::clag::(anonymous namespace)::trsm_kernel(double const*, long, long, double*, long, long, long, long) | Single | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 69.42 | NA | NA | NA | NA | NA |
| ○Loop 304782 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | InBetween | 0.04 | 0.01 | 0.01 | NA | NA | NA | NA | NA | NA | NA | 0.59 | NA | NA | NA | NA | NA |
| ○Loop 542 | exec | TwoBodyJastrowRef.h:388-391 | miniqmcreference::TwoBodyJastrowRef >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.04 | 0.01 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 207.17 | NA | NA | NA | NA | NA |
| ○Loop 912 | exec | stl_algobase.h:238-238,stl_algobase.h:413-413,stl_algobase.h:451-452,einspline_spo_ref.hpp:183-187,stl_algo.h:683-683,stl_algo.h:709-709,stl_vector.h:1263-1263 | miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&) | Single | 0.04 | 0.01 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 26 | exec | TinyVector.h:62-62,refwrap.h:351-351,stl_vector.h:1131-1131,miniqmc.cpp:429-458 | main._omp_fn.1 | Innermost | 0.04 | 0.01 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.72 | NA | NA | NA | NA | NA |
| ○Loop 1158 | exec | stl_algobase.h:939-940 | miniqmcreference::DiracDeterminantRef >::resize(int, int) | Innermost | 0.03 | 0.01 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 303661 | libarmpl_lp64.so | | void armpl::clag::gemv_a_strd_first_impl, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double>(long, long, double, double const*, long, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double const*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, double*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>) [clone .isra.0] | Single | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 68.85 | NA | NA | NA | NA | NA |
| ○Loop 524 | exec | stl_algobase.h:238-238,TwoBodyJastrowRef.h:269-271,ParticleSet.h:313-313,ParticleSet.h:316-316,BsplineFunctor.h:276-276,BsplineFunctor.h:283-284,BsplineFunctor.h:291-298,BsplineFunctor.h:336-336 | miniqmcreference::TwoBodyJastrowRef >::computeU3(qmcplusplus::ParticleSet const&, int, double const*, double*, double*, double*, bool) | Outermost | 0.04 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 19.85 | NA | NA | NA | NA | NA |
| ○Loop 201 | exec | OneBodyJastrowRef.h:143-146,shared_ptr_base.h:1667-1667,OhmmsVector.h:223-223,ParticleSet.h:313-313,ParticleSet.h:316-316,BsplineFunctor.h:231-232,BsplineFunctor.h:236-241,BsplineFunctor.h:246-246,BsplineFunctor.h:259-260 | miniqmcreference::OneBodyJastrowRef >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | InBetween | 0.04 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 5.47 | NA | NA | NA | NA | NA |
| ○Loop 945 | exec | stl_algobase.h:951-952,MultiBsplineEvalHelper.hpp:53-56,MultiBsplineEvalHelper.hpp:60-63,MultiBsplineEvalHelper.hpp:66-66,einspline_spo_ref.hpp:206-207,einspline_spo_ref.hpp:230-230,stl_vector.h:1263-1263,VectorSoAContainer.h:265-265,MultiBsplineRef.hpp:187-192,MultiBsplineRef.hpp:199-200,MultiBsplineRef.hpp:205-206,MultiBsplineRef.hpp:209-213,MultiBsplineRef.hpp:229-230,MultiBsplineRef.hpp:269-286,MultiBsplineData.hpp:68-79 | miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&, qmcplusplus::Vector, std::allocator > >&, qmcplusplus::Vector >&) | Outermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 10.76 | NA | NA | NA | NA | NA |
| ○Loop 1968 | exec | NewTimer.h:119-119,stl_tree.h:786-786,stl_tree.h:1947-1949 | qmcplusplus::TimerType::stop() | Single | 0.22 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.35 | NA | NA | NA | NA | NA |
| ○Loop 145 | exec | WaveFunction.cpp:269-269,WaveFunction.cpp:272-274,refwrap.h:351-351,stl_vector.h:992-993,stl_vector.h:1131-1131,NewTimer.h:242-242,NewTimer.h:249-249 | qmcplusplus::WaveFunction::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector >&) | Outermost | 0.04 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 1377 | exec | ParticleSet.cpp:242-243,stl_vector.h:993-993,unique_ptr.h:193-193 | qmcplusplus::ParticleSet::update(bool) | Single | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 916 | exec | MultiBsplineRef.hpp:63-66 | miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&) | InBetween | 0.04 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 104.35 | NA | NA | NA | NA | NA |
| ○Loop 1964 | exec | NewTimer.h:119-119,stl_tree.h:786-786,stl_tree.h:1947-1949 | qmcplusplus::TimerType::stop() | Single | 0.13 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 304018 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<11ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 98.92 | NA | NA | NA | NA | NA |
| ○Loop 304547 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<16ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 304756 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<15ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | InBetween | 0.04 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 32.49 | NA | NA | NA | NA | NA |
| ○Loop 140 | exec | WaveFunction.cpp:198-201,refwrap.h:351-351,stl_vector.h:993-993,NewTimer.h:242-242,NewTimer.h:249-249 | qmcplusplus::WaveFunction::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&) | Single | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 192 | exec | OneBodyJastrowRef.h:214-218,stl_vector.h:1131-1131,shared_ptr_base.h:1667-1667,OhmmsVector.h:223-223,ParticleSet.h:316-316 | miniqmcreference::OneBodyJastrowRef >::computeU3(qmcplusplus::ParticleSet&, int, double const*) [clone .isra.0] | Single | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 1.86 | NA | NA | NA | NA | NA |
| ○Loop 304492 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<16ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 34827 | libarmpl_lp64.so | | dgemm_reference_ | Innermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 2.26 | NA | NA | NA | NA | NA |
| ○Loop 303983 | libarmpl_lp64.so | | void armpl::clag::gemv_a_cntg_first(long, long, double, double const*, long, long, double const*, long, double, double*, long) | Outermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 37.74 | NA | NA | NA | NA | NA |
| ○Loop 942 | exec | stl_algobase.h:238-238,TinyVector.h:145-145,einspline_spo_ref.hpp:219-219,einspline_spo_ref.hpp:223-227,stl_vector.h:1131-1131,OhmmsVector.h:223-223,VectorSoAContainer.h:231-231,VectorSoAContainer.h:271-271 | miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&, qmcplusplus::Vector, std::allocator > >&, qmcplusplus::Vector >&) | Outermost | 0.03 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 57 | exec | NonLocalPP.hpp:122-122,NonLocalPP.hpp:126-126,stl_vector.h:1150-1150,ParticleSet.h:277-277 | main._omp_fn.1 | InBetween | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 334701 | libarmpl_lp64.so | | void armpl::clag::lu_unblocked_direct_kernel(long, long, double*, long, int*, int&) | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 94.97 | NA | NA | NA | NA | NA |
| ○Loop 2815 | exec | stl_algobase.h:413-413,stl_algobase.h:451-453,SoaDistanceTableABOMPTarget.h:375-376,stl_algo.h:683-683,stl_algo.h:709-709,stl_vector.h:1131-1131,VectorSoAContainer.h:271-271 | qmcplusplus::SoaDistanceTableABOMPTarget::update(int) | Single | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 285 | exec | OneBodyJastrowRef.h:192-193 | miniqmcreference::OneBodyJastrowRef >::evaluateLog(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&) | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 85.92 | NA | NA | NA | NA | NA |
| ○Loop 304493 | libarmpl_lp64.so | | auto armpl::clag::execute_strategy<16ul, std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync >, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> >(std::tuple, armpl::clag::matmul::large_no_sync, armpl::clag::matmul::rank_k_update_large, armpl::clag::matmul::rank_k_update_basic, armpl::clag::matmul::rank_one_update, armpl::clag::matmul::gemm_reference, armpl::clag::matmul::symm_hemm_l_reference, armpl::clag::matmul::symm_hemm_r_reference, armpl::clag::matmul::syrk_herk_reference, armpl::clag::matmul::backstop, armpl::clag::matmul::large_no_sync > const&, armpl::clag::spec::problem_context >, armpl::clag::general_matrix >, armpl::clag::general_matrix >, double>, armpl::clag::spec::sve_architecture_spec> const&) | InBetween | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 35071 | libarmpl_lp64.so | | ddot_kernel | Single | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 39.57 | NA | NA | NA | NA | NA |
| ○Loop 948 | exec | MultiBsplineRef.hpp:226-226,MultiBsplineRef.hpp:234-236 | miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&, qmcplusplus::Vector, std::allocator > >&, qmcplusplus::Vector >&) | InBetween | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 35.61 | NA | NA | NA | NA | NA |
| ○Loop 37170 | libarmpl_lp64.so | | n_interleave_kernel_d8 | Outermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 37168 | libarmpl_lp64.so | | n_interleave_kernel_d8 | Innermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 746 | exec | stl_vector.h:1147-1147,BsplineFunctor.h:305-336 | qmcplusplus::BsplineFunctor::evaluateVGL(int, int, int, double const*, double*, double*, double*, double*, int*) const | Single | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 303659 | libarmpl_lp64.so | | void armpl::clag::gemv_a_strd_first_impl, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double>(long, long, double, double const*, long, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double const*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, double*, armpl::clag::(anonymous namespace)::step_val_fixed<1l>) [clone .isra.0] | Outermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 7.91 | NA | NA | NA | NA | NA |
| ○Loop 1105 | exec | OperatorTags.h:61-63,OperatorTags.h:94-94,TinyVector.h:62-62,OhmmsMatrix.h:217-217,TinyVectorOps.h:129-131,OhmmsVector.h:248-248,inner_product.hpp:82-83,inner_product.hpp:155-156,DiracDeterminantRef.cpp:173-173,DiracDeterminantRef.cpp:178-178 | miniqmcreference::DiracDeterminantRef >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&, bool) | Outermost | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |
| ○Loop 142 | exec | WaveFunction.cpp:225-228,refwrap.h:351-351,stl_vector.h:993-993,stl_vector.h:1128-1128,NewTimer.h:242-242,NewTimer.h:249-249 | qmcplusplus::WaveFunction::acceptMove(qmcplusplus::ParticleSet&, int) | Single | 0.02 | 0.00 | 0.00 | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |