options

Functions and Loops

188 loops and 165 functions have been discarded from the report because their ratio ((Max Inclusive Time Over Threads * 100) / Max Thread Active Time) is lower than the threshold set by object_coverage_threshold (0.01%). It represents about 0.79% of the application. To include them, change the value of object_coverage_threshold in the experiment directory configuration file, then rerun the command with the additionnal parameter --force-static-analysis.
Inclusive metrics are only related to the given object code and do not include other external objects / libraries.

Columns Filter

Max Thread Time / Walltime armclang_1 (%) Coverage armclang_1 (%) Coverage Excluding Loops armclang_1 (%) Max Inclusive Time Over Threads armclang_1 (s) Max Exclusive Time Over Threads armclang_1 (s) Inclusive Time w.r.t. Wall Time armclang_1 (s) Exclusive Time w.r.t. Wall Time armclang_1 (s) Nb Threads armclang_1 Deviation (coverage) armclang_1 Deviation (walltime) armclang_1 Categories armclang_1 GFLOPS armclang_1 Compilation Options Max Thread Time / Walltime Coverage Coverage Excluding Loops Max Inclusive Time Over Threads Max Exclusive Time Over Threads Inclusive Time w.r.t. Wall Time Exclusive Time w.r.t. Wall Time Nb Threads Deviation (coverage) Deviation (walltime) Categories GFLOPS Compilation Options
NameModuleMax Thread Time / Walltime armclang_1 (%)Coverage armclang_1 (%)Coverage Excluding Loops armclang_1 (%)Max Inclusive Time Over Threads armclang_1 (s)Max Exclusive Time Over Threads armclang_1 (s)Inclusive Time w.r.t. Wall Time armclang_1 (s)Exclusive Time w.r.t. Wall Time armclang_1 (s)Nb Threads armclang_1Deviation (coverage) armclang_1Deviation (walltime) armclang_1Categories armclang_1GFLOPS armclang_1Compilation Options
dgemm_sve_biglibarmpl_lp64.so31.4932.280.0049.550.4648.980.01640.270.37Math (%): 100.00874.22
void miniqmcreference::MultiBsplineEvalRef::evaluate_v<double>(qmcplusplus::bspline_traits<double, 3u>::SplineType const*, double, double, double, double*, unsigned long)+exec23.2723.620.0136.611.2135.850.02640.300.46Exe (%): 100.0087.17Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 813 - MultiBsplineRef.hpp:63-71 - exec+0.0323.610.0137.830.0535.830.02370.010.0144.93
Loop 810 - MultiBsplineRef.hpp:68-71 - exec5.975.855.859.409.408.888.88640.140.2087.88
Loop 809 - MultiBsplineRef.hpp:68-71 - exec5.995.935.939.439.439.009.00640.140.2286.69
Loop 812 - MultiBsplineRef.hpp:68-71 - exec6.045.975.979.509.509.069.06640.140.2286.15
Loop 811 - MultiBsplineRef.hpp:68-71 - exec6.015.855.859.469.468.888.88640.140.2188.11
qmcplusplus::SoaDistanceTableAAOMPTarget<double, 3u, 40>::update(int)+exec8.198.240.0012.890.2512.510.00640.170.26Exe (%): 100.000.72Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 1794 - VectorSoAContainer.h:244-263 - exec [...]8.198.248.2412.8912.8912.5012.50640.170.260.72
void miniqmcreference::MultiBsplineEvalRef::evaluate_vgh<double>(qmcplusplus::bspline_traits<double, 3u>::SplineType const*, double, double, double, double*, double*, double*, unsigned long)+exec7.937.870.0112.481.2711.940.02640.190.30Exe (%): 100.00394.09Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 819 - MultiBsplineRef.hpp:226-262 - exec [...]+0.027.710.0012.260.0411.690.0080.000.01185.10
Loop 820 - MultiBsplineRef.hpp:242-262 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 817 - MultiBsplineRef.hpp:226-226 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 818 - MultiBsplineRef.hpp:242-262 - exec7.777.707.7012.2212.2211.6911.69640.190.30396.69
Loop 815 - MultiBsplineRef.hpp:276-286 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 816 - MultiBsplineRef.hpp:276-286 - exec0.200.150.150.310.310.230.23640.030.04280.42
n_interleave_kernel_d8libarmpl_lp64.so6.706.620.0210.551.8210.040.03640.170.27Math (%): 100.000.00
qmcplusplus::SoaDistanceTableABOMPTarget<double, 3u, 40>::evaluate(qmcplusplus::ParticleSet&)+exec4.284.150.016.730.906.300.01640.120.19Exe (%): 100.00380.51Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 2057 - SoaDistanceTableABOMPTarget.h:194-196 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 2056 - SoaDistanceTableABOMPTarget.h:194-196 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 2052 - SoaDistanceTableABOMPTarget.h:214-228 - exec [...]+0.034.140.016.790.056.290.02480.010.0167.24
Loop 2053 - ParticleBConds3DSoa.h:280-298 - exec [...]+0.044.130.026.740.066.270.03540.010.01185.51
Loop 2055 - ParticleBConds3DSoa.h:280-298 - exec4.244.114.116.686.686.246.24640.120.19383.05
Loop 2054 - SoaDistanceTableABOMPTarget.h:228-228 - exec [...]0.000.000.000.000.000.000.0000.000.000.00
void armpl::clag::gemv_a_cntg_first<false, armpl::clag::spec::sve_architecture_spec, double, double, double, double>(long, long, double, double const*, long, long, double const*, long, double, double*, long)libarmpl_lp64.so2.152.060.013.380.633.130.01640.080.13Math (%): 100.0091.90
down_read_trylockkernel2.241.721.723.53165.902.612.61640.310.47System (%): 99.98
Math (%): 0.02
0.00
void qmcplusplus::DTD_BConds<double, 3u, 40>::computeDistances<qmcplusplus::TinyVector<double, 3u>, qmcplusplus::VectorSoAContainer<double, 3u, qmcplusplus::Mallocator<double, 32ul> >, qmcplusplus::VectorSoAContainer<double, 3...+exec1.581.470.002.480.392.240.01640.080.12Exe (%): 100.00550.69Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 1262 - ParticleBConds3DSoa.h:235-255 - exec1.571.471.472.482.482.232.23640.080.12552.13
Loop 1263 - ParticleBConds3DSoa.h:234-255 - exec0.000.000.000.000.000.000.0000.000.000.00
miniqmcreference::TwoBodyJastrowRef<qmcplusplus::BsplineFunctor<double> >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector<double, std::allocator<double> >&)+exec1.551.460.012.431.312.210.02640.070.10Exe (%): 100.0049.76Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 323 - BsplineFunctor.h:233-260 - exec [...]+0.041.440.022.540.062.190.03540.010.0175.25
Loop 324 - BsplineFunctor.h:233-260 - exec [...]+1.431.421.372.482.252.162.07640.070.1031.77
Loop 325 - BsplineFunctor.h:233-260 - exec [...]+0.030.060.010.230.050.090.02420.010.0129.89
Loop 326 - BsplineFunctor.h:246-260 - exec [...]0.060.040.040.100.100.060.06640.010.02690.67
Loop 322 - BsplineFunctor.h:233-236 - exec [...]+0.030.010.010.070.040.010.01260.000.0111.74
Loop 321 - TwoBodyJastrowRef.h:107-127 - exec [...]0.020.000.000.040.040.000.00100.000.0112.49
Loop 327 - BsplineFunctor.h:246-260 - exec [...]0.000.000.000.000.000.000.0000.000.000.00
auto armpl::clag::execute_strategy<15ul, std::tuple<armpl::clag::matmul::set_or_scale, armpl::clag::matmul::compressed_general_matrix_vector, armpl::clag::matmul::symmetric_matrix_vector, armpl::clag::matmul::compressed_symmetric_matrix_vector, armpl...libarmpl_lp64.so1.541.430.012.421.222.170.02640.070.10Math (%): 100.000.73
void armpl::clag::gemv_a_strd_first_impl<false, armpl::clag::spec::sve_architecture_spec, double, long, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::clag::(anonymous namespace)::step_val_fixed<1l>, double, armpl::cl...libarmpl_lp64.so1.331.250.012.101.341.900.02640.070.11Math (%): 100.00117.29
miniqmcreference::TwoBodyJastrowRef<qmcplusplus::BsplineFunctor<double> >::acceptMove(qmcplusplus::ParticleSet&, int)+exec0.890.850.031.402.741.290.04640.050.07Exe (%): 100.00112.44Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 316 - TwoBodyJastrowRef.h:324-331 - exec0.240.180.180.390.390.270.27640.030.04199.69
Loop 311 - TwoBodyJastrowRef.h:342-347 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 315 - TwoBodyJastrowRef.h:342-347 - exec0.250.220.220.400.400.330.33640.020.0491.47
Loop 313 - TwoBodyJastrowRef.h:342-347 - exec0.270.210.210.420.420.320.32640.020.0494.87
Loop 317 - TwoBodyJastrowRef.h:269-274 - exec [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 309 - TwoBodyJastrowRef.h:324-331 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 318 - TwoBodyJastrowRef.h:269-274 - exec [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 314 - TwoBodyJastrowRef.h:342-347 - exec0.250.220.220.400.400.330.33640.020.0392.19
Loop 312 - TwoBodyJastrowRef.h:342-347 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 310 - TwoBodyJastrowRef.h:342-347 - exec0.000.000.000.000.000.000.0000.000.000.00
up_readkernel0.770.690.691.2266.151.041.04640.070.10System (%): 99.99
Math (%): 0.01
0.00
kmp_flag_64<false, true>::wait(kmp_info*, int, void*)libomp.so1.300.630.632.0460.520.950.95640.210.31OMP (%): 100.000.00
__memcpylibastring.so0.640.570.571.0054.710.860.86640.040.06String (%): 99.01
Math (%): 0.99
0.04
miniqmcreference::DiracDeterminantRef<qmcplusplus::DelayedUpdate<double, double> >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib<qmcplusplus::TinyVector<double, 3u>, std::allocator<qmcplusplus::TinyVector<...+exec0.610.570.000.950.200.860.00640.040.05Exe (%): 100.0056.44Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 920 - OhmmsVector.h:223-223 - exec [...]+0.000.560.001.050.000.850.0000.000.000.00
Loop 922 - inner_product.hpp:82-155 - exec [...]+0.000.200.000.390.000.300.0000.000.000.00
Loop 919 - inner_product.hpp:82-83 - exec0.240.200.200.390.390.300.30640.020.0340.01
Loop 921 - inner_product.hpp:155-155 - exec [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 923 - inner_product.hpp:82-83 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 918 - inner_product.hpp:155-155 - exec [...]0.420.360.360.670.670.550.55640.030.0465.35
rmqueue_pcplistkernel0.780.470.471.2445.050.710.71610.250.38System (%): 100.000.00
qmcplusplus::BsplineFunctor<double>::evaluateVGL(int, int, int, double const*, double*, double*, double*, double*, int*) const+exec0.510.430.030.802.710.650.04640.040.06Exe (%): 100.0094.85Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 272 - BsplineFunctor.h:291-298 - exec0.440.360.360.690.690.550.55640.040.0544.71
Loop 271 - BsplineFunctor.h:303-336 - exec [...]0.080.040.040.120.120.060.06640.010.02607.75
miniqmcreference::TwoBodyJastrowRef<qmcplusplus::BsplineFunctor<double> >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector<double, 3u>&)+exec0.440.370.030.702.640.570.04640.040.06Exe (%): 100.00149.88Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 300 - TwoBodyJastrowRef.h:155-156 - exec0.160.120.120.250.250.180.18640.020.03136.36
Loop 305 - TwoBodyJastrowRef.h:155-156 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 301 - TwoBodyJastrowRef.h:155-156 - exec0.160.110.110.250.250.170.17640.020.03143.35
Loop 308 - TwoBodyJastrowRef.h:269-274 - exec [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 302 - TwoBodyJastrowRef.h:155-156 - exec0.160.100.100.260.260.150.15640.020.03156.47
Loop 303 - TwoBodyJastrowRef.h:155-156 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 306 - stl_numeric.h:140-141 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 307 - stl_numeric.h:140-141 - exec0.030.020.020.050.050.020.02500.010.01374.94
Loop 304 - TwoBodyJastrowRef.h:155-156 - exec0.000.000.000.000.000.000.0000.000.000.00
miniqmcreference::DiracDeterminantRef<qmcplusplus::DelayedUpdate<double, double> >::evalGrad(qmcplusplus::ParticleSet&, int)+exec0.460.350.000.730.300.540.00640.040.06Exe (%): 100.0067.76Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 910 - inner_product.hpp:155-155 - exec [...]0.460.350.350.730.730.530.53640.040.0668.11
Loop 911 - inner_product.hpp:155-155 - exec [...]0.000.000.000.000.000.000.0000.000.000.00
qmcplusplus::SPOSet::evaluateDetRatios(qmcplusplus::VirtualParticleSet const&, qmcplusplus::Vector<double, std::allocator<double> >&, qmcplusplus::Vector<double, std::allocator<double> > const&, std::vector<double, st...+exec0.400.340.000.630.250.520.00640.030.05Exe (%): 100.0093.46Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 804 - stl_vector.h:1131-1131 - exec [...]+0.000.340.000.650.000.520.0000.000.000.00
Loop 802 - inner_product.hpp:82-83 - exec0.370.320.320.580.580.480.48640.030.0597.96
Loop 803 - inner_product.hpp:82-83 - exec [...]+0.040.020.020.070.070.040.04600.010.0141.99
Loop 805 - inner_product.hpp:82-83 - exec0.000.000.000.000.000.000.0000.000.000.00
miniqmcreference::einspline_spo_ref<double>::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector<double, std::allocator<double> >&, qmcplusplus::Vector<qmcplusplus::TinyVector<double, 3u>, std::allocator<...+exec0.370.310.000.570.160.480.00640.040.06Exe (%): 100.000.00Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 800 - einspline_spo_ref.hpp:219-227 - exec [...]+0.000.310.000.570.000.470.0000.000.000.00
Loop 801 - einspline_spo_ref.hpp:223-227 - exec [...]0.370.310.310.570.570.470.47640.040.060.00
__memsetlibastring.so0.350.280.280.5527.340.430.43640.030.05String (%): 99.95
Math (%): 0.05
0.26
dswap_libarmpl_lp64.so0.220.160.000.340.180.250.00640.020.03Math (%): 100.000.00
auto armpl::clag::execute_strategy<16ul, std::tuple<armpl::clag::matmul::set_or_scale, armpl::clag::matmul::compressed_general_matrix_vector, armpl::clag::matmul::symmetric_matrix_vector, armpl::clag::matmul::compressed_symmetric_matrix_vector, armpl...libarmpl_lp64.so0.200.140.020.311.520.220.02640.020.03Math (%): 100.000.57
miniqmcreference::DiracDeterminantRef<qmcplusplus::DelayedUpdate<double, double> >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector<double, 3u>&)+exec0.160.120.010.250.540.190.01640.020.03Exe (%): 100.00260.15Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 912 - inner_product.hpp:82-83 - exec0.100.060.060.150.150.090.09640.020.02128.67
Loop 913 - inner_product.hpp:155-155 - exec [...]0.080.060.060.130.130.080.08640.010.02426.30
Loop 914 - inner_product.hpp:155-155 - exec [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 915 - inner_product.hpp:82-83 - exec0.000.000.000.000.000.000.0000.000.000.00
arch_local_irq_enablekernel0.190.100.100.309.990.160.16640.030.05OMP (%): 92.44
System (%): 7.46
Math (%): 0.10
0.00
qmcplusplus::NonLocalPP<double>::evaluate(qmcplusplus::ParticleSet const&, qmcplusplus::WaveFunction&)+exec0.120.090.000.190.460.140.01640.020.02Exe (%): 100.007.67Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 61 - OhmmsVector.h:144-210 - exec [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 63 - NonLocalPP.hpp:122-135 - exec [...]+0.120.090.080.220.190.130.13640.020.028.14
Loop 62 - ParticleSet.h:277-277 - exec [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 64 - NonLocalPP.hpp:131-132 - exec [...]0.020.000.000.030.030.000.00140.000.001.01
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check()libomp.so0.140.080.080.227.290.110.11640.030.04OMP (%): 100.000.00
miniqmcreference::DiracDeterminantRef<qmcplusplus::DelayedUpdate<double, double> >::evaluateLog(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib<qmcplusplus::TinyVector<double, 3u>, std::allocator<qmcplusplus::TinyVector&l...+exec0.110.070.000.170.280.110.00640.010.02Exe (%): 100.0087.62Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 906 - inner_product.hpp:82-155 - exec [...]+0.000.070.000.200.000.110.0000.000.000.00
Loop 904 - inner_product.hpp:155-155 - exec [...]0.090.060.060.140.140.090.09640.010.0284.49
Loop 908 - inner_product.hpp:82-155 - exec [...]+0.000.010.000.060.000.020.0000.000.000.00
Loop 905 - inner_product.hpp:82-83 - exec0.040.010.010.060.060.020.02460.010.0184.62
Loop 907 - inner_product.hpp:155-155 - exec [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 909 - inner_product.hpp:82-83 - exec0.000.000.000.000.000.000.0000.000.000.00
el0_svc_common.constprop.0kernel0.140.060.060.226.250.100.10640.030.04OMP (%): 99.92
System (%): 0.08
0.00
miniqmcreference::TwoBodyJastrowRef<qmcplusplus::BsplineFunctor<double> >::recompute(qmcplusplus::ParticleSet&)+exec0.110.060.030.183.170.100.05640.020.03Exe (%): 100.00249.36Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 333 - TwoBodyJastrowRef.h:268-398 - exec [...]+0.000.030.000.300.000.050.0000.000.000.00
Loop 335 - TwoBodyJastrowRef.h:268-398 - exec [...]+0.000.030.000.300.000.050.0000.000.000.00
Loop 346 - TwoBodyJastrowRef.h:397-398 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 348 - TwoBodyJastrowRef.h:397-398 - exec0.020.000.000.020.020.000.0040.000.00109.26
Loop 347 - TwoBodyJastrowRef.h:397-398 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 350 - TwoBodyJastrowRef.h:397-398 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 334 - TwoBodyJastrowRef.h:388-391 - exec0.020.010.010.030.030.010.01240.000.01234.45
Loop 354 - TwoBodyJastrowRef.h:397-398 - exec0.010.000.000.020.020.000.0080.000.00115.48
Loop 349 - TwoBodyJastrowRef.h:397-398 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 351 - TwoBodyJastrowRef.h:397-398 - exec0.020.000.000.030.030.000.0080.000.0083.43
Loop 332 - TwoBodyJastrowRef.h:268-388 - exec [...]+0.000.020.000.190.000.030.0000.000.000.00
Loop 330 - TwoBodyJastrowRef.h:361-364 - exec [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 338 - TwoBodyJastrowRef.h:381-382 - exec0.030.010.010.060.060.010.01260.010.0196.72
Loop 343 - TwoBodyJastrowRef.h:375-376 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 339 - TwoBodyJastrowRef.h:381-382 - exec0.040.010.010.070.070.010.01240.010.0187.34
Loop 340 - TwoBodyJastrowRef.h:381-382 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 342 - TwoBodyJastrowRef.h:381-382 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 344 - stl_numeric.h:140-141 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 345 - TwoBodyJastrowRef.h:269-274 - exec [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 331 - stl_numeric.h:140-141 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 341 - TwoBodyJastrowRef.h:381-382 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 336 - TwoBodyJastrowRef.h:375-376 - exec0.010.000.000.020.020.000.0020.000.00199.79
Loop 337 - TwoBodyJastrowRef.h:381-382 - exec0.030.010.010.040.040.010.01320.000.0188.02
Loop 352 - TwoBodyJastrowRef.h:397-398 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 353 - TwoBodyJastrowRef.h:397-398 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 329 - TwoBodyJastrowRef.h:388-391 - exec0.000.000.000.000.000.000.0000.000.000.00
qmcplusplus::DiracMatrix<double, double>::invert_transpose(qmcplusplus::Matrix<double, std::allocator<double> > const&, qmcplusplus::Matrix<double, std::allocator<double> >&, double&, double&)+exec0.070.060.000.110.010.090.00640.010.01Exe (%): 100.000.00Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 890 - DiracMatrix.h:112-113 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 891 - inner_product.hpp:210-212 - exec+0.000.060.000.110.000.090.0000.000.000.00
Loop 894 - inner_product.hpp:211-212 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 893 - inner_product.hpp:211-212 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 892 - inner_product.hpp:211-212 - exec0.070.060.060.110.110.090.09640.010.010.00
Loop 889 - DiracMatrix.h:31-35 - exec [...]0.000.000.000.000.000.000.0000.000.000.00
miniqmcreference::OneBodyJastrowRef<qmcplusplus::BsplineFunctor<double> >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector<double, 3u>&)+exec0.090.060.020.132.020.090.03640.010.02Exe (%): 100.00120.67Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 228 - OneBodyJastrowRef.h:192-193 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 226 - OneBodyJastrowRef.h:192-193 - exec0.030.010.010.040.040.010.01330.000.0170.00
Loop 221 - stl_numeric.h:140-141 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 223 - OneBodyJastrowRef.h:186-187 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 225 - OneBodyJastrowRef.h:192-193 - exec0.030.010.010.060.060.010.01350.010.0177.74
Loop 224 - OneBodyJastrowRef.h:192-193 - exec0.040.020.020.060.060.030.03560.010.0174.28
Loop 229 - OneBodyJastrowRef.h:192-193 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 222 - stl_numeric.h:140-141 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 230 - OneBodyJastrowRef.h:186-187 - exec0.000.000.000.000.000.000.0000.000.000.00
Loop 227 - OneBodyJastrowRef.h:192-193 - exec0.000.000.000.000.000.000.0000.000.000.00
__pi_clear_pagekernel0.090.060.060.145.430.090.09640.020.03System (%): 99.91
Math (%): 0.09
0.00
qmcplusplus::BsplineFunctor<double>::evaluateV(int, int, int, double const*, double*) const+exec0.060.040.010.100.650.070.01640.010.01Exe (%): 100.007.13Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 276 - BsplineFunctor.h:246-260 - exec [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 275 - BsplineFunctor.h:246-260 - exec [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 277 - BsplineFunctor.h:236-241 - exec0.050.040.040.080.080.050.05640.010.016.66
el0_dakernel0.060.040.040.093.830.060.06640.010.01System (%): 99.74
Math (%): 0.26
0.96
qmcplusplus::TimerType<std::chrono::_V2::system_clock>::start()+exec0.060.030.030.093.210.050.05640.010.01Exe (%): 100.004.26Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 1474 - NewTimer.cpp:53-54 - exec0.000.000.000.000.000.000.0000.000.000.00
miniqmcreference::einspline_spo_ref<double>::evaluate_v(qmcplusplus::ParticleSet const&, int)+exec0.060.030.030.092.590.050.04630.010.02Exe (%): 100.006.62Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 808 - einspline_spo_ref.hpp:175-176 - exec [...]0.030.000.000.040.040.000.00120.000.013.97
folio_batch_move_lrukernel0.050.030.030.092.430.040.04590.010.01System (%): 99.60
Math (%): 0.40
0.00
__GI___sched_yieldlibc.so.60.050.020.020.082.240.040.04590.010.01OMP (%): 100.000.00
void armpl::clag::lu_unblocked_direct_kernel<armpl::clag::spec::sve_architecture_spec, double, int>(long, long, double*, long, int*, int&)libarmpl_lp64.so0.050.020.010.080.520.030.01580.010.01Math (%): 100.00314.07
rwsem_down_read_slowpathkernel0.040.020.020.061.980.030.03570.010.01System (%): 100.000.00
qmcplusplus::TimerType<std::chrono::_V2::system_clock>::stop()+exec0.230.020.020.351.490.030.02510.030.05Exe (%): 100.001.32Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 1475 - NewTimer.h:119-119 - exec [...]+0.000.000.000.080.000.000.0000.000.000.00
Loop 1476 - stl_tree.h:1947-1947 - exec [...]0.050.000.000.080.080.000.0010.000.000.00
Loop 1477 - NewTimer.h:119-119 - exec [...]+0.000.000.000.170.000.000.0000.000.000.00
Loop 1478 - stl_tree.h:1947-1947 - exec [...]0.110.000.000.170.170.000.0010.000.000.00
Loop 1479 - NewTimer.cpp:99-100 - exec0.000.000.000.000.000.000.0000.000.000.00
do_raw_spin_lockkernel0.050.020.020.081.650.030.03510.010.01System (%): 99.15
Math (%): 0.85
0.00
daxpby_sve_kernellibarmpl_lp64.so0.040.020.010.060.560.020.01530.010.01Math (%): 100.00227.18
release_pageskernel0.040.020.020.071.490.020.02520.010.01System (%): 100.000.00
auto armpl::clag::execute_strategy<10ul, std::tuple<armpl::clag::matmul::set_or_scale, armpl::clag::matmul::compressed_general_matrix_vector, armpl::clag::matmul::symmetric_matrix_vector, armpl::clag::matmul::compressed_symmetric_matrix_vector, armpl...libarmpl_lp64.so0.030.010.010.051.060.020.02410.010.01Math (%): 100.008.93
miniqmcreference::OneBodyJastrowRef<qmcplusplus::BsplineFunctor<double> >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector<double, std::allocator<double> >&)+exec0.030.010.000.050.050.020.00400.010.01Exe (%): 100.002.48Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 245 - OneBodyJastrowRef.h:134-135 - exec [...]0.030.010.010.050.050.020.02380.010.012.13
__kmp_get_global_thread_id_reglibomp.so0.030.010.010.041.070.020.02400.000.01OMP (%): 100.003.88
qmcplusplus::WaveFunction::evalGrad(qmcplusplus::ParticleSet&, int)+exec0.030.010.000.040.250.020.00430.000.01Exe (%): 100.001.27Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm ...
Loop 114 - WaveFunction.cpp:185-188 - exec [...]0.020.010.010.040.040.010.01350.000.011.16
×