options

Loops Index

Colums Filter

Level Coverage run_0 (%) Max Time Over Threads run_0 (s) Time w.r.t. Wall Time run_0 (s) Nb Threads run_0 Vectorization Ratio (%) Vector Length Use (%) Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing run_0 Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect Speedup If Data in L1 run_0
Loop idSource LocationSource FunctionLevelCoverage run_0 (%)Max Time Over Threads run_0 (s)Time w.r.t. Wall Time run_0 (s)Nb Threads run_0Vectorization Ratio (%)Vector Length Use (%)Speedup If No Scalar IntegerSpeedup If FP VectorizedSpeedup If Fully VectorizedSpeedup If Perfect Load Balancing run_0Stride 0Stride 1Stride nStride UnknownStride IndirectSpeedup If Data in L1 run_0
672exec - MultiBsplineRef.hpp:70-73miniqmcreference::einspline_spo_ref<double>::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector<double, std::allocator<double> >&)Innermost33.090.220.2211001001111050007.45
679exec - MultiBsplineRef.hpp:249-270 [...]miniqmcreference::einspline_spo_ref<double>::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector<double, std::allocator<double> >&, qmcplusplus::Vector<qmcplusplus::TinyVector<double, 3u>, std::allocator<...Innermost23.530.160.1611001001.11111101005.1
971exec - ParticleBConds.h:185-217void qmcplusplus::DTD_BConds<double, 3u, 39>::computeDistances<qmcplusplus::TinyVector<double, 3u>, qmcplusplus::VectorSoAContainer<double, 3u, 64ul, qmcplusplus::Mallocator<double, 64ul> >, qmcplusplus::VectorSoAContainer<dou...Single11.030.070.07190.9189.221.0311.011900011.1
654exec - BsplineAllocator.hpp:179-180qmcplusplus::BsplineAllocator<double, 64ul, qmcplusplus::Mallocator<double, 64ul> >::setCoefficientsForOrbitals(int, int, Array<double, 3u>&, multi_UBspline_3d_d*) [clone .extracted]Innermost2.940.020.02110010011110200011337.47
230exec - BsplineFunctor.h:236-241qmcplusplus::BsplineFunctor<double>::evaluateV(int, int, int, double const*, double*) constSingle2.210.010.01192.6871.671.1511.531010.50.50NA
228exec - BsplineFunctor.h:246-260qmcplusplus::BsplineFunctor<double>::evaluateV(int, int, int, double const*, double*) constSingle1.470.010.01110089.391.0311101002NA
780exec - inner_product.hpp:81-82 [...]miniqmcreference::DiracDeterminantRef<qmcplusplus::DelayedUpdate<double, double> >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib<qmcplusplus::TinyVector<double, 3u>, std::allocator<qmcplusplus::TinyVector<...Innermost1.470.010.0112515.63126.86102100NA
1264exec - __intel_avx_rep_memsetSingle0.7400110050112001000NA
1263exec - __intel_avx_rep_memcpySingle0.7400110050112002000NA
778exec - inner_product.hpp:81-82miniqmcreference::DiracDeterminantRef<qmcplusplus::DelayedUpdate<double, double> >::ratio(qmcplusplus::ParticleSet&, int)Single0.74001100100111002000NA
991exec - ParticleIOUtility.h:70-91 [...]void qmcplusplus::expandSuperCell<qmcplusplus::ParticleSet>(qmcplusplus::ParticleSet&, qmcplusplus::Tensor<int, 3u> const&)Innermost0.7400142.6217.321.581.697.720NANANANANANA
241exec - TwoBodyJastrowRef.h:153-154miniqmcreference::TwoBodyJastrowRef<qmcplusplus::BsplineFunctor<double> >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector<double, 3u>&)Innermost0.74001100100111002000NA
×