options

Expert Summary

Colums Filter

Source Function Level Max Time Over Threads (s) Time w.r.t. Wall Time (s) Coverage (% app. time) Speedup if no scalar integer Speedup if FP arith vectorized Speedup if fully vectorized Speedup if FP only Number of paths Vectorization Ratio (%) Vector Length Use (%) ORIG / DL1 DL1/CQA(DL1) ORIG (cycles per iteration) STA (ORIG) DL1 (cycles per iteration) STA (DL1) CQA cycles CQA cycles if no scalar integer CQA cycles if FP arith vectorized CQA cycles if fully vectorized CQA cycles if FP only Instance Count min (Iteration count) avg (Iteration count) max (Iteration count) min (Cycles per Iteration) avg (Cycles per Iteration) max (Cycles per Iteration) Nb FP_ADD / CPI Nb FP_MUL / CPI CAP(FP) BW(FP) SAT(FP) CAP(L1R) BW(L1R) SAT(L1R) CAP(L1W) BW(L1W) SAT(L1W) CAP(L2) BW(L2) SAT(L2) CAP(L3) BW(L3) SAT(L3) CAP(RAM_R) CAP(RAM_W)
IDModuleSource LocationSource FunctionLevelMax Time Over Threads (s)Time w.r.t. Wall Time (s)Coverage (% app. time)Speedup if no scalar integerSpeedup if FP arith vectorizedSpeedup if fully vectorizedSpeedup if FP onlyNumber of pathsVectorization Ratio (%)Vector Length Use (%)ORIG / DL1DL1/CQA(DL1)ORIG (cycles per iteration)STA (ORIG)DL1 (cycles per iteration)STA (DL1)CQA cyclesCQA cycles if no scalar integerCQA cycles if FP arith vectorizedCQA cycles if fully vectorizedCQA cycles if FP onlyInstance Countmin (Iteration count)avg (Iteration count)max (Iteration count)min (Cycles per Iteration)avg (Cycles per Iteration)max (Cycles per Iteration)Nb FP_ADD / CPINb FP_MUL / CPICAP(FP)BW(FP)SAT(FP)CAP(L1R)BW(L1R)SAT(L1R)CAP(L1W)BW(L1W)SAT(L1W)CAP(L2)BW(L2)SAT(L2)CAP(L3)BW(L3)SAT(L3)CAP(RAM_R)CAP(RAM_W)
Loop 672execMultiBsplineRef.hpp:70-73miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&)Innermost0.220.2233.091.001.001.001.301100.00100.007.452.0252.790.347.080.523.253.253.253.252.502526724848485.6735.461806.540.000.151.36168.526.06649.471.21323.79NA32NANA15NANANA
Bucket 6MultiBsplineRef.hpp:70-73miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&)75.371.001.001.001.301100.00100.007.452.0252.790.347.080.523.253.253.253.252.500.000.151.36168.526.06649.471.21323.79NA32NANA15NANANA
Bucket 5MultiBsplineRef.hpp:70-73miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&)18.931.001.001.001.301100.00100.004.632.0533.211.227.170.523.253.253.253.252.500.000.242.171613.559.646415.061.93326.02NA32NANA15NANANA
Bucket 7MultiBsplineRef.hpp:70-73miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&)2.841.001.001.001.301100.00100.007.492.0754.290.447.250.543.253.253.253.252.500.000.151.33168.295.89649.211.18323.68NA32NANA15NANANA
Bucket 4MultiBsplineRef.hpp:70-73miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&)1.891.001.001.001.301100.00100.002.142.0215.130.577.080.503.253.253.253.252.500.000.534.761629.7521.166433.064.233213.22NA32NANA15NANANA
Bucket 3MultiBsplineRef.hpp:70-73miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&)0.71.001.001.001.301100.00100.00NANANANANANA3.253.253.253.252.50NANANA16NANA64NANA32NANA32NANA15NANANA
Bucket 9MultiBsplineRef.hpp:70-73miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&)0.191.001.001.001.301100.00100.00NANANANANANA3.253.253.253.252.50NANANA16NANA64NANA32NANA32NANA15NANANA
Loop 679execTinyVectorOps.h:59-59,MultiBsplineData.hpp:71-71,MultiBsplineRef.hpp:249-270miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&, qmcplusplus::Vector, std::allocator > >&, qmcplusplus::Vector >&)Innermost0.160.1623.531.101.001.001.171100.00100.005.101.2097.830.2619.170.0813.5012.2513.5013.5011.507372848484828.0871.46787.880.000.333.431621.479.246414.446.543220.44NA32NANA15NANANA
Bucket 7TinyVectorOps.h:59-59,MultiBsplineData.hpp:71-71,MultiBsplineRef.hpp:249-270miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&, qmcplusplus::Vector, std::allocator > >&, qmcplusplus::Vector >&)72.311.101.001.001.171100.00100.005.101.2097.830.2619.170.0813.5012.2513.5013.5011.500.000.333.431621.479.246414.446.543220.44NA32NANA15NANANA
Bucket 6TinyVectorOps.h:59-59,MultiBsplineData.hpp:71-71,MultiBsplineRef.hpp:249-270miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&, qmcplusplus::Vector, std::allocator > >&, qmcplusplus::Vector >&)27.331.101.001.001.171100.00100.003.241.2061.960.2019.130.0513.5012.2513.5013.5011.500.000.525.421633.9014.596422.8010.333232.28NA32NANA15NANANA
Bucket 9TinyVectorOps.h:59-59,MultiBsplineData.hpp:71-71,MultiBsplineRef.hpp:249-270miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&, qmcplusplus::Vector, std::allocator > >&, qmcplusplus::Vector >&)0.231.101.001.001.171100.00100.00NANANANANANA13.5012.2513.5013.5011.50NANANA16NANA64NANA32NANA32NANA15NANANA
Loop 971execParticleBConds.h:185-217void qmcplusplus::DTD_BConds::computeDistances, qmcplusplus::VectorSoAContainer >, qmcplusplus::VectorSoAContainer > >(qmcplusplus::TinyVector const&, qmcplusplus::VectorSoAContainer > const&, double*, qmcplusplus::VectorSoAContainer >&, int, int, int) constSingle0.070.0711.031.031.001.011.44190.9189.221.101.5992.440.0284.150.0256.0054.5056.0055.5039.0047712852.719684.589.421615.52.341.479.261657.8918.356428.672.77328.65NA32NANA15NANANA
Bucket 7ParticleBConds.h:185-217void qmcplusplus::DTD_BConds::computeDistances, qmcplusplus::VectorSoAContainer >, qmcplusplus::VectorSoAContainer > >(qmcplusplus::TinyVector const&, qmcplusplus::VectorSoAContainer > const&, double*, qmcplusplus::VectorSoAContainer >&, int, int, int) const98.691.031.001.011.44190.9189.221.101.5992.440.0284.150.0256.0054.5056.0055.5039.002.341.479.261657.8918.356428.672.77328.65NA32NANA15NANANA
Bucket 8ParticleBConds.h:185-217void qmcplusplus::DTD_BConds::computeDistances, qmcplusplus::VectorSoAContainer >, qmcplusplus::VectorSoAContainer > >(qmcplusplus::TinyVector const&, qmcplusplus::VectorSoAContainer > const&, double*, qmcplusplus::VectorSoAContainer >&, int, int, int) const1.071.031.001.011.44190.9189.221.751.75108.000.1792.500.1156.0054.5056.0055.5039.002.001.267.931649.5515.706424.542.37327.41NA32NANA15NANANA
Loop 654execBsplineAllocator.hpp:179-180qmcplusplus::BsplineAllocator >::setCoefficientsForOrbitals(int, int, Array&, multi_UBspline_3d_d*) [clone .extracted]Innermost0.020.022.941.001.001.001.251100.00100.0011337.472.1436374.380.063.210.571.251.251.251.251.00640004848483.0862.8637258.170.000.000.00160.000.00640.000.00320.01NA32NANA15NANANA
Bucket 12BsplineAllocator.hpp:179-180qmcplusplus::BsplineAllocator >::setCoefficientsForOrbitals(int, int, Array&, multi_UBspline_3d_d*) [clone .extracted]80.921.001.001.001.251100.00100.0011337.472.1436374.380.063.210.571.251.251.251.251.000.000.000.00160.000.00640.000.00320.01NA32NANA15NANANA
Bucket 4BsplineAllocator.hpp:179-180qmcplusplus::BsplineAllocator >::setCoefficientsForOrbitals(int, int, Array&, multi_UBspline_3d_d*) [clone .extracted]8.721.001.001.001.251100.00100.002.533.1411.920.464.710.401.251.251.251.251.000.000.670.67164.205.37648.395.373216.78NA32NANA15NANANA
Bucket 5BsplineAllocator.hpp:179-180qmcplusplus::BsplineAllocator >::setCoefficientsForOrbitals(int, int, Array&, multi_UBspline_3d_d*) [clone .extracted]4.841.001.001.001.251100.00100.005.893.0326.751.664.540.351.251.251.251.251.000.000.300.30161.872.39643.742.39327.48NA32NANA15NANANA
Bucket 3BsplineAllocator.hpp:179-180qmcplusplus::BsplineAllocator >::setCoefficientsForOrbitals(int, int, Array&, multi_UBspline_3d_d*) [clone .extracted]3.491.001.001.001.251100.00100.001.892.978.420.054.460.321.251.251.251.251.000.000.950.95165.947.606411.887.603223.76NA32NANA15NANANA
Bucket 8BsplineAllocator.hpp:179-180qmcplusplus::BsplineAllocator >::setCoefficientsForOrbitals(int, int, Array&, multi_UBspline_3d_d*) [clone .extracted]1.441.001.001.001.251100.00100.0040.572.97180.880.044.460.321.251.251.251.251.000.000.040.04160.280.35640.550.35321.11NA32NANA15NANANA
Bucket 6BsplineAllocator.hpp:179-180qmcplusplus::BsplineAllocator >::setCoefficientsForOrbitals(int, int, Array&, multi_UBspline_3d_d*) [clone .extracted]0.151.001.001.001.251100.00100.00NANANANANANA1.251.251.251.251.00NANANA16NANA64NANA32NANA32NANA15NANANA
Bucket 2BsplineAllocator.hpp:179-180qmcplusplus::BsplineAllocator >::setCoefficientsForOrbitals(int, int, Array&, multi_UBspline_3d_d*) [clone .extracted]0.021.001.001.001.251100.00100.00NANANANANANA1.251.251.251.251.00NANANA16NANA64NANA32NANA32NANA15NANANA
Loop 230execBsplineFunctor.h:236-241qmcplusplus::BsplineFunctor::evaluateV(int, int, int, double const*, double*) constSingle0.010.012.211.151.001.5311.50292.6871.67NANANANANANA11.5010.0011.507.531.00NANANANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 780execOperatorTags.h:63-63,inner_product.hpp:81-82,DiracDeterminantRef.cpp:157-157miniqmcreference::DiracDeterminantRef >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&, bool)Innermost0.010.011.471.002.006.861.00125.0015.63NANANANANANA8.008.004.001.178.00NANANANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 228execBsplineFunctor.h:246-260qmcplusplus::BsplineFunctor::evaluateV(int, int, int, double const*, double*) constSingle0.010.011.471.031.001.001.061100.0089.39NANANANANANA17.0016.5017.0017.0016.00NANANANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 991execostream:667-667,Tensor.h:213-213,OperatorTags.h:43-183,char_traits.h:409-409,ParticleIOUtility.h:70-91,OhmmsVector.h:223-223,TinyVectorTensorOps.h:150-152,InfoStream.h:37-37void qmcplusplus::expandSuperCell(qmcplusplus::ParticleSet&, qmcplusplus::Tensor const&)Innermost000.741.581.697.722.74842.6217.32NANANANANANA26.0016.5015.383.379.50NANANANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 1263exec__intel_avx_rep_memcpySingle000.741.001.002.008.001100.0050.00NANANANANANA8.008.008.004.001.00NANANANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 241execTwoBodyJastrowRef.h:153-154miniqmcreference::TwoBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&)Innermost000.741.001.001.001.001100.00100.00NANANANANANA4.004.004.004.004.00NANANANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 1264exec__intel_avx_rep_memsetSingle000.741.001.002.008.001100.0050.00NANANANANANA8.008.008.004.001.00NANANANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 778execinner_product.hpp:81-82miniqmcreference::DiracDeterminantRef >::ratio(qmcplusplus::ParticleSet&, int)Single000.741.001.001.001.001100.00100.00NANANANANANA4.004.004.004.004.00NANANANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
×