OV - Compare Loops

Loops

▶MultiBsplineRef.hpp: 68 - 39.19 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/Spline2/MultiBsplineRef.hpp: 68-71						Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/Spline2/MultiBsplineRef.hpp: 68-70
956	17.30	16.43	19.27	100	50	0	924	17.84	17.04	19.92	100	25	0

Sum on 1 analyzed binary loop (exec - 956)							Sum on 1 analyzed binary loop (exec - 924)
Analysis						Count	Analysis						Count
Data Access Issues							Data Access Issues
Presence of constant non-unit stride data access						1	Presence of constant non-unit stride data access
Vectorization Roadblocks							Vectorization Roadblocks
Presence of constant non-unit stride data access						1	Presence of constant non-unit stride data access

▶SoaDistanceTableAAOMPTarget.h: 440 - 31.19 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/OhmmsPETE/VectorSoAContainer.h: 244-244 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/OhmmsPETE/VectorSoAContainer.h: 263-263 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/OhmmsPETE/TinyVector.h: 182-182 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Particle/SoaDistanceTableAAOMPTarget.h: 440-442						Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/OhmmsPETE/VectorSoAContainer.h: 244-244 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/OhmmsPETE/VectorSoAContainer.h: 263-263 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/OhmmsPETE/TinyVector.h: 182-182 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/OhmmsPETE/OhmmsVector.h: 223-223 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Particle/SoaDistanceTableAAOMPTarget.h: 440-442
2539	13.69	13.25	15.54	27.27	15.91	0	1962	14.08	13.39	15.65	54.55	15.91	0

Sum on 1 analyzed binary loop (exec - 2539)							Sum on 1 analyzed binary loop (exec - 1962)
Analysis						Count	Analysis						Count
Loop Computation Issues							Loop Computation Issues
Presence of a large number of scalar integer instructions						1	Presence of a large number of scalar integer instructions						1
Data Access Issues							Data Access Issues
Presence of constant non-unit stride data access						1	Presence of constant non-unit stride data access						1
Vectorization Roadblocks							Vectorization Roadblocks
Presence of constant non-unit stride data access						1	Presence of constant non-unit stride data access						1

▶MultiBsplineRef.hpp: 242 - 23.39 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/Spline2/MultiBsplineRef.hpp: 242-262						Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/OhmmsPETE/TinyVector.h: 61-61 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/Spline2/MultiBsplineRef.hpp: 242-262
1021	12.19	11.21	13.15	15.22	14.4	0	933	9.28	8.76	10.24	100	50	0

Sum on 1 analyzed binary loop (exec - 1021)							Sum on 1 analyzed binary loop (exec - 933)
Analysis						Count	Analysis						Count
Data Access Issues							Data Access Issues
More than 10% of the vector loads instructions are unaligned							More than 10% of the vector loads instructions are unaligned						1

▶SoaDistanceTableABOMPTarget.h: 228 - 14.73 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Particle/Lattice/ParticleBConds3DSoa.h: 280-298 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Particle/SoaDistanceTableABOMPTarget.h: 228-228						Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Particle/Lattice/ParticleBConds3DSoa.h: 280-298 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Particle/SoaDistanceTableABOMPTarget.h: 228-228
3109	7.27	6.51	7.64	12.24	14.03	0	2217	6.65	6.06	7.09	12.77	14.1	0

Sum on 1 analyzed binary loop (exec - 3109)							Sum on 1 analyzed binary loop (exec - 2217)
Analysis						Count	Analysis						Count
Loop Computation Issues							Loop Computation Issues
Presence of expensive FP instructions						1	Presence of expensive FP instructions						1

▶BsplineFunctor.h: 236 - 4.09 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/BsplineFunctor.h: 236-241						Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/BsplineFunctor.h: 236-241 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/TwoBodyJastrowRef.h: 107-107 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/OneBodyJastrowRef.h: 134-134
255	0.07	0.03	0.03	0	10	0	410	1.91	1.67	1.95	94.12	47.43	0
382	2.12	1.77	2.07	0	10	0	330	0.05	0.03	0.03	93.33	47.08	0

Sum on 1 analyzed binary loop (exec - 382)							Sum on 1 analyzed binary loop (exec - 410)
Analysis						Count	Analysis						Count
Loop Computation Issues							Loop Computation Issues
Presence of a large number of scalar integer instructions							Presence of a large number of scalar integer instructions						1
Control Flow Issues							Control Flow Issues
Presence of more than 4 paths						1	Presence of more than 4 paths
Data Access Issues							Data Access Issues
Presence of indirect access							Presence of indirect access						1
More than 10% of the vector loads instructions are unaligned							More than 10% of the vector loads instructions are unaligned						1
Presence of special instructions executing on a single port							Presence of special instructions executing on a single port						1
More than 20% of the loads are accessing the stack							More than 20% of the loads are accessing the stack						1
Vectorization Roadblocks							Vectorization Roadblocks
Presence of more than 4 paths						1	Presence of more than 4 paths						0
Presence of indirect access						0	Presence of indirect access						1
Inefficient Vectorization							Inefficient Vectorization
Presence of special instructions executing on a single port							Presence of special instructions executing on a single port						1
Use of masked instructions							Use of masked instructions						1

▶inner_product.hpp: 155 - 2.63 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Platforms/CPU/SIMD/inner_product.hpp: 155-155 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/PETE/OperatorTags.h: 63-63 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/PETE/OperatorTags.h: 92-94						Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Platforms/CPU/SIMD/inner_product.hpp: 155-155 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/PETE/OperatorTags.h: 63-63 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/DiracDeterminantRef.cpp: 109-109 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/DiracDeterminantRef.cpp: 238-238 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/DiracDeterminantRef.cpp: 157-157 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/OhmmsPETE/TinyVector.h: 61-61
1127	0.54	0.41	0.48	100	50	0	1034	0.20	0.13	0.15	33.33	16.67	0
1139	0.09	0.06	0.07	100	50	0	1031	0.12	0.08	0.09	33.33	16.67	0
1131	0.54	0.47	0.55	100	50	0	1033	0.69	0.52	0.60	33.33	16.67	0
1123	0.10	0.05	0.06	100	50	0	1046	0.64	0.53	0.62	33.33	16.67	0

Sum on 1 analyzed binary loop (exec - 1131)							Sum on 2 analyzed binary loops (exec - 1033, exec - 1046)
Analysis						Count	Analysis						Count
Data Access Issues							Data Access Issues
More than 10% of the vector loads instructions are unaligned						1	More than 10% of the vector loads instructions are unaligned						0
Presence of special instructions executing on a single port						1	Presence of special instructions executing on a single port						1
Inefficient Vectorization							Inefficient Vectorization
Presence of special instructions executing on a single port						1	Presence of special instructions executing on a single port						1

▶ParticleBConds3DSoa.h: 235 - 1.48 %

ASM Loop ID	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3		Run icx_4
Loop Source Regions		Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Particle/Lattice/ParticleBConds3DSoa.h: 235-256
		1419	1.54	1.27	1.48	91.26	46.72	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.		Sum on 1 analyzed binary loop (exec - 1419)
Analysis	Count	Analysis						Count
		Loop Computation Issues
		Presence of expensive FP instructions						1
		Data Access Issues
		Presence of indirect access						1
		More than 10% of the vector loads instructions are unaligned						1
		Presence of special instructions executing on a single port						1
		Vectorization Roadblocks
		Presence of indirect access						1
		Inefficient Vectorization
		Presence of special instructions executing on a single port						1

▶TwoBodyJastrowRef.h: 342 - 1.42 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/TwoBodyJastrowRef.h: 342-347						Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/TwoBodyJastrowRef.h: 342-347
619	0.71	0.58	0.69	100	50	0	380	0.30	0.21	0.25	100	50	0
							378	0.32	0.20	0.24	100	50	0
							376	0.29	0.21	0.24	100	50	0

Sum on 1 analyzed binary loop (exec - 619)							No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis						Count	Analysis						Count
Data Access Issues
Presence of constant non-unit stride data access						1
More than 10% of the vector loads instructions are unaligned						1
Vectorization Roadblocks
Presence of constant non-unit stride data access						1

▶ParticleBConds3DSoa.h: 237 - 1.37 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Particle/Lattice/ParticleBConds3DSoa.h: 237-255						Loop Source Regions
2888	0.16	0.09	0.11	100	50	0
2889	0.10	0.04	0.05	100	50	0
2625	0.13	0.07	0.08	100	50	0
2653	0.60	0.47	0.55	100	50	0
2652	0.63	0.49	0.57	100	50	0

Sum on 2 analyzed binary loops (exec - 2653, exec - 2652)							No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis						Count	Analysis	Count
Loop Computation Issues
Presence of expensive FP instructions						1

▶einspline_spo_ref.hpp: 223 - 1.21 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/einspline_spo_ref.hpp: 223-227 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/OhmmsPETE/TinyVector.h: 145-145						Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/OhmmsPETE/VectorSoAContainer.h: 231-231 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/OhmmsPETE/VectorSoAContainer.h: 271-271 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/einspline_spo_ref.hpp: 223-227 /usr/lib/gcc/x86_64-redhat-linux/11/../../../../include/c++/11/bits/stl_vector.h: 1046-1046 /usr/lib/gcc/x86_64-redhat-linux/11/../../../../include/c++/11/bits/stl_algobase.h: 235-235 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/OhmmsPETE/TinyVector.h: 145-145 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/OhmmsPETE/OhmmsVector.h: 223-223
1025	0.66	0.50	0.58	11.11	13.89	0	926	0.73	0.54	0.63	20	13.13	0

Sum on 1 analyzed binary loop (exec - 1025)							Sum on 1 analyzed binary loop (exec - 926)
Analysis						Count	Analysis						Count
Loop Computation Issues							Loop Computation Issues
Presence of a large number of scalar integer instructions							Presence of a large number of scalar integer instructions						1
Data Access Issues							Data Access Issues
Presence of constant non-unit stride data access						1	Presence of constant non-unit stride data access						1
Presence of special instructions executing on a single port						1	Presence of special instructions executing on a single port						1
Vectorization Roadblocks							Vectorization Roadblocks
Presence of constant non-unit stride data access						1	Presence of constant non-unit stride data access						1
Inefficient Vectorization							Inefficient Vectorization
Presence of special instructions executing on a single port						1	Presence of special instructions executing on a single port						1

▶inner_product.hpp: 82 - 1.21 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Platforms/CPU/SIMD/inner_product.hpp: 82-83						Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Platforms/CPU/SIMD/inner_product.hpp: 82-83
1132	0.29	0.22	0.26	100	50	0	936	0.32	0.22	0.26	100	50	0
1124	0.12	0.07	0.08	100	50	0	1037	0.13	0.07	0.08	100	50	0
1138	0.05	0.02	0.02	100	50	0	1029	0.05	0.02	0.02	100	50	0
1126	0.32	0.21	0.25	100	50	0	1048	0.30	0.20	0.24	100	50	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.							No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis						Count	Analysis						Count

▶<unknown>: 0 - 0.98 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions							Loop Source Regions
829	0.01	0.00	0.00	0	0	0	347	0.03	0.01	0.01	0	0	0
1403	0.02	0.00	0.00	0	0	0	1648	0.01	0.00	0.00	0	0	0
244	0.01	0.00	0.00	0	0	0	1662	0.01	0.00	0.00	0	0	0
115	0.00	0.00	0.00	0	0	0	1634	0.00	0.00	0.00	0	0	0
367	0.00	0.00	0.00	0	0	0	1291	0.01	0.00	0.00	0	0	0
362	0.01	0.00	0.00	0	0	0	372	0.01	0.00	0.00	0	0	0
1026	0.01	0.00	0.00	0	0	0	374	0.02	0.00	0.00	0	0	0
374	0.01	0.00	0.00	0	0	0	1040	0.00	0.00	0.00	0	0	0
377	0.01	0.00	0.00	0	0	0	1052	0.00	0.00	0.00	0	0	0
122	0.01	0.00	0.00	0	0	0	1019	0.00	0.00	0.00	0	0	0
1761	0.00	0.00	0.00	0	0	0	385	0.01	0.00	0.00	0	0	0
198	0.02	0.00	0.00	0	0	0	387	0.01	0.00	0.00	0	0	0
196	0.00	0.00	0.00	0	0	0	396	0.00	0.00	0.00	0	0	0
197	0.01	0.00	0.00	0	0	0	398	0.00	0.00	0.00	0	0	0
1410	0.01	0.00	0.00	0	0	0	392	0.00	0.00	0.00	0	0	0
1958	0.00	0.00	0.00	0	0	0	393	0.03	0.01	0.01	0	0	0
3111	0.00	0.00	0.00	0	0	0	140	0.01	0.00	0.00	0	0	0
2212	0.00	0.00	0.00	0	0	0	51	0.03	0.00	0.01	0	0	0
130	0.00	0.00	0.00	0	0	0	407	0.02	0.00	0.00	0	0	0
2686	0.00	0.00	0.00	0	0	0	2224	0.00	0.00	0.00	0	0	0
320	0.00	0.00	0.00	0	0	0	284	0.04	0.01	0.01	0	0	0
1760	0.00	0.00	0.00	0	0	0	403	0.02	0.00	0.00	0	0	0
2885	0.01	0.00	0.00	0	0	0	286	0.03	0.01	0.01	0	0	0
951	0.01	0.00	0.00	0	0	0	139	0.00	0.00	0.00	0	0	0
1454	0.00	0.00	0.00	0	0	0	77	0.00	0.00	0.00	0	0	0
200	0.01	0.00	0.00	0	0	0	128	0.00	0.00	0.00	0	0	0
201	0.00	0.00	0.00	0	0	0	131	0.01	0.00	0.00	0	0	0
1015	0.00	0.00	0.00	0	0	0	129	0.00	0.00	0.00	0	0	0
292	0.01	0.00	0.00	0	0	0	1039	0.01	0.00	0.00	0	0	0
665	0.00	0.00	0.00	0	0	0	278	0.00	0.00	0.00	0	0	0
664	0.00	0.00	0.00	0	0	0	279	0.01	0.00	0.00	0	0	0
155	0.00	0.00	0.00	0	0	0	134	0.01	0.00	0.00	0	0	0
2756	0.01	0.00	0.00	0	0	0	1665	0.04	0.00	0.00	0	0	0
662	0.00	0.00	0.00	0	0	0	1038	0.00	0.00	0.00	0	0	0
148	0.00	0.00	0.00	0	0	0	136	0.00	0.00	0.00	0	0	0
640	0.00	0.00	0.00	0	0	0	938	0.00	0.00	0.00	0	0	0
2126	0.02	0.00	0.00	0	0	0	302	0.01	0.00	0.00	0	0	0
2130	0.05	0.00	0.00	0	0	0	1288	0.01	0.00	0.00	0	0	0
660	0.00	0.00	0.00	0	0	0	1284	0.02	0.00	0.00	0	0	0
1340	0.00	0.00	0.00	0	0	0	1651	0.00	0.00	0.00	0	0	0
659	0.00	0.00	0.00	0	0	0	1041	0.01	0.00	0.00	0	0	0
2685	0.00	0.00	0.00	0	0	0	306	0.00	0.00	0.00	0	0	0
319	0.00	0.00	0.00	0	0	0	305	0.00	0.00	0.00	0	0	0
145	0.00	0.00	0.00	0	0	0	394	0.01	0.00	0.00	0	0	0
202	0.00	0.00	0.00	0	0	0	907	0.01	0.00	0.00	0	0	0
318	0.01	0.00	0.00	0	0	0	318	0.01	0.00	0.00	0	0	0
641	0.01	0.00	0.00	0	0	0	317	0.01	0.00	0.00	0	0	0
157	0.00	0.00	0.00	0	0	0	316	0.01	0.00	0.00	0	0	0
638	0.01	0.00	0.00	0	0	0	73	0.00	0.00	0.00	0	0	0
637	0.01	0.00	0.00	0	0	0	1006	0.00	0.00	0.00	0	0	0
1609	0.00	0.00	0.00	0	0	0	1417	0.01	0.00	0.00	0	0	0
153	0.00	0.00	0.00	0	0	0	1336	0.01	0.00	0.00	0	0	0
2540	0.01	0.00	0.00	0	0	0	327	0.01	0.00	0.00	0	0	0
615	0.01	0.00	0.00	0	0	0	1957	0.01	0.00	0.00	0	0	0
1435	0.02	0.00	0.00	0	0	0	322	0.00	0.00	0.00	0	0	0
952	0.01	0.00	0.00	0	0	0	323	0.01	0.00	0.00	0	0	0
140	0.02	0.01	0.01	0	0	0	1418	0.01	0.00	0.00	0	0	0
834	0.02	0.00	0.00	0	0	0	2220	0.01	0.00	0.00	0	0	0
632	0.03	0.01	0.01	0	0	0	321	0.01	0.00	0.00	0	0	0
1120	0.01	0.00	0.00	0	0	0	1664	0.05	0.00	0.00	0	0	0
825	0.01	0.00	0.00	0	0	0	2501	0.88	0.68	0.80	100	50	0
826	0.01	0.00	0.00	0	0	0	1660	0.02	0.00	0.00	0	0	0
							328	0.01	0.00	0.00	0	0	0
							285	0.03	0.01	0.01	0	0	0
							89	0.03	0.01	0.01	0	0	0
							1188	0.00	0.00	0.00	0	0	0
							350	0.00	0.00	0.00	0	0	0
							1120	0.01	0.00	0.00	0	0	0
							348	0.02	0.00	0.00	0	0	0
							132	0.01	0.00	0.00	0	0	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.							Sum on 1 analyzed binary loop (exec - 2501)
Analysis						Count	Analysis						Count
							Data Access Issues
							Presence of constant non-unit stride data access						1
							Vectorization Roadblocks
							Presence of constant non-unit stride data access						1
							Out of user code						1

▶BsplineFunctor.h: 291 - 0.92 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/BsplineFunctor.h: 291-298						Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/BsplineFunctor.h: 291-297
291	0.26	0.17	0.20	0	9.38	0	351	0.52	0.39	0.45	90.91	46.02	0
614	0.28	0.20	0.24	0	9.38	0
639	0.05	0.02	0.03	0	9.38	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.							No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis						Count	Analysis						Count

▶MultiBsplineRef.hpp: 276 - 0.70 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/Spline2/MultiBsplineRef.hpp: 276-286						Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Numerics/Spline2/MultiBsplineRef.hpp: 276-286
1020	0.57	0.43	0.50	0	12.5	0	929	0.24	0.17	0.20	100	50	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.							No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis						Count	Analysis						Count

▶TwoBodyJastrowRef.h: 155 - 0.66 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/TwoBodyJastrowRef.h: 155-156						Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/TwoBodyJastrowRef.h: 155-156
285	0.36	0.28	0.33	100	50	0	369	0.17	0.09	0.11	100	50	0
							371	0.15	0.10	0.12	100	50	0
							370	0.15	0.09	0.11	100	50	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.							No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis						Count	Analysis						Count

▶TwoBodyJastrowRef.h: 324 - 0.56 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/TwoBodyJastrowRef.h: 324-331						Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/TwoBodyJastrowRef.h: 324-331
620	0.43	0.28	0.32	0	12.5	0	383	0.28	0.20	0.23	100	50	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.							No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis						Count	Analysis						Count

▶inner_product.hpp: 211 - 0.33 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Platforms/CPU/SIMD/inner_product.hpp: 211-212						Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/Platforms/CPU/SIMD/inner_product.hpp: 211-212
1149	0.22	0.18	0.21	33.33	16.67	0	1012	0.13	0.10	0.12	85.71	41.07	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.							No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis						Count	Analysis						Count

▶BsplineFunctor.h: 246 - 0.17 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/cluster/comp/gcc/14.2.0/include/c++/14.2.0/bits/stl_vector.h: 1147-1147 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/BsplineFunctor.h: 246-260						Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/BsplineFunctor.h: 246-260
383	0.14	0.08	0.09	100	48.46	0	408	0.14	0.07	0.08	100	46.97	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.							No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis						Count	Analysis						Count

▶TwoBodyJastrowRef.h: 381 - 0.10 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/TwoBodyJastrowRef.h: 381-382						Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/TwoBodyJastrowRef.h: 381-382
633	0.09	0.04	0.05	100	50	0	397	0.05	0.02	0.02	100	50	0
							399	0.05	0.01	0.02	100	50	0
							395	0.06	0.01	0.02	100	50	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.							No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis						Count	Analysis						Count

▶stl_numeric.h: 140 - 0.07 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/cluster/comp/gcc/14.2.0/include/c++/14.2.0/bits/stl_numeric.h: 140-141						Loop Source Regions	/usr/lib/gcc/x86_64-redhat-linux/11/../../../../include/c++/11/bits/stl_numeric.h: 140-141 /usr/lib/gcc/x86_64-redhat-linux/11/../../../../include/c++/11/bits/stl_iterator.h: 1182-1182
289	0.07	0.03	0.04	100	50	0	373	0.06	0.03	0.04	100	50	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.							No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis						Count	Analysis						Count

▶BsplineFunctor.h: 305 - 0.05 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/cluster/comp/gcc/14.2.0/include/c++/14.2.0/bits/stl_vector.h: 1147-1147 /home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/BsplineFunctor.h: 305-336						Loop Source Regions
293	0.07	0.03	0.03	99.42	48.77	0
616	0.05	0.02	0.02	99.42	48.77	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.							No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis						Count	Analysis	Count

▶BsplineFunctor.h: 303 - 0.05 %

ASM Loop ID	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3		Run icx_4
Loop Source Regions		Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/BsplineFunctor.h: 303-338
		349	0.08	0.04	0.05	100	48.33	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.		No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis	Count	Analysis						Count

▶OneBodyJastrowRef.h: 192 - 0.04 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/OneBodyJastrowRef.h: 192-193						Loop Source Regions
830	0.05	0.02	0.02	100	50	0
370	0.01	0.01	0.01	100	50	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.							No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis						Count	Analysis	Count

▶stl_algobase.h: 911 - 0.03 %

ASM Loop ID	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3		Run icx_4
Loop Source Regions		Loop Source Regions	/usr/lib/gcc/x86_64-redhat-linux/11/../../../../include/c++/11/bits/stl_algobase.h: 911-912
		1056	0.04	0.03	0.03	100	25	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.		No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis	Count	Analysis						Count

▶stl_algobase.h: 939 - 0.03 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/cluster/comp/gcc/14.2.0/include/c++/14.2.0/bits/stl_algobase.h: 939-940						Loop Source Regions
1185	0.04	0.03	0.03	100	50	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.							No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis						Count	Analysis	Count

▶TwoBodyJastrowRef.h: 393 - 0.03 %

ASM Loop ID	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3		Run icx_4
Loop Source Regions		Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/TwoBodyJastrowRef.h: 393-398
		391	0.05	0.02	0.03	0	12.5	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.		No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis	Count	Analysis						Count

▶stl_algobase.h: 923 - 0.03 %

ASM Loop ID	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3		Run icx_4
Loop Source Regions		Loop Source Regions	/usr/lib/gcc/x86_64-redhat-linux/11/../../../../include/c++/11/bits/stl_algobase.h: 923-924
		335	0.04	0.02	0.03	100	50	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.		No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis	Count	Analysis						Count

▶TwoBodyJastrowRef.h: 397 - 0.01 %

ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s	ASM Loop ID	GFLOP/s
Run gcc_3							Run icx_4
Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/TwoBodyJastrowRef.h: 397-398						Loop Source Regions
631	0.04	0.01	0.01	100	50	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.							No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis						Count	Analysis	Count

▶OneBodyJastrowRef.h: 186 - 0.01 %

ASM Loop ID	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3		Run icx_4
Loop Source Regions		Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/OneBodyJastrowRef.h: 186-187
		289	0.03	0.01	0.01	100	50	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.		No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis	Count	Analysis						Count

▶TwoBodyJastrowRef.h: 375 - 0.01 %

ASM Loop ID	GFLOP/s	ASM Loop ID	Max Time Over Threads (s)	Time w.r.t. Wall Time (s)	Cov (%)	Vect. Ratio (%)	Vector Length Use (%)	GFLOP/s
Run gcc_3		Run icx_4
Loop Source Regions		Loop Source Regions	/home/eoseret/qaas_runs_GNR/173-773-5521/intel/miniqmc/build/miniqmc/src/QMCWaveFunctions/Jastrow/TwoBodyJastrowRef.h: 375-376
		401	0.03	0.01	0.01	100	50	0

No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.		No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
Analysis	Count	Analysis						Count

Report Configuration

Loops

▶MultiBsplineRef.hpp: 68 - 39.19 %

▶SoaDistanceTableAAOMPTarget.h: 440 - 31.19 %

▶MultiBsplineRef.hpp: 242 - 23.39 %

▶SoaDistanceTableABOMPTarget.h: 228 - 14.73 %

▶BsplineFunctor.h: 236 - 4.09 %

▶inner_product.hpp: 155 - 2.63 %

▶ParticleBConds3DSoa.h: 235 - 1.48 %

▶TwoBodyJastrowRef.h: 342 - 1.42 %

▶ParticleBConds3DSoa.h: 237 - 1.37 %

▶einspline_spo_ref.hpp: 223 - 1.21 %

▶inner_product.hpp: 82 - 1.21 %

▶<unknown>: 0 - 0.98 %

▶BsplineFunctor.h: 291 - 0.92 %

▶MultiBsplineRef.hpp: 276 - 0.70 %

▶TwoBodyJastrowRef.h: 155 - 0.66 %

▶TwoBodyJastrowRef.h: 324 - 0.56 %

▶inner_product.hpp: 211 - 0.33 %

▶BsplineFunctor.h: 246 - 0.17 %

▶TwoBodyJastrowRef.h: 381 - 0.10 %

▶stl_numeric.h: 140 - 0.07 %

▶BsplineFunctor.h: 305 - 0.05 %

▶BsplineFunctor.h: 303 - 0.05 %

▶OneBodyJastrowRef.h: 192 - 0.04 %

▶stl_algobase.h: 911 - 0.03 %

▶stl_algobase.h: 939 - 0.03 %

▶TwoBodyJastrowRef.h: 393 - 0.03 %

▶stl_algobase.h: 923 - 0.03 %

▶TwoBodyJastrowRef.h: 397 - 0.01 %

▶OneBodyJastrowRef.h: 186 - 0.01 %

▶TwoBodyJastrowRef.h: 375 - 0.01 %