Perfect OpenMP/MPI/Pthread/TBB + Perfect Load Distribution
1.05
1.05
1.06
1.05
1.06
No Scalar Integer
Potential Speedup
1.00
1.00
1.00
1.00
1.00
Nb Loops to get 80%
2
2
1
1
2
FP Vectorised
Potential Speedup
1.00
1.01
1.01
1.00
1.00
Nb Loops to get 80%
2
3
4
2
1
Fully Vectorised
Potential Speedup
1.02
1.01
1.02
1.01
1.00
Nb Loops to get 80%
5
5
6
5
1
Only FP Arithmetic
Potential Speedup
1.01
1.00
1.00
1.01
1.01
Nb Loops to get 80%
4
3
4
4
4
Cumulated Speedup If No Scalar Integer
Cumulated Speedup If FP Vectorized
Cumulated Speedup If Fully Vectorized
Cumulated Speedup If Only FP Arithmetic
Loop Based Profiles
Innermost / Single Loops
Inbetween Loops
Outermost Loops
Cumulated Coverage With All Loops
Innermost Loop Based Profiles
Coverage
Count
Application Categorization
Time
Coverage
Compilation Options
Source Object
Issue
▼exec–
▼init_us_2_acc.f90–
○
For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches.
▼vloc_psi.f90–
○
For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches.
▼fft_scatter_2d.f90–
○
For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches.
▼sort.f90–
○
For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches.
▼usnldiag.f90–
○
For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches.
Source Object
Issue
▼exec–
▼thread_util.f90-pp.f90–
○
-march=(target) is missing.
▼usnldiag.f90-pp.f90–
○
-march=(target) is missing.
▼vloc_psi.f90-pp.f90–
○
-march=(target) is missing.
▼sort.f90–
○
-march=(target) is missing.
▼fft_scatter_2d.f90–
○
-march=(target) is missing.
▼init_us_2_acc.f90–
○
-march=(target) is missing.
Source Object
Issue
▼exec–
▼thread_util.f90–
○
-march=x86-64 is used but it should be replaced by a more architecture specific option or -march=native.
○
-funroll-loops is missing.
▼init_us_2_acc.f90–
○
-march=x86-64 is used but it should be replaced by a more architecture specific option or -march=native.
○
-funroll-loops is missing.
▼fft_scatter_2d.f90–
○
-march=x86-64 is used but it should be replaced by a more architecture specific option or -march=native.
○
-funroll-loops is missing.
▼vloc_psi.f90–
○
-march=x86-64 is used but it should be replaced by a more architecture specific option or -march=native.
○
-funroll-loops is missing.
▼usnldiag.f90–
○
-march=x86-64 is used but it should be replaced by a more architecture specific option or -march=native.
○
-funroll-loops is missing.
▼make_pointlists.f90–
○
-march=x86-64 is used but it should be replaced by a more architecture specific option or -march=native.
○
-funroll-loops is missing.
▼sort.f90–
○
-march=x86-64 is used but it should be replaced by a more architecture specific option or -march=native.
○
-funroll-loops is missing.
Source Object
Issue
▼exec–
▼init_us_2_acc.f90–
○
For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches.
▼vloc_psi.f90–
○
For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches.
▼usnldiag.f90–
○
For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches.
▼sort.f90–
○
For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches.
▼fft_scatter_2d.f90–
○
For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches.