Help is available by moving the cursor above any symbol or by checking MAQAO website.
▶Filter Information
30 threads covering less than 1% of profiled time ( = Max (Thread Active Time)) were discarded, cumulating 0.37 seconds CPU time. You can adjust the threshold below which a thread will be discarded with the thread-filter-threshold option.
Global Metrics
Total Time (s)
38.29
Max (Thread Active Time) (s)
32.04
Average Active Time (s)
31.94
Activity Ratio (%)
83.4
Average number of active threads
53.385
Affinity Stability
27.2
Time in analyzed loops (%)
95.0
Time in analyzed innermost loops (%)
72.8
Time in user code (%)
95.0
Compilation Options Score (%)
100
Array Access Efficiency (%)
55.9
Potential Speedups
Perfect Flow Complexity
1.00
Perfect OpenMP + MPI + Pthread
1.03
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution
1.06
No Scalar Integer
Potential Speedup
1.74
Nb Loops to get 80%
2
FP Vectorised
Potential Speedup
1.44
Nb Loops to get 80%
2
Fully Vectorised
Potential Speedup
3.32
Nb Loops to get 80%
5
FP Arithmetic Only
Potential Speedup
1.47
Nb Loops to get 80%
2
CQA Potential Speedups Summary
Loop Based Profile⏎
Innermost Loop Based Profile⏎
Application Categorization⏎
Compilation Options⏎
Source Object
Issue
▼lbc–
○lb_init.F90
○mpl_set.F90
○tools.F90
○lbc.F90
○lbm_functions.F90
Loop Path Count Profile⏎
Cumulated Speedup If No Scalar Integer⏎
Cumulated Speedup If FP Vectorized⏎
Cumulated Speedup If Fully Vectorized⏎
Cumulated Speedup If FP Arithmetic Only⏎
Experiment Summary
Application
./../lbc/lbc
Timestamp
2024-11-27 14:51:00
Universal Timestamp
1732719060
Number of processes observed
64
Number of threads observed
64
Experiment Type
MPI; OpenMP;
Machine
ip-172-31-42-13
Architecture
aarch64
Micro Architecture
ARM_NEOVERSE_V1
OS Version
Linux 6.8.0-1016-aws #17~22.04.2-Ubuntu SMP Thu Sep 26 18:55:31 UTC 2024