__aarch64_ldadd8_acq_rel - Load Distribution

minmedavgmax
Percentile Index10 20 30 40 50 60 70 80 90 100
Value


minmedavgmax
Percentile Index10 20 30 40 50 60 70 80 90 100
Value

__aarch64_ldadd8_acq_rel - Sorted Load Distribution

minmedavgmax
Percentile Index10 20 30 40 50 60 70 80 90 100
Value


minmedavgmax
Percentile Index10 20 30 40 50 60 70 80 90 100
Value

__aarch64_ldadd8_acq_rel - Load Distribution All Threads

minmedavgmax
Percentile Index10 20 30 40 50 60 70 80 90 100
Value


minmedavgmax
Percentile Index10 20 30 40 50 60 70 80 90 100
Value

__aarch64_ldadd8_acq_rel

Columns Filter

(1x1) Efficiency (1x1) Potential Speed-Up (%) (1x2) Efficiency (1x2) Potential Speed-Up (%) (1x4) Efficiency (1x4) Potential Speed-Up (%) (1x8) Efficiency (1x8) Potential Speed-Up (%) (1x16) Efficiency (1x16) Potential Speed-Up (%) (1x24) Efficiency (1x24) Potential Speed-Up (%) (1x32) Efficiency (1x32) Potential Speed-Up (%) (1x40) Efficiency (1x40) Potential Speed-Up (%) (1x48) Efficiency (1x48) Potential Speed-Up (%) (1x56) Efficiency (1x56) Potential Speed-Up (%) (1x64) Efficiency (1x64) Potential Speed-Up (%)
(1x1) Efficiency(1x1) Potential Speed-Up (%)(1x2) Efficiency(1x2) Potential Speed-Up (%)(1x4) Efficiency(1x4) Potential Speed-Up (%)(1x8) Efficiency(1x8) Potential Speed-Up (%)(1x16) Efficiency(1x16) Potential Speed-Up (%)(1x24) Efficiency(1x24) Potential Speed-Up (%)(1x32) Efficiency(1x32) Potential Speed-Up (%)(1x40) Efficiency(1x40) Potential Speed-Up (%)(1x48) Efficiency(1x48) Potential Speed-Up (%)(1x56) Efficiency(1x56) Potential Speed-Up (%)(1x64) Efficiency(1x64) Potential Speed-Up (%)
1020201.3307.960

Speed-Up and Efficiency

Coverage (%)NameSource LocationModule
100.00+void armpl::clag::compute_impl[...]libarmpl_lp64.so
void armpl::clag::gemm<true, i[...]libarmpl_lp64.so
qmcplusplus::DelayedUpdate<dou[...]BLAS.hpp:249exec
void qmcplusplus::DelayedUpdat[...]DelayedUpdate.h:153exec
miniqmcreference::DiracDetermi[...]DiracDeterminantRef.cpp:129exec
qmcplusplus::WaveFunction::acc[...]stl_vector.h:993exec
main.omp_outlined.62miniqmc.cpp:450exec
__kmp_invoke_microtasklibomp.so
__kmp_fork_calllibomp.so
__kmpc_fork_calllibomp.so
mainminiqmc.cpp:409exec
__libc_start_call_mainlibc.so.6
__libc_start_mainlibc.so.6
_startnew_allocator.h:172exec
Coverage (%)NameSource LocationModule
100.00+void armpl::clag::compute_impl[...]libarmpl_lp64.so
void armpl::clag::gemm<true, i[...]libarmpl_lp64.so
qmcplusplus::DelayedUpdate<dou[...]DelayedUpdate.h:192exec
void qmcplusplus::DelayedUpdat[...]DelayedUpdate.h:153exec
miniqmcreference::DiracDetermi[...]DiracDeterminantRef.cpp:129exec
qmcplusplus::WaveFunction::acc[...]stl_vector.h:993exec
main.omp_outlined.62miniqmc.cpp:450exec
__kmp_invoke_microtasklibomp.so
__kmp_invoke_task_funclibomp.so
__kmp_launch_threadlibomp.so
__kmp_launch_worker(void*)libomp.so
start_threadlibc.so.6
thread_startlibc.so.6
Coverage (%)NameSource LocationModule
100.00+void armpl::clag::compute_impl[...]libarmpl_lp64.so
void armpl::clag::gemm<true, i[...]libarmpl_lp64.so
qmcplusplus::DelayedUpdate<dou[...]BLAS.hpp:249exec
void qmcplusplus::DelayedUpdat[...]DelayedUpdate.h:153exec
miniqmcreference::DiracDetermi[...]DiracDeterminantRef.cpp:129exec
qmcplusplus::WaveFunction::acc[...]stl_vector.h:993exec
main.omp_outlined.62miniqmc.cpp:450exec
__kmp_invoke_microtasklibomp.so
__kmp_invoke_task_funclibomp.so
__kmp_launch_threadlibomp.so
__kmp_launch_worker(void*)libomp.so
start_threadlibc.so.6
thread_startlibc.so.6
Coverage (%)NameSource LocationModule
83.33+void armpl::clag::compute_impl[...]libarmpl_lp64.so
void armpl::clag::gemm<true, i[...]libarmpl_lp64.so
qmcplusplus::DelayedUpdate<dou[...]BLAS.hpp:249exec
void qmcplusplus::DelayedUpdat[...]DelayedUpdate.h:153exec
miniqmcreference::DiracDetermi[...]DiracDeterminantRef.cpp:129exec
qmcplusplus::WaveFunction::acc[...]stl_vector.h:993exec
main.omp_outlined.62miniqmc.cpp:450exec
__kmp_invoke_microtasklibomp.so
__kmp_invoke_task_funclibomp.so
__kmp_launch_threadlibomp.so
__kmp_launch_worker(void*)libomp.so
start_threadlibc.so.6
thread_startlibc.so.6
16.67+void armpl::clag::compute_impl[...]libarmpl_lp64.so
void armpl::clag::gemm<true, i[...]libarmpl_lp64.so
qmcplusplus::DelayedUpdate<dou[...]BLAS.hpp:249exec
void qmcplusplus::DelayedUpdat[...]DelayedUpdate.h:153exec
miniqmcreference::DiracDetermi[...]DiracDeterminantRef.cpp:129exec
qmcplusplus::WaveFunction::acc[...]stl_vector.h:993exec
main.omp_outlined.62miniqmc.cpp:450exec
__kmp_invoke_microtasklibomp.so
__kmp_invoke_task_funclibomp.so
__kmp_fork_calllibomp.so
__kmpc_fork_calllibomp.so
mainminiqmc.cpp:409exec
__libc_start_call_mainlibc.so.6
__libc_start_mainlibc.so.6
_startnew_allocator.h:172exec
Coverage (%)NameSource LocationModule
100.00+void armpl::clag::compute_impl[...]libarmpl_lp64.so
void armpl::clag::gemm<true, i[...]libarmpl_lp64.so
qmcplusplus::DelayedUpdate<dou[...]BLAS.hpp:249exec
void qmcplusplus::DelayedUpdat[...]DelayedUpdate.h:153exec
miniqmcreference::DiracDetermi[...]DiracDeterminantRef.cpp:129exec
qmcplusplus::WaveFunction::acc[...]stl_vector.h:993exec
main.omp_outlined.62miniqmc.cpp:450exec
__kmp_invoke_microtasklibomp.so
__kmp_invoke_task_funclibomp.so
__kmp_launch_threadlibomp.so
__kmp_launch_worker(void*)libomp.so
start_threadlibc.so.6
thread_startlibc.so.6
No callchains for this object
No callchains for this object
No callchains for this object
No callchains for this object
No callchains for this object
No callchains for this object
×