ParallelStencil.jl
ParallelStencil.jl copied to clipboard
[JuliaCon/proceedings-review] Performance metrics
Hi all,
q1) what is the reason behind focusing on T_eff and not on Gpts/s as commonly used in papers reporting stencil performance?
q2) Figure 2 shows that using the math-close notation, performance slightly drops compared to explicitly expressing the stencil computation. Where is this slowdown coming from?