webml-polyfill icon indicating copy to clipboard operation
webml-polyfill copied to clipboard

[DML] optimize the usage of UAV barriers

Open huningxin opened this issue 5 years ago • 0 comments

According to UAV barriers and resource state barriers in DirectML example 2, ops without data dependency can execute concurrently on the GPU, which may improve performance. In current DML backend, the UAV barrier is inserted after each op execution without checking the data dependency which can be improved.

huningxin avatar Oct 08 '19 06:10 huningxin