mesa icon indicating copy to clipboard operation
mesa copied to clipboard

worse performance vs. llvm & amdvlk in compute shader raytracing synthie

Open aufkrawall opened this issue 6 years ago • 5 comments

The 8x8 register version scores ~700fps minimum with llvm, while it's ~640fps with aco. Latest amdvlk-open scores ~900fps. This is on RX 570. I tried to create a RD capture, but it didn't work (neither on Windows with 32 bit RD version).

Binaries are attached. They are runtime packed and thus like to trigger AVs, which is why there is password protection for the archive (PW "csrt"). csrt.zip You need DXVK and perhaps other lib32 dependencies.

aufkrawall avatar Jul 29 '19 14:07 aufkrawall

This zip file looks to be malformed or using some weird compression.

Could you upload tar.gz instead please?

baryluk avatar Oct 04 '19 22:10 baryluk

csrt.tar.gz

pendingchaos avatar Oct 04 '19 22:10 pendingchaos

I can run the application and can reproduce the performance issue

pendingchaos avatar Oct 04 '19 22:10 pendingchaos

For 8x8 version, using Mesa master branch (git-ce76be9933, plus some extra not yet merged NIR patches) with LLVM 10~svn372920, Wine Staging 4.12.1, DXVK 1.4.1, and MS' d3dcompiler_43.dll, AMD Fury X (FIJI), I am getting 900 fps for both LLVM and ACO. LLVM is however slightly faster, ACO takes about 1.12ms on average per frame, and LLVM takes about 1.08ms per frame.

csrt

baryluk avatar Oct 04 '19 23:10 baryluk

NIR's GCM optimization pass (even with global value numbering disabled) seems to help.

pendingchaos avatar Jan 30 '20 14:01 pendingchaos

Hi,

This repo is no longer being maintained since ACO was merged in upstream Mesa since late 2019. If you have issues with it, please open an issue in upstream Mesa here: https://gitlab.freedesktop.org/mesa/mesa/-/issues - select the Radeon Vulkan bug report template and fill out the details.

Venemo avatar May 17 '23 15:05 Venemo