worse performance vs. llvm & amdvlk in compute shader raytracing synthie
The 8x8 register version scores ~700fps minimum with llvm, while it's ~640fps with aco. Latest amdvlk-open scores ~900fps. This is on RX 570. I tried to create a RD capture, but it didn't work (neither on Windows with 32 bit RD version).
Binaries are attached. They are runtime packed and thus like to trigger AVs, which is why there is password protection for the archive (PW "csrt"). csrt.zip You need DXVK and perhaps other lib32 dependencies.
This zip file looks to be malformed or using some weird compression.
Could you upload tar.gz instead please?
I can run the application and can reproduce the performance issue
For 8x8 version, using Mesa master branch (git-ce76be9933, plus some extra not yet merged NIR patches) with LLVM 10~svn372920, Wine Staging 4.12.1, DXVK 1.4.1, and MS' d3dcompiler_43.dll, AMD Fury X (FIJI), I am getting 900 fps for both LLVM and ACO. LLVM is however slightly faster, ACO takes about 1.12ms on average per frame, and LLVM takes about 1.08ms per frame.

NIR's GCM optimization pass (even with global value numbering disabled) seems to help.
Hi,
This repo is no longer being maintained since ACO was merged in upstream Mesa since late 2019. If you have issues with it, please open an issue in upstream Mesa here: https://gitlab.freedesktop.org/mesa/mesa/-/issues - select the Radeon Vulkan bug report template and fill out the details.