[Issue]: Performance of 9070xt with ComfyUI
Problem Description
I'm using the default ComfyUI SDXL workflow on Ubuntu 24.04 with the proprietary AMD drivers. The same happens on Fedora 42.
Issues:
- Using the default launch settings, the generation crashes with OOM errors
- Using tweaked settings, generation passes but is very slow
Results:
Clean Ubuntu 24.04 installation, AMD proprietary drivers, Python 3.10.17, PyTorch nightly, ROCm 6.4.1
export TORCH_COMMAND="--pre torch torchvision torchaudio pytorch-triton-rocm --index-url https://download.pytorch.org/whl/nightly/rocm6.4"
python ./main.py
| Card | Model | Steps | Resolution | Time | Notes |
|---|---|---|---|---|---|
| 6900xt | SD1.5 | 20 | 512x512 | 2.42s | ROCm 6.3 |
| 9070xt | SD1.5 | 20 | 512x512 | 3.76s | |
| 6900xt | SDXL | 20 | 1024x1024 | 15.16s | ROCm 6.3 |
| 9070xt | SDXL | 20 | 1024x1024 | FAIL | Crashed with out of memory |
| 9070xt | SDXL | 20 | 1024x1024 | 30.51s | Used tiled VAE decoder to avoid OOM failure |
Results 2:
Clean Ubuntu 24.04 installation, AMD proprietary drivers, Python 3.10.17, PyTorch nightly, ROCm 6.4.1
export PYTORCH_TUNABLEOP_ENABLED=1
export TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1
export TORCH_COMMAND="--pre torch torchvision torchaudio pytorch-triton-rocm --index-url https://download.pytorch.org/whl/nightly/rocm6.4"
python ./main.py --use-pytorch-cross-attention
| Model | Steps | Resolution | Speed | Time | Notes |
|---|---|---|---|---|---|
| SDXL | 20 | 1024x1024 | 1.49it/s | 34.56s | |
| SDXL | 20 | 1024x1024 | 1.5it/s | 27.76s | Manual tiled VAE decoder |
Operating System
Ubuntu 24.04
CPU
AMD 7950x
GPU
Radeon RX 9070xt
ROCm Version
6.4.1
ROCm Component
No response
Steps to Reproduce
- Install Ubuntu 24.04
- Install AMD proprietary drivers with ROCm
- Install Python 3.10.17
- Clone ComfyUI
- Start ComfyUI
- Use default workflow for SDXL
(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support
No response
Additional Information
No response
Hi @alshdavid. Internal ticket has been created to investigate this issue. Thanks!
Can confirm this is happening to me as well on a workflow that used to work on a 6700.
Also related: https://github.com/comfyanonymous/ComfyUI/issues/7332
Prepend MIOPEN_FIND_MODE=2 to your comfyui command.
@Matthew-Jenkins, it appears to make no difference, unfortunately:
With (baseline):
export PYTORCH_TUNABLEOP_ENABLED=1
export TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1
export TORCH_COMMAND="--pre torch torchvision torchaudio pytorch-triton-rocm --index-url https://download.pytorch.org/whl/nightly/rocm6.4"
python3.10 ./main.py --use-pytorch-cross-attention
| Model | Steps | Resolution | Speed | Time | Notes |
|---|---|---|---|---|---|
| SDXL | 20 | 1024x1024 | 1.49it/s | 34.56s | |
| SDXL | 20 | 1024x1024 | 1.5it/s | 27.76s | Manual tiled VAE decoder |
With:
export MIOPEN_FIND_MODE=2
export PYTORCH_TUNABLEOP_ENABLED=1
export TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1
export TORCH_COMMAND="--pre torch torchvision torchaudio pytorch-triton-rocm --index-url https://download.pytorch.org/whl/nightly/rocm6.4"
python3.10 ./main.py --use-pytorch-cross-attention
| Model | Steps | Resolution | Speed | Time | Notes |
|---|---|---|---|---|---|
| SDXL | 20 | 1024x1024 | 1.46it/s | 34.74s | |
| SDXL | 20 | 1024x1024 | 1.48it/s | 27.88s | Manual tiled VAE decoder |
I also tried to use the MIGraphX node however it either doesn't work with SDXL or there is an issue with my configuration. Raised an issue on their repo: https://github.com/pnikolic-amd/ComfyUI_MIGraphX/issues/5
9070 support wasn't added until 6.4.1. You'll have problems until the PyTorch nightly updates to use it. The HSA override can make it run as if it were a previous generation card.
The next thing to try is HSA_OVERRIDE_GFX_VERSION. Try MIOPEN_FIND_MODE=2 HSA_OVERRIDE_GFX_VERSION=11.0.0
if that doesn't work then try
MIOPEN_FIND_MODE=2 HSA_OVERRIDE_GFX_VERSION=10.3.0
Until PyTorch updates to 6.4.1 you will not get any benefit from the AI cores. But once it does you can expect about 1.2 TFLOPs for i4 ops.
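Spelled out against the launch command used earlier in this thread, the two suggestions above would look like this (a sketch; whether either override value helps, if at all, depends on the setup):

```sh
# Try the RDNA3 override first, then the RDNA2 one if that fails.
MIOPEN_FIND_MODE=2 HSA_OVERRIDE_GFX_VERSION=11.0.0 python ./main.py --use-pytorch-cross-attention
# MIOPEN_FIND_MODE=2 HSA_OVERRIDE_GFX_VERSION=10.3.0 python ./main.py --use-pytorch-cross-attention
```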
Thanks for the tips
Try MIOPEN_FIND_MODE=2 HSA_OVERRIDE_GFX_VERSION=11.0.0, if that doesn't work then try MIOPEN_FIND_MODE=2 HSA_OVERRIDE_GFX_VERSION=10.3.0
Just tried both of these, unfortunately no luck there.
Until pytorch updates to 6.4.1 you will not get any benefit from the ai cores.
Ah, well that's good news. Will keep an eye out on pytorch's progress
@alshdavid you need this for comfyui to work correctly https://github.com/comfyanonymous/ComfyUI/pull/8289
I tried your commit with SDXL and VAE Decoder (Tiled) still performs much worse than it did on my old GPU (6700). Is that the same on your end?
Flag used: --use-pytorch-cross-attention
Edit: Never mind, I had to turn off the experimental feature. But somehow the output image is gibberish with weird circular patterns.
Edit 2: Using the newest torch build from PyTorch instead of AMD fixed the weird circular patterns issue.
For convolution workloads, Winograd solvers in MIOpen would help a lot; I asked about those here: https://github.com/ROCm/MIOpen/issues/3750. Another issue is that the default GEMM tuning for gfx1201 seems not to be optimal for some workloads. And there are probably a few other areas that could improve.
With the ComfyUI changes I mentioned, TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1, and a PyTorch install from https://repo.radeon.com/rocm/manylinux/rocm-rel-6.4.1/ it should work "reasonably", meaning there is still a lot of performance left on the table, but it's not completely broken anymore...
Using https://rocm.docs.amd.com/projects/hipBLASLt/en/latest/how-to-use-hipblaslt-offline-tuning.html can increase perf 2x in some cases.
I'm having the same problem using pytorch-rocm with Chainner. With my 6800XT, an inference took 17 seconds, while the same takes 28 seconds with the 9070XT.
Hey @kasper93, regarding this:
and pytroch install from https://repo.radeon.com/rocm/manylinux/rocm-rel-6.4.1/ it should work "reasonable"
Are there docs on how to install PyTorch from here?
I have no idea what pip install is doing or how Python resolves its dependencies so I don't know how to pick the archives and manually reproduce the pip installation. I've been using the pip3 install command from pytorch.org, which I assume installs the packages into some global packages folder in the venv folder. Do you just download & extract the archives there?
I tried the following but that doesn't work:
pip3 install https://repo.radeon.com/rocm/manylinux/rocm-rel-6.4.1/pytorch_triton_rocm-3.2.0%2Brocm6.4.1.git6da9e660-cp39-cp39-linux_x86_64.whl
Also, how do I know that ComfyUI is using those packages rather than the previously installed ones from pytorch.org?
Apparently, PyTorch just added support for ROCm 6.4.1 but either I am installing it wrong or it doesn't appear to make any difference.
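One way to answer the "which torch is ComfyUI actually using" question is to ask the same interpreter/venv that launches main.py (a minimal sketch; torch.version.hip is only populated on ROCm builds):

```sh
# Run from the venv that starts ComfyUI; prints the torch version, the ROCm/HIP
# version it was built against, and the directory it was imported from.
python3 -c "import torch; print(torch.__version__, torch.version.hip, torch.__file__)"
```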
@alshdavid: Keep expectations low, it's still slow in general. It works, but it seems you have overenthusiastic expectations for it.
See my comment https://github.com/ROCm/ROCm/issues/4846#issuecomment-2952494361
Also, if your workload is only SD, AMD recently updated the MIGraphX node for ComfyUI: https://github.com/pnikolic-amd/ComfyUI_MIGraphX. After compiling the model with it, you will get significantly higher perf, which may meet your expectations.
Are there docs on how to install PyTorch from here?
https://rocm.docs.amd.com/projects/radeon/en/latest/docs/install/native_linux/install-pytorch.html
But if upstream nightly PyTorch supports the current version of ROCm, you can use that directly.
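For reference, the upstream nightly route boils down to installing into the ComfyUI venv with the same index URL used earlier in this thread (a sketch, not AMD's documented flow):

```sh
# Inside the venv that runs ComfyUI; pulls the nightly ROCm 6.4 wheels from pytorch.org.
pip3 install --pre torch torchvision torchaudio pytorch-triton-rocm \
  --index-url https://download.pytorch.org/whl/nightly/rocm6.4
```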
Not directly related but I just tried ComfyUI under Windows + latest HIP + Zluda on my 9070xt and performance is about the same as ROCm under Linux.
Generating an SDXL image with 1024x1024 resolution and 20 steps takes ~31 seconds (same as ROCm under Linux).
No OOM errors though, which is nice.
So this is also a Windows issue and not necessarily Rocm related?! I'm honestly amazed at how bad software support is when it comes to AI. The cards were released months ago now. Clearly this is my last AMD GPU....
So this is also a Windows issue and not necessarily Rocm related?!
This occurs both on Windows and Linux. This thread has some interesting insights. It also looks like running an LLM using a Vulkan backend on the 9070xt shows a lot of potential so there's reason to be optimistic.
Looks like the ROCm 7.0 RC1 was cut a few hours ago so here's hoping that improves things. If I figure out how to install it (installing AI tools is madness) I'll report back.
Clearly this is my last AMD GPU....
I go back and forth on this too. I daily drive Linux and AMD GPUs have the best Linux support - otherwise I'd have gotten an nvidia 5080. It's quite frustrating that AMD's support for their own AI APIs is so poor.
The recent "AI Max" APU with up to 96gb of ram sounds perfect for low cost/power efficient local LLM inference and perhaps even training/fine-tuning however it doesn't even support ROCm. It literally has AI in the name and can't run any AI workloads.
Well, FSR4 is not officially supported yet... DLSS4 is... so..... But yeah, AMD performs better than NV on Linux. Still, this shouldn't happen! AMD promised much better performance with the 9000 series for AI workloads, and it's actually worse than previous gens and even makes the display server crash after LOTS of stuttering.
Hello,
I have zero knowledge of how this works, but I can share the workaround that worked for me.
In ComfyUI, use the following nodes: https://github.com/pollockjj/ComfyUI-MultiGPU
Completely restart the server
In your workflow, use the comfyui-multigpu VAE loader and set the VAE device to the CPU...
I went from 560 seconds of VAE decode on the GPU (with the whole computer stuttering like a 2000's PC trying to play a 4K movie) to a smooth 35-40 seconds for VAE decoding on the CPU (that's the speed for 2 pics at 1024 resolution).
I have just tried installing ROCm 7 beta on Fedora 42 via the RHEL repos and retried generating a 1024x1024 image with SDXL on ComfyUI.
Relatively easy to install: https://www.youtube.com/watch?v=7qDlHpeTmC0 https://gist.github.com/B4rr3l-Rid3r/b03460860f2841144135c0fe8bede5be
....And it appears to perform identically to ROCm 6.4 on Linux. So no performance improvement and still slower than my 6900xt 🙄
Thank you for sharing your detailed analysis and benchmarks! I'm experiencing almost identical performance issues with my 9070XT in WSL - also getting around 30-31 seconds for 1024x1024 SDXL generation with 20 steps, which matches your results exactly.
It's somewhat reassuring to know this appears to be a common issue rather than something specific to my setup. The consistency of our results suggests this is indeed a broader compatibility or optimization problem that will likely require AMD's attention to resolve.
I appreciate you taking the time to document and report this issue. Hopefully AMD can address these performance concerns in future ROCm updates, as the 9070XT should theoretically perform much better than what we're currently seeing.
I'm interested in side-by-side benchmarks for the RX 9070 XT with ROCm vs Vulkan. If anyone could submit benchmarks with the same model but different quants (f16, q8, q4, iq4), that would be very interesting to me. I've been on the fence about buying a 9070 to replace my 6900 XT.
I'm interested in Vulkan benchmarks too. I've seen some LLMs perform incredibly well on the 9070xt under Vulkan, which gives me hope.
I've been trying to compile Pytorch to use a Vulkan backend but haven't been able to figure out their build process. The Vulkan backend is normally used for Android devices but IMO it would be helpful for getting GPU acceleration on under/unsupported devices (RDNA4, Strix Halo APUs, Intel GPUs, Snapdragon GPUs, etc).
I raised an issue on the pytorch repo asking them to release officially supported prebuilds of pytorch with a Vulkan backend https://github.com/pytorch/pytorch/issues/160230#issuecomment-3186343745
IMO it would just be a placeholder anyway. 9000 series have dedicated AI capabilities that should, if software support wasn't abysmally bad, make them much faster than using Vulkan. At that point I'm just considering selling my card and going back to team green. The release was 6 months ago! It feels like a bad joke considering that part of the marketing of these cards was based on their AI capabilities, but they are still basically unusable.
What makes matters worse is that my experiments with ROCm 7 don't seem to improve performance on my 9070xt. So much potential in these cards to kick ass, but it doesn't look like we will see that materialize in the next 6 months.
I thought about switching to team green too, but the 5080 doesn't seem worth the money with only 16 GB of VRAM. In the meantime, I've been renting an Nvidia-powered VPS for ML workloads on-demand. It's like 60 cents an hour.
If the 5080 Ti/Super comes out with 32 GB then it's a no-brainer.
IMO it would just be a placeholder anyway. 9000 series have dedicated AI capabilities that should, if software support wasn't abysmally bad, make them much faster than using Vulkan. At that point I'm just considering selling my card and going back to team green. The release was 6 months ago! It feels like a bad joke considering that part of the marketing of these cards was based on their AI capabilities, but they are still basically unusable.
Vulkan shaders already run on the AI cores.
I just tested official ROCm 7.0 with the ROCm fork of Pytorch
- Ubuntu 24.04
- Python 3.12.11
- ROCm 7.0 (Installed from AMD)
- Pytorch 2.8.0 (Installed from AMD)
- ComfyUI
- SDXL
- Image Size 1024 x 1024
Args: python main.py
Render time: 168s
Notes: Sometimes throws error: HIP error: an illegal memory access was encountered
Full Error
got prompt
!!! Exception during processing !!! HIP error: an illegal memory access was encountered
HIP kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing AMD_SERIALIZE_KERNEL=3
Compile with `TORCH_USE_HIP_DSA` to enable device-side assertions.
Traceback (most recent call last):
File "/home/dalsh/MachineLearning/ComfyUI/execution.py", line 496, in execute
output_data, output_ui, has_subgraph, has_pending_tasks = await get_output_data(prompt_id, unique_id, obj, input_data_all, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb, hidden_inputs=hidden_inputs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/dalsh/MachineLearning/ComfyUI/execution.py", line 315, in get_output_data
return_values = await _async_map_node_over_list(prompt_id, unique_id, obj, input_data_all, obj.FUNCTION, allow_interrupt=True, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb, hidden_inputs=hidden_inputs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/dalsh/MachineLearning/ComfyUI/execution.py", line 289, in _async_map_node_over_list
await process_inputs(input_dict, i)
File "/home/dalsh/MachineLearning/ComfyUI/execution.py", line 277, in process_inputs
result = f(**inputs)
^^^^^^^^^^^
File "/home/dalsh/MachineLearning/ComfyUI/nodes.py", line 1525, in sample
return common_ksampler(model, seed, steps, cfg, sampler_name, scheduler, positive, negative, latent_image, denoise=denoise)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/dalsh/MachineLearning/ComfyUI/nodes.py", line 1484, in common_ksampler
noise = comfy.sample.prepare_noise(latent_image, seed, batch_inds)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/dalsh/MachineLearning/ComfyUI/comfy/sample.py", line 13, in prepare_noise
generator = torch.manual_seed(seed)
^^^^^^^^^^^^^^^^^^^^^^^
File "/home/dalsh/MachineLearning/.local/python/3.12.11/lib/python3.12/site-packages/torch/_compile.py", line 53, in inner
return disable_fn(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/dalsh/MachineLearning/.local/python/3.12.11/lib/python3.12/site-packages/torch/_dynamo/eval_frame.py", line 929, in _fn
return fn(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^
File "/home/dalsh/MachineLearning/.local/python/3.12.11/lib/python3.12/site-packages/torch/random.py", line 46, in manual_seed
torch.cuda.manual_seed_all(seed)
File "/home/dalsh/MachineLearning/.local/python/3.12.11/lib/python3.12/site-packages/torch/cuda/random.py", line 131, in manual_seed_all
_lazy_call(cb, seed_all=True)
File "/home/dalsh/MachineLearning/.local/python/3.12.11/lib/python3.12/site-packages/torch/cuda/init.py", line 341, in _lazy_call
callable()
File "/home/dalsh/MachineLearning/.local/python/3.12.11/lib/python3.12/site-packages/torch/cuda/random.py", line 129, in cb
default_generator.manual_seed(seed)
torch.AcceleratorError: HIP error: an illegal memory access was encountered
HIP kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing AMD_SERIALIZE_KERNEL=3
Compile with TORCH_USE_HIP_DSA to enable device-side assertions.
Args: env PYTORCH_TUNABLEOP_ENABLED=1 TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1 python ./main.py --use-pytorch-cross-attention
Render time: 6.65s 🥳
Notes: First render takes 300-500 seconds. This happens when you first load a model, change models or change VAE decode settings. Would like to see this come down. The second render took 30 seconds. The third+ render takes 6 seconds
It's now twice as fast as my old 6900xt
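Since PYTORCH_TUNABLEOP_ENABLED=1 is set in that run, part of the long first render is presumably the tuning pass itself. If I understand TunableOp correctly, the tuning results can be written to a file and reused across launches so later sessions skip retuning; a sketch (env var names are from PyTorch's TunableOp docs, verify against your torch version):

```sh
# First launch: tune and record results (slow first renders).
PYTORCH_TUNABLEOP_ENABLED=1 PYTORCH_TUNABLEOP_TUNING=1 \
  PYTORCH_TUNABLEOP_FILENAME=tunableop_results.csv \
  python ./main.py --use-pytorch-cross-attention

# Later launches: reuse the recorded results without re-tuning.
PYTORCH_TUNABLEOP_ENABLED=1 PYTORCH_TUNABLEOP_TUNING=0 \
  PYTORCH_TUNABLEOP_FILENAME=tunableop_results.csv \
  python ./main.py --use-pytorch-cross-attention
```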
20 steps?
20 steps, same workflow/config as the other tests. I tried to get the MiGraphX node working but had no luck unfortunately.
90% of the time is spent in VAE decoding, getting around 4-5 it/sec
First render takes 500 seconds. When I change models first render takes 500 seconds. When I change image sizes the first render takes 500 seconds.
Also I'm occasionally getting this error:
!!! Exception during processing !!! HIP error: an illegal memory access was encountered
HIP kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing AMD_SERIALIZE_KERNEL=3
Compile with `TORCH_USE_HIP_DSA` to enable device-side assertions.
Traceback (most recent call last):
File "/mnt/data/MachineLearning/ComfyUI/execution.py", line 496, in execute
output_data, output_ui, has_subgraph, has_pending_tasks = await get_output_data(prompt_id, unique_id, obj, input_data_all, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb, hidden_inputs=hidden_inputs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/data/MachineLearning/ComfyUI/execution.py", line 315, in get_output_data
return_values = await _async_map_node_over_list(prompt_id, unique_id, obj, input_data_all, obj.FUNCTION, allow_interrupt=True, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb, hidden_inputs=hidden_inputs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/data/MachineLearning/ComfyUI/execution.py", line 289, in _async_map_node_over_list
await process_inputs(input_dict, i)
File "/mnt/data/MachineLearning/ComfyUI/execution.py", line 277, in process_inputs
result = f(**inputs)
^^^^^^^^^^^
File "/mnt/data/MachineLearning/ComfyUI/nodes.py", line 1525, in sample
return common_ksampler(model, seed, steps, cfg, sampler_name, scheduler, positive, negative, latent_image, denoise=denoise)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/data/MachineLearning/ComfyUI/nodes.py", line 1492, in common_ksampler
samples = comfy.sample.sample(model, noise, steps, cfg, sampler_name, scheduler, positive, negative, latent_image,
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/data/MachineLearning/ComfyUI/comfy/sample.py", line 45, in sample
samples = sampler.sample(noise, positive, negative, cfg=cfg, latent_image=latent_image, start_step=start_step, last_step=last_step, force_full_denoise=force_full_denoise, denoise_mask=noise_mask, sigmas=sigmas, callback=callback, disable_pbar=disable_pbar, seed=seed)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/data/MachineLearning/ComfyUI/comfy/samplers.py", line 1161, in sample
return sample(self.model, noise, positive, negative, cfg, self.device, sampler, sigmas, self.model_options, latent_image=latent_image, denoise_mask=denoise_mask, callback=callback, disable_pbar=disable_pbar, seed=seed)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/data/MachineLearning/ComfyUI/comfy/samplers.py", line 1051, in sample
return cfg_guider.sample(noise, latent_image, sampler, sigmas, denoise_mask, callback, disable_pbar, seed)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/data/MachineLearning/ComfyUI/comfy/samplers.py", line 1036, in sample
output = executor.execute(noise, latent_image, sampler, sigmas, denoise_mask, callback, disable_pbar, seed)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/data/MachineLearning/ComfyUI/comfy/patcher_extension.py", line 112, in execute
return self.original(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/data/MachineLearning/ComfyUI/comfy/samplers.py", line 1004, in outer_sample
output = self.inner_sample(noise, latent_image, device, sampler, sigmas, denoise_mask, callback, disable_pbar, seed)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/data/MachineLearning/ComfyUI/comfy/samplers.py", line 987, in inner_sample
samples = executor.execute(self, sigmas, extra_args, callback, noise, latent_image, denoise_mask, disable_pbar)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/data/MachineLearning/ComfyUI/comfy/patcher_extension.py", line 112, in execute
return self.original(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/data/MachineLearning/ComfyUI/comfy/samplers.py", line 759, in sample
samples = self.sampler_function(model_k, noise, sigmas, extra_args=extra_args, callback=k_callback, disable=disable_pbar, **self.extra_options)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/data/MachineLearning/ComfyUI/user/python/3.12.11/lib/python3.12/site-packages/torch/utils/_contextlib.py", line 120, in decorate_context
return func(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^
File "/mnt/data/MachineLearning/ComfyUI/comfy/k_diffusion/sampling.py", line 220, in sample_euler_ancestral
sigma_down, sigma_up = get_ancestral_step(sigmas[i], sigmas[i + 1], eta=eta)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/data/MachineLearning/ComfyUI/comfy/k_diffusion/sampling.py", line 70, in get_ancestral_step
sigma_up = min(sigma_to, eta * (sigma_to ** 2 * (sigma_from ** 2 - sigma_to ** 2) / sigma_from ** 2) ** 0.5)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
torch.AcceleratorError: HIP error: an illegal memory access was encountered
HIP kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing AMD_SERIALIZE_KERNEL=3
Compile with `TORCH_USE_HIP_DSA` to enable device-side assertions.
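If the illegal memory access keeps recurring, the error text itself points at a way to get a more trustworthy stack trace; a minimal sketch of that re-run (debugging only, it will be slower):

```sh
# Serialize HIP kernel launches so the faulting kernel is reported at the call
# site instead of asynchronously, as the error message above recommends.
AMD_SERIALIZE_KERNEL=3 PYTORCH_TUNABLEOP_ENABLED=1 TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1 \
  python ./main.py --use-pytorch-cross-attention
```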
I'm also interested in Vulkan benchmarks too. I've seen some LLM perform incredibly well on the 9070xt under Vulkan which gives me hope.
I've been trying to compile Pytorch to use a Vulkan backend but haven't been able to figure out their build process. The Vulkan backend is normally used for Android devices but IMO it would be helpful for getting GPU acceleration on under/unsupported devices (RDNA4, Strix Halo APUs, Intel GPUs, Snapdragon GPUs, etc).
I raised an issue on the pytorch repo asking them to release officially supported prebuilds of pytorch with a Vulkan backend pytorch/pytorch#160230 (comment)
https://www.phoronix.com/review/amd-rocm-7-strix-halo
Reviews and independent tests show Vulkan beating ROCm by a wide margin, which is insane because it's generic code versus kernels optimized by the manufacturer itself. AMD needs to fix this.
It makes me wonder whether anyone is testing realistic scenarios on products that consumers actually use.