stable-diffusion-webui-tensorrt

No real speedup

Open Ph0rk0z opened this issue 1 year ago • 2 comments

I had hoped this would help make generation faster on my P40. Unfortunately, it only seems to add a few seconds to generation time. I made the model work from 512 to 768 and tried converting in both fp16 and fp32. For a 704x704 image, it was 21 sec without TensorRT vs 24 sec with it.

I will see if there's some difference on my 3090. I at least hope there is. The OS was Linux.

So far I got a bigger speedup from getting rid of the ancient transformers and accelerate versions pushed via requirements.

Ph0rk0z avatar Jul 28 '23 20:07 Ph0rk0z

Same on a V100: same speed as xformers.

Daishijun avatar Sep 14 '23 18:09 Daishijun

Adding `shared.sd_model.model.diffusion_model = current_unet` after `current_unet.activate()` will enable TrtUnet and speed it up.

fpzh2011 avatar Dec 15 '23 07:12 fpzh2011
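
For readers trying the suggestion above, here is a toy sketch of the pattern it describes. It does not import the webui or the extension: `shared`, `OriginalUnet`, and `TrtUnetStandIn` are hypothetical stand-ins built for illustration, and only the attribute path `shared.sd_model.model.diffusion_model` and the `activate()` call are taken from the comment. In this toy model, activating the replacement object does not change which object gets called until it is assigned to that attribute; whether the real extension behaves the same way is an assumption.

```python
from types import SimpleNamespace


class OriginalUnet:
    """Hypothetical stand-in for the default PyTorch UNet."""

    def __call__(self, x):
        return f"pytorch({x})"


class TrtUnetStandIn:
    """Hypothetical stand-in for the extension's TensorRT-backed UNet."""

    def __init__(self):
        self.active = False

    def activate(self):
        # In this sketch, activation only flips a flag on the object itself.
        self.active = True

    def __call__(self, x):
        return f"tensorrt({x})"


# Hypothetical stand-in for the `shared` module, mirroring the attribute
# path quoted in the comment above.
shared = SimpleNamespace(
    sd_model=SimpleNamespace(
        model=SimpleNamespace(diffusion_model=OriginalUnet())
    )
)

current_unet = TrtUnetStandIn()
current_unet.activate()

# Calls still go through the original object at this point.
before = shared.sd_model.model.diffusion_model("latent")

# The line suggested in the comment: point the model at the activated unet.
shared.sd_model.model.diffusion_model = current_unet

after = shared.sd_model.model.diffusion_model("latent")
print(before, "->", after)
```

To check whether the change matters on a real install, compare generation time for the same prompt, seed, and resolution before and after adding the assignment.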