diffusers icon indicating copy to clipboard operation
diffusers copied to clipboard

Inference speed

Open rishabh063 opened this issue 1 year ago • 3 comments

What is the fastest way to do inference with sdxl and sd1.5 these days without suffering on quality?

Seen a lot of 3rd party libs like openvivo , stable fast , stream diffusion , deepcache etc

But there is some quality loss or problems in inference.

Looking for inpainting specifically

rishabh063 avatar Apr 09 '24 21:04 rishabh063

Have you looked at this discussion too? Also, if you haven't encountered, there are fast sampling techniques to research about. There are trade-offs most of the time -no free lunch, no silver bullet etc.

tolgacangoz avatar Apr 10 '24 05:04 tolgacangoz

bf 16 & sdpa seem to have no tradeoff IMO ,

Is there way to compile for multiple output dimensions ?

rishabh063 avatar Apr 10 '24 23:04 rishabh063

@rishabh063 The Discussions section would be a better place for this question.

DN6 avatar Apr 13 '24 17:04 DN6

bf 16 & sdpa seem to have no tradeoff IMO ,

Is there way to compile for multiple output dimensions ?

(A discussion was not opened, so I am posting here.)

One can determine a static maximum output shape and then mask out unused parts?

tolgacangoz avatar Jul 29 '24 08:07 tolgacangoz

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

github-actions[bot] avatar Sep 14 '24 15:09 github-actions[bot]