diffusers Inference speed

Inference speed

Open rishabh063 opened this issue 1 year ago • 3 comments

What is the fastest way to do inference with sdxl and sd1.5 these days without suffering on quality?

Seen a lot of 3rd party libs like openvivo , stable fast , stream diffusion , deepcache etc

But there is some quality loss or problems in inference.

Looking for inpainting specifically

Apr 09 '24 21:04 rishabh063

Have you looked at this discussion too? Also, if you haven't encountered, there are fast sampling techniques to research about. There are trade-offs most of the time -no free lunch, no silver bullet etc.

Apr 10 '24 05:04 tolgacangoz

bf 16 & sdpa seem to have no tradeoff IMO ,

Is there way to compile for multiple output dimensions ?

Apr 10 '24 23:04 rishabh063

@rishabh063 The Discussions section would be a better place for this question.

Apr 13 '24 17:04 DN6

bf 16 & sdpa seem to have no tradeoff IMO ,

Is there way to compile for multiple output dimensions ?

(A discussion was not opened, so I am posting here.)

One can determine a static maximum output shape and then mask out unused parts?

Jul 29 '24 08:07 tolgacangoz

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

Sep 14 '24 15:09 github-actions[bot]

diffusers diffusers copied to clipboard

Inference speed

diffusers
diffusers copied to clipboard