Ryan Dick
> Did not consider because... well, I didn't know it was implemented there. Easier to do it this way, but one advantage of keeping the source might be to...
This actually sounds like the expected behaviour. We no longer support non-lazy model loading. So, with lazy offloading and the VRAM cache limit so low, only the last used model...
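To illustrate the effect (a minimal sketch in plain Python, not the actual model cache code): with a tiny VRAM limit and lazy offloading, an LRU-style cache keeps only the most recently used model resident and offloads everything else.

```python
from collections import OrderedDict

class TinyVRAMCache:
    """Illustrative LRU cache, not the real implementation: with a very small
    VRAM limit, only the most recently used model stays resident."""

    def __init__(self, vram_limit_bytes: int):
        self._limit = vram_limit_bytes
        self._resident: "OrderedDict[str, int]" = OrderedDict()  # key -> size in bytes

    def load(self, key: str, size_bytes: int) -> None:
        # Mark the model as most recently used.
        self._resident[key] = size_bytes
        self._resident.move_to_end(key)
        # Lazily offload least-recently-used models until we fit under the limit.
        while sum(self._resident.values()) > self._limit and len(self._resident) > 1:
            evicted, _ = self._resident.popitem(last=False)
            print(f"offloading {evicted} to CPU")

cache = TinyVRAMCache(vram_limit_bytes=1)
cache.load("unet", 4)
cache.load("text_encoder", 2)  # "unet" is offloaded; only the last used model stays
```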
That proposal sounds reasonable and relatively straightforward from an implementation perspective. My main concern is that it might be hard for a user to find a `clear_cache_after` value that is...
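A rough sketch of the tuning problem, assuming a setting along these lines (the names here are hypothetical, not the actual config API):

```python
# Hypothetical setting: clear the model cache every N invocations. A value that
# is too low throws away models that are about to be reused; too high and the
# cache keeps growing until memory pressure becomes a problem.
clear_cache_after = 10

def maybe_clear(cache: dict, invocation_count: int) -> None:
    if invocation_count % clear_cache_after == 0:
        cache.clear()  # all models must be reloaded on the next use
```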
Link to the diffusers issue for reference: https://github.com/huggingface/diffusers/issues/9171
I haven't looked at the code yet, but do you know if there are still use cases for using attention processors other than Torch 2.0 SDP? Based on the benchmarking...
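For reference, the Torch 2.0 SDP path boils down to a single call (toy shapes below, not tied to any particular model):

```python
import torch
import torch.nn.functional as F

# Toy tensors shaped (batch, heads, sequence_len, head_dim).
q = torch.randn(1, 8, 64, 40)
k = torch.randn(1, 8, 64, 40)
v = torch.randn(1, 8, 64, 40)

# scaled_dot_product_attention dispatches to flash / memory-efficient kernels
# when available and falls back to the math implementation otherwise.
out = F.scaled_dot_product_attention(q, k, v, dropout_p=0.0, is_causal=False)
print(out.shape)  # torch.Size([1, 8, 64, 40])
```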
I thought about this some more, and I'm hesitant to proceed with trying to merge this until we have more clarity around which attention implementations we actually want to support....
Not for this PR, but I did some performance testing and we'll probably want to address this at some point:

SDXL:
```bash
>>> Time taken to prepare attention processors: 0.10069823265075684s...
```
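A measurement along these lines is straightforward to reproduce (a sketch; `prepare_attention_processors` is a stand-in for the actual setup step, not the real function name):

```python
import time

def prepare_attention_processors() -> None:
    """Stand-in for the real setup step being timed; replace with the actual call."""
    time.sleep(0.1)

start = time.time()
prepare_attention_processors()
print(f">>> Time taken to prepare attention processors: {time.time() - start}s")
```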
It looks like there was a significant rewrite of the attention logic after the latest round of review and testing on this PR. @StAlKeR7779 can you shed some light on...
@gigend @KudintG @tensorflow73 Just to confirm, are you all seeing `Process exited with code: 3221225477`? Or just the same warnings that lead up to it? And, can you all confirm...
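For context on the code itself: 3221225477 is 0xC0000005, the Windows STATUS_ACCESS_VIOLATION status, i.e. a crash in native code rather than a Python-level exception.

```python
# 3221225477 decoded: this is the Windows STATUS_ACCESS_VIOLATION status code,
# which indicates a native access violation rather than a Python-level error.
print(hex(3221225477))  # 0xc0000005
```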
> I've found that the base Flux schnell model with the standard T5 works fine, but other flux models crash it out with the error pointing to the flash attention...