phalexo
It may be helpful for people to know that the different stages can be placed on different GPUs, which is useful for those who have several GPUs with smaller VRAM.
If by lighter you mean requiring less VRAM, then you can try setting the dtype to float16. T5 is then about 11.6 GiB.
Approximate VRAM usage:

- T5: about 11.6 GiB
- IF-I: about 9.2 GiB
- IF-II + IF-III: about 5.8 GiB together, roughly 3 GiB each separately
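As a rough sanity check on figures like these (a minimal sketch; the 5B parameter count below is a placeholder, not the actual size of any of these models), weight memory scales linearly with bytes per parameter, so float16 halves the float32 footprint:

```python
# Rough VRAM estimate from parameter count alone. Activations,
# text-encoder buffers, and CUDA context overhead are ignored,
# so real usage is somewhat higher than this lower bound.
def weight_gib(n_params: int, bytes_per_param: int) -> float:
    """Approximate weight memory in GiB (2**30 bytes)."""
    return n_params * bytes_per_param / 2**30

# Hypothetical 5B-parameter model:
fp32 = weight_gib(5_000_000_000, 4)  # ~18.6 GiB
fp16 = weight_gib(5_000_000_000, 2)  # ~9.3 GiB, exactly half of fp32
```

This is why switching the dtype to float16 cuts the per-stage footprint roughly in half, before any overhead.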
If stages are not callable, then it is not clear to me which parts are callable. Did you manage to compile anything?
I just want the inference to be faster. Can you paste the snippet of code that worked for you?
Regardless of where I put it, it either complains about parallelism or simply hangs. Where exactly did you put it: a `torch.compile` call or the `@torch.compile` decorator?
Not quite as crisp as the image above.

This kind of looks ok. 
> @phalexo: I'm running all the full models at full resolution on a 48 GB VRAM RTX A6000 instance on RunPod[1]. What are you using?
>
> [1] This is not...