49 comments of Srinivas Billa

@mgoin ohhh I see. Lol mb

To be fair, even if they do release it, I don't think it can be run on anything less than a 4090. And if you have a 4090, you can...

Then again, we could try using bitsandbytes.

True. But still, a full pretrained model would be nice.

@RahulBhalley The author of the repo said he trained it on a single GPU (so something like a 3090?). In terms of time, I'm not sure. But #58 said he trained...

@RahulBhalley Using bnb is super easy:

```python
import bitsandbytes as bnb

optim_g = bnb.optim.AdamW(...)
```

You can use it as a drop-in replacement.
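To make the swap concrete, here's a minimal sketch of a toy training step with the 8-bit optimizer. The model, learning rate, and betas are placeholders for illustration, not values from the VITS configs:

```python
import torch
import bitsandbytes as bnb

# Placeholder model just to illustrate the swap; any nn.Module works.
model = torch.nn.Linear(256, 256).cuda()

# Standard optimizer:
# optim_g = torch.optim.AdamW(model.parameters(), lr=2e-4, betas=(0.8, 0.99))

# 8-bit drop-in replacement (optimizer state kept in 8 bits, saving VRAM):
optim_g = bnb.optim.AdamW8bit(model.parameters(), lr=2e-4, betas=(0.8, 0.99))

# One toy training step to show nothing else in the loop changes.
loss = model(torch.randn(8, 256, device="cuda")).pow(2).mean()
loss.backward()
optim_g.step()
optim_g.zero_grad()
```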

I used it for finetuning VITS and it saved me almost 3 GB of VRAM: https://github.com/nivibilla/efficient-vits-finetuning

I see. I mean, see how much VRAM you save. If it's only something like 3 GB, is it really worth it? The point of using 8-bit optimisers is mainly for finetuning...

Have you had a look at the other VALL-E implementation? It uses DeepSpeed.
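For anyone who hasn't tried it, a rough sketch of what wiring a model into DeepSpeed looks like. The placeholder model and config values are just illustrative (not taken from that repo), and you'd normally launch this with the `deepspeed` CLI:

```python
import torch
import deepspeed

model = torch.nn.Linear(256, 256)  # placeholder module

# Illustrative config: fp16 plus ZeRO stage 2 to shard optimizer state.
ds_config = {
    "train_micro_batch_size_per_gpu": 8,
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 2},
    "optimizer": {"type": "AdamW", "params": {"lr": 2e-4}},
}

# DeepSpeed wraps the model and builds the optimizer from the config.
model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)

x = torch.randn(8, 256, device=model_engine.device, dtype=torch.half)
loss = model_engine(x).pow(2).mean()
model_engine.backward(loss)  # replaces loss.backward()
model_engine.step()          # replaces optimizer.step() + zero_grad()
```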

Btw, the original paper used AdamW.