LifeIsStrange
LifeIsStrange
@KhalilMrini
@matyascorvinus Noob question: what cloud do you recommend for training large neural networks?
no build error but crash on run python demo.py final Traceback (most recent call last): File "demo.py", line 7, in import coref_model as cm File "/home/stephane/coref2/3/e2e-coref/coref_model.py", line 17, in import...
todo try 1.12
note Deepspeed Mii would be better https://github.com/microsoft/DeepSpeed-MII
see the benchmark for stablediffusion (V1?): https://github.com/microsoft/DeepSpeed-MII/tree/main/examples/benchmark/txt2img
@minimaxir it will not *necessarily* bring gains but it is still a low hanging fruit that should be tried.
@digantamisra98 RAdam (not the full Ranger package) does not increase computational cost. I've read somewhere that Mish can be as efficient as Relu Maybe with https://github.com/thomasbrandon/mish-cuda?
@edobobo Yes I have downloaded a dataset and looked at the format. It is unclear to me how to transform a given sentence in the expected format... when to set...
Thanks a lot, is there a downside to requiring every words to be instance ?