Jenia Jitsev

Results 13 comments of Jenia Jitsev

@janEbert , @mehdidc & everyone: It seems Eleuther AI folks work on training and then releasing a publicly available large GPT version (175B one), and they as well use a...

> Well, we are lucky enough to use the DeepSpeed library itself so we have stage 2 working already! I can't test stage 3 as I don't have access to...

I think this is indeed a very important feature, and should be highly prioritized. For instance, chatGPT is not able to put any proper evaluation of uncertainty about right or...