alfonsoborello
In my experience, the 1558M doesn't make much of a difference. Fine-tuning the 355M is good enough; so much fanfare for nothing, a total publicity stunt. GPT-2 seems pretty much...