Shafiq Jetha

Results 14 comments of Shafiq Jetha

Does it run forever or does it just take a really long time? I ask because the most efficient way of training and running inference on a model like this...

The gpt file would also benefit from this change, I think.

Yes. In his video, he does go over why he's doing this. You can see his explanation here: https://youtu.be/kCc8FmEb1nY?si=VFtUYR-MjtrjR-Lw&t=5722 It's because there has been a "reshuffling" of the structure, as...

Yes, but it will take a very long time to complete one iteration.