ReST-EM-pytorch
Implementation and exploration of the ReST-EM algorithm from the DeepMind paper "Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models"