DavideHe

Results 12 issues of DavideHe

as the article : https://sieunpark77.medium.com/a-late-review-of-openais-training-verifiers-to-solve-math-word-problems-0d457eb706e3 `For each training problem, we sample 100 completions from the generator and label each solution as correct or incorrect` as the words , I think...

After I check the project , I did not find the optimizer state saving code but only model params.

enhancement