verifiers
verifiers copied to clipboard
Clarify parallelize_scoring=False pattern + fix simpleqa example
Based on my understanding in simpleqa we do not need multiple .judge calls and can pass the cache state from the first one to generate reward in 2nd and 3rd functions. Opened a PR.