Bruno Oliveira

Results 4 issues of Bruno Oliveira

If I got this right, currently if you try to eval an EnvGroup with a `--num-samples` smaller than the EnvGroup's first env, the eval will run only on that first...

## Description My attempt to implement #438. I ran into a few edge-cases I need to solve before this can be merged. I successfully tested the re-score workflow with the...

## Description This PR adds a `--no-interleave-scoring` flag to `vf-eval` so we can control whether `evaluate` will do interleaved scoring or not. ## Type of Change - [ ] Bug...

## Description Adds a QoL "go to" input to the TUI, so it's easier to test/debug an env while developing it. ## Type of Change - [ ] Bug fix...