OpenRLHF
fix vLLM v0.4.1
Convergence testing is also required
test failed with TP=1; test passed with TP=2
TP=1 fixed