llama-trl
llama-trl copied to clipboard
What's the advantage of this library compare to the official TRL?
Perhaps the smallest implementation of TRL, with simple logic.