RL4LMs
RL4LMs copied to clipboard
is multi-dimensional reward supported?
Hi, thanks for publishing this awesome library. Can I add a configuration / modify the reward.py to return a vector instead of a scalar reward?