weiminw

12 issues by weiminw

### Your current environment

```text
The output of `python collect_env.py`
```

### How would you like to use vllm

I want to run inference of a [specific model](put link here)....

usage

Could you add support for reward model training? When I load the model with unsloth, I found that it does not have a reward model structure (last layer is not...