optimum-habana
optimum-habana copied to clipboard
add mixtral trl sft
What does this PR do?
- add
pad_max
to pad inputs when sft with not packing, which is better for static shape - validate mixtral sft and add the training command
- validate mistral dpo pipeline