alignment-handbook
SmolLM2 minor issues
Hello,
On a fresh conda env install following the README, I get two minor annoyances when trying to run SFT on SmolLM2 with
ACCELERATE_LOG_LEVEL=info accelerate launch --config_file recipes/accelerate_configs/deepspeed_zero3.yaml scripts/run_sft.py recipes/smollm2/sft/config_smol.yaml
- I get an unused parameter exception for `use_flash_attention_2` from `H4ArgumentParser`. Deleting the corresponding line from the config solves the problem (I think).
- I get a `ValueError: Chat template is already added to the tokenizer. If you want to overwrite it, please set it to None` from the `setup_chat_format` [here](url). Adding
if hasattr(tokenizer, "chat_template") and tokenizer.chat_template is not None:
    tokenizer.chat_template = None  # Reset the chat template
solves the problem (I think).
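
For reference, the second workaround can be written as a small helper. This is only a sketch: `reset_chat_template` is a hypothetical name, not something in alignment-handbook or TRL, and the `SimpleNamespace` object is a stand-in for a real tokenizer so the snippet runs without downloading a model. In the actual script it would presumably be called on the loaded tokenizer just before `setup_chat_format(model, tokenizer)`.

```python
from types import SimpleNamespace


def reset_chat_template(tokenizer):
    """Clear an existing chat template; return True if one was cleared.

    Hypothetical helper: works on any object with a `chat_template` attribute.
    """
    if getattr(tokenizer, "chat_template", None) is not None:
        tokenizer.chat_template = None  # Reset the chat template
        return True
    return False


# Stand-in for a real tokenizer (assumption: only the attribute matters here).
fake_tokenizer = SimpleNamespace(chat_template="{{ messages }}")

was_reset = reset_chat_template(fake_tokenizer)
print(was_reset, fake_tokenizer.chat_template)
```

Clearing the template discards whatever template shipped with the tokenizer, so this is only appropriate if the template that gets installed afterwards is the one you want for training.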
I don't know if I am doing something wrong or if the solutions I mentioned are actual solutions.
Thanks