LLaMA-Factory icon indicating copy to clipboard operation
LLaMA-Factory copied to clipboard

How to add a large amount of new special tokens?

Open Zhang-Each opened this issue 1 month ago • 1 comments

Reminder

  • [x] I have read the above rules and searched the existing issues.

System Info

I plan to add a large amount of special tokens to the LLM vocab (e.g. 1000 new tokens). I found that the parameter--new_special_tokens can be used but what if I want to add more tokens? Should I pass all the new tokens in the parameter or there is a more flexible way? Thank you!

Reproduction

Put your message here.

Others

No response

Zhang-Each avatar Oct 30 '25 01:10 Zhang-Each

You can refer to this PR https://github.com/hiyouga/LLaMA-Factory/pull/9267

Kuangdd01 avatar Oct 30 '25 15:10 Kuangdd01