lightllm icon indicating copy to clipboard operation
lightllm copied to clipboard

Auto convert without tokenizer.json to prevent performance downgrade?

Open leiwen83 opened this issue 1 year ago • 2 comments

As mentioned in #20 , lightllm performance would downgrade a lot if without tokenizer.json. So for those model without this file, shall it be reasonable to add some auto conversion process in the server start to workaround this case?

Thx

leiwen83 avatar Aug 12 '23 08:08 leiwen83

Hi @leiwen83, Were you able to use the fast tokenizer in LightLLM? If yes, would you like to open a pull request ? 😄

llehtahw avatar Aug 13 '23 03:08 llehtahw

You can save fast tokenizer in advane, refer to fix slow tokenizer

Then auto load fast tokenizer if you want. I have modified in my code

super-buster avatar Aug 16 '23 02:08 super-buster