lightllm Auto convert without tokenizer.json to prevent performance downgrade?

Open leiwen83 opened this issue 1 year ago • 2 comments

As mentioned in #20, lightllm performance degrades significantly when tokenizer.json is missing. For models that ship without this file, would it be reasonable to add an automatic conversion step at server startup to work around this?

Thx

Aug 12 '23 08:08 leiwen83

Hi @leiwen83, were you able to use the fast tokenizer in LightLLM? If so, would you like to open a pull request? 😄

Aug 13 '23 03:08 llehtahw

You can save the fast tokenizer in advance; refer to "fix slow tokenizer".

Then load the fast tokenizer automatically if you want. I have made this change in my own code.
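
For readers following along, the save-in-advance step might look roughly like the sketch below. It is not the code from the referenced fix or from lightllm itself. It assumes the Hugging Face `transformers` package is installed and that the model directory holds a slow tokenizer that `transformers` can convert (conversion may also need `sentencepiece` and `protobuf`). The helper names `has_fast_tokenizer` and `ensure_fast_tokenizer` are hypothetical.

```python
from pathlib import Path


def has_fast_tokenizer(model_dir: str) -> bool:
    """Return True if model_dir already contains a serialized fast tokenizer."""
    return (Path(model_dir) / "tokenizer.json").is_file()


def ensure_fast_tokenizer(model_dir: str) -> bool:
    """Hypothetical helper: write tokenizer.json into model_dir if it is missing.

    Returns True if a conversion was performed, False if nothing was needed.
    """
    if has_fast_tokenizer(model_dir):
        return False

    # Imported lazily so the file check above works without transformers installed.
    from transformers import AutoTokenizer

    # use_fast=True asks transformers for a fast tokenizer, converting from
    # the slow one when a converter is available for this model type.
    tokenizer = AutoTokenizer.from_pretrained(model_dir, use_fast=True)
    if not tokenizer.is_fast:
        raise RuntimeError(f"No fast tokenizer could be built for {model_dir}")

    # For a fast tokenizer, save_pretrained writes tokenizer.json,
    # so later loads can skip the conversion.
    tokenizer.save_pretrained(model_dir)
    return True
```

Calling `ensure_fast_tokenizer(model_dir)` once before the server starts would pay the conversion cost a single time. Writing into the model directory is a design choice here; a read-only checkpoint would need a separate output path.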

Aug 16 '23 02:08 super-buster
