text-embeddings-inference icon indicating copy to clipboard operation
text-embeddings-inference copied to clipboard

Upgrade tokenizers to 0.19.1 to deal with breaking change in tokenizers

Open scriptator opened this issue 9 months ago • 0 comments

This is necessary in order to load models whose tokenizers have been created by a version after the breaking change https://github.com/huggingface/tokenizers/pull/1476 (i.e. >= v0.19.0)

Fixes #265

Before submitting

  • [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • [ ] Did you read the contributor guideline, Pull Request section?
  • [x] Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case. --> #265
  • [ ] Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
  • [ ] Did you write any new necessary tests? --> not necessary

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.

scriptator avatar May 15 '24 13:05 scriptator