instructor-embedding
Redundant code & Tokenize issue
- Seems like there's some redundant code here.
- The tokenize method doesn't create a correct instruction_mask column.
The second problem appeared when I was using version 2.7.0 of sentence_transformers.
With version 3.0.0 of sentence_transformers, the local file issue still exists, even if I change my code according to #115 or #113.
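Since the behavior differs between 2.7.0 and 3.0.0, it may help to confirm which version is actually installed in the environment that shows the problem. This is a minimal sketch using only the standard library; the helper names `installed_version` and `major_of` are made up for illustration, and the distribution name `sentence-transformers` is assumed to be the pip name of the package.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def major_of(version: str) -> int:
    """Return the leading (major) component of a dotted version string."""
    return int(version.split(".")[0])


# Assumption: "sentence-transformers" is the pip distribution name.
version = installed_version("sentence-transformers")
if version is None:
    print("sentence-transformers is not installed in this environment")
else:
    print(f"sentence-transformers {version} (major version {major_of(version)})")
```

Including that output in a report makes it easier to tell the 2.x and 3.x cases apart.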
After you pip install the normal instructor package, try replacing the instructor.py file in the site-packages folder with the one from my fork, linked below, and let me know if this fixes the issue:
https://github.com/BBC-Esq/instructor-embedding
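To find the file to replace, something like the following sketch can print where the installed copy lives. It assumes the package is importable as `InstructorEmbedding` (an assumption, not stated in this thread); `find_package_file` is a hypothetical helper name, and it only locates the file without modifying anything.

```python
import importlib.util
from pathlib import Path
from typing import Optional


def find_package_file(package: str, filename: str) -> Optional[Path]:
    """Return the path of `filename` inside an installed package, or None."""
    spec = importlib.util.find_spec(package)
    if spec is None or not spec.submodule_search_locations:
        # Package is not installed, or it is a single-file module.
        return None
    for location in spec.submodule_search_locations:
        candidate = Path(location) / filename
        if candidate.is_file():
            return candidate
    return None


# Assumption: the import name of the instructor package is "InstructorEmbedding".
print(find_package_file("InstructorEmbedding", "instructor.py"))
```

If this prints `None`, the package is probably installed in a different environment than the one running the script. Backing up the original file before overwriting it makes it easy to revert.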