CLIP
Question about simple_tokenizer.
Hey, I have noticed that the tokenization code (simple_tokenizer.py) is very similar to the GPT-2 encoding, except that the vocabulary contains tokens ending with "</w>". What is the meaning of "</w>"?
I would be grateful for any help.