llama3
llama3 copied to clipboard
Can I use the transformers.AutoTokenizer to load the tokenizer?
I know the tokenizer.py in this Repo use TikTokenizer, can I use transformers.AutoTokenizer to load the tokenizer so that I dont need to amend my code class? And if i not use tokenizer.py, ChatFormat can not be used too.
I mean transformers.PretrainedTokenizer class
same question
I find the solution, you should use model files on huggingface. There is a tokenizer.json file can be loaded directly.
Yes, you can use AutoTokenizer.from_pretrained('meta-llama/Meta-Llama-3-8B-Instruct)