course
course copied to clipboard
The Hugging Face course on Transformers
when i studied the "6. The tokenizers liabrary -- Unigram tokenization" , i couldn't understand the following why the P("pu") = 5/210, shouldn't it be the 17 / 210, because...
Hi,thanks for your excellent course and translation.Recently,I found one (maybe) mistranslation during the learning process. In [2.mdx of chapter 3 in zh-CN](https://github.com/huggingface/course/blob/main/chapters/zh-CN/chapter3/2.mdx), I found that the word 'datasets' was misspelled...
docs(zh-cn): Reviewed 36_slice-and-dice-a-dataset-🔪.srt #390
fix typo.
The variable `tokenized_dataset` isn't initialized in the code. Leading to exception when the cell is reached. On the other hand `tokenized_datasets` var does exist. It was probably intended to be...
Second portion of the chapter, which focuses on `accelerate` library, uses `tokenized_dataset` variable. However this variable doesn't exist, leading to error if person following the guide attempts to execute the...
make Easy to understand
This PR adds a helpful tip for users to ensure proper setup before using the MRPC dataset. The tip reminds users to check if the `datasets` package is installed by...
https://huggingface.co/learn/nlp-course/chapter6/7?fw=pt says: > In this (very) particular case, we had two equivalent tokenizations of all the words: as we saw earlier, for example, "pug" could be tokenized ["p", "ug"] with...