handwritten-tf-1.0
Vocabulary Contents
What does vocabulary.txt contain? Does it contain encoded words or encoded characters? How was it generated?
vocabulary.txt contains the words in the dataset, so that they can be used for a language model (n-grams, for example). The file is created in utils.py. If you have enough data, though, the neural network can also learn the language model end-to-end.
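To illustrate the idea, here is a minimal sketch of how a word vocabulary could be collected from transcriptions and then used for simple bigram counts. This is not the code from utils.py: the function names `build_vocabulary` and `bigram_counts`, the whitespace tokenization, and the sample data are all assumptions made for the example.

```python
from collections import Counter


def build_vocabulary(transcriptions):
    """Return the sorted unique words found in the transcriptions.

    Hypothetical helper; splitting on whitespace is an assumption.
    """
    words = set()
    for line in transcriptions:
        words.update(line.split())
    return sorted(words)


def bigram_counts(transcriptions):
    """Count adjacent word pairs, the raw statistics behind a bigram model."""
    counts = Counter()
    for line in transcriptions:
        tokens = line.split()
        counts.update(zip(tokens, tokens[1:]))
    return counts


# Made-up sample transcriptions, for illustration only.
sample = ["the cat sat", "the cat ran"]
vocab = build_vocabulary(sample)
bigrams = bigram_counts(sample)

# Writing one word per line is one plausible layout for a vocabulary file:
# with open("vocabulary.txt", "w") as f:
#     f.write("\n".join(vocab))
```

A decoder could use counts like these to prefer word sequences that occur in the training text, which is the role the language model plays when there is not enough data to learn it end-to-end.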