handwritten-tf-1.0 icon indicating copy to clipboard operation
handwritten-tf-1.0 copied to clipboard

Vocabulary Contents

Open selcouthlyBlue opened this issue 6 years ago • 1 comments

What does vocabulary.txt contain? Do they contain encoded words? Do they contain encoded characters? How was that generated?

selcouthlyBlue avatar May 02 '18 04:05 selcouthlyBlue

vocabulary.txt contains the words in the dataset in order to use them for a language model(n-grams for example). the file is created in the utils.py. but if you have enough data the neural network also can learn the language models end-to-end

johnsmithm avatar May 05 '18 10:05 johnsmithm