unilm
unilm copied to clipboard
About mean and variance
You guys do a fascinating job in the document analysis field.
I have a question about the preprocess of the document image in Dit. Why would you use the mean and variance from imagenet datasets rather than the one you calculated? In my opinion, you have tons of thousands of document images that are probably larger than the imageNet.