AdvancedLiterateMachinery
Processing of the pre-training dataset IIT CDIP 1.0
Can you please provide the code used to process the pre-training dataset IIT CDIP 1.0? I am trying to retrain the weights for use with a new encoder. Any help from the developers would be greatly appreciated.
This is for GeoLayoutLM.