donut
Trying to run the DocVQA dataset
I was trying to use the DocVQA dataset that is presented in the original repository. I added gt_parses to train_v1.0.json in the given format. At first I got an error from pyarrow, which I solved. Now I am getting this error:
raise DatasetGenerationError("An error occurred while generating the dataset") from e
datasets.exceptions.DatasetGenerationError: An error occurred while generating the dataset
Could you please release the DocVQA dataset format, especially the metadata.jsonl?
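For reference, here is a minimal sketch of how I currently understand the expected layout, based on the README: one JSON object per line in metadata.jsonl, with a file_name key and a ground_truth key whose value is a JSON string containing gt_parses as a list of question/answer pairs. The DocVQA field names (data, question, answers, image) are what I see in my copy of train_v1.0.json, and the helper names docvqa_to_metadata_lines and write_metadata are my own, so please treat all of this as an assumption rather than the official format.

```python
import json
from pathlib import Path


def docvqa_to_metadata_lines(docvqa: dict) -> list[str]:
    """Convert a DocVQA-style annotation dict into metadata.jsonl lines.

    Assumes each entry in docvqa["data"] has "question", "answers" (a list
    of strings) and "image" (a relative path to the page image).
    """
    lines = []
    for item in docvqa["data"]:
        # One question/answer pair per accepted answer (my assumption).
        gt_parses = [
            {"question": item["question"], "answer": answer}
            for answer in item["answers"]
        ]
        record = {
            # Only the file name, since metadata.jsonl sits next to the images.
            "file_name": Path(item["image"]).name,
            # ground_truth is a JSON *string*, not a nested object.
            "ground_truth": json.dumps({"gt_parses": gt_parses}),
        }
        lines.append(json.dumps(record))
    return lines


def write_metadata(annotation_path: str, split_dir: str) -> None:
    """Read the annotation file and write metadata.jsonl into split_dir."""
    with open(annotation_path, encoding="utf-8") as f:
        docvqa = json.load(f)
    out_path = Path(split_dir) / "metadata.jsonl"
    out_path.write_text(
        "\n".join(docvqa_to_metadata_lines(docvqa)) + "\n", encoding="utf-8"
    )
```

With this I would call write_metadata("train_v1.0.json", "dataset/train") after copying the images into dataset/train (both paths are placeholders). If every line has the same two string-valued keys, I would expect the loader to infer a consistent schema, but I have not been able to confirm that this is what the repository expects.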