Akarsh
Akarsh
@shabie Thanks for the tag. @athewsey, as far as the weights are concerned, I have tried implementing their MLM task (described in the repo), as well as Image Reconstruction Part...
As an update, the authors would be sharing the Textract OCR for the RVL CDIP Dataset, and as soon as they release it, we would try to achieve the benchmark...
Have completed the scripts for pre-training on MLM, and using DocFormer for Document Image Classification. Check it out here [DocFormer Examples with PyTorch-Lightening](https://github.com/uakarsh/docformer/tree/master/examples/docformer_pl)
Hi @WaterKnight1998 we have been able to train the model, you can find it [here](https://github.com/shabie/docformer/tree/master/examples/docformer_pl). The list of things done till now are: - [x] Pre-training script for DocFormer on...
Sure, with some modifications to the script of Document Image Classification and pre-processing, we would definitely be able to use it for token classification
Hi @Irlirion, are you still working on this PR? I would be happy to help you add a graph regression task in Flash.
Sure, I can convert it into a script For your questions: 1. The reason for using batch size 1 is, that I was using Amazon Sagemaker Studio (and, taking a...
Would surely do it today. Got a bit engaged in my own work and about using PyTorch Lightning with Speech Recognition.
Update: Have updated the script (along with instructions for downloading the dataset) and removed the notebook.
Sure, I have added the entry for the updation in [CHANGELOG.md](https://github.com/Lightning-AI/lightning-flash/blob/master/CHANGELOG.md) The change is [here](https://github.com/Lightning-AI/lightning-flash/compare/master...uakarsh:lightning-flash:patch-1) Is there any need of creating a PR for the same updation, i.e in CHANGELOG.md...