docformer icon indicating copy to clipboard operation
docformer copied to clipboard

Pre-trained models

Open caop-kie opened this issue 2 years ago • 5 comments

Thanks for the great work! Do you have any plan to release the pre-trained model of docformer?

caop-kie avatar Oct 07 '22 15:10 caop-kie

Hi @AYSP, thanks for your appreciation. We have the scripts ready as for now, to pre-train DocFormer, but not sure if it would produce the extact same results as that of paper, since the author basically didn't describe the exact collection of data they used for pre-training (although it was RVL-CDIP), and beside that, there is resource constraint with us, so that also makes it a bit difficult to pre-train.

Regards, Akarsh

uakarsh avatar Oct 08 '22 04:10 uakarsh

@uakarsh Can you release the existing pre-training code. Even thought it doesn't produce good results it would be good as an starting point.

jmandivarapu1 avatar Nov 11 '22 03:11 jmandivarapu1

Hi @jmandivarapu1,

Although I didn't write the entire code, but I did write till the part where the pytorch dataset object could be made and one iteration/batch's forward and backward pass could be done

Here is the code https://github.com/shabie/docformer/blob/master/examples/DocFormer_for_MLM.ipynb

Hope it helps.

uakarsh avatar Nov 11 '22 04:11 uakarsh

I would be working from my side for MLM (although there are 3 pre-training task) and would update shortly.

Thanks,

uakarsh avatar Nov 11 '22 06:11 uakarsh

Hi @jmandivarapu1 @AYSP can you guys again try the fine-tuning using the pre-trained weights (I have attached them in the readme)

uakarsh avatar Feb 13 '23 07:02 uakarsh