CascadeTabNet icon indicating copy to clipboard operation
CascadeTabNet copied to clipboard

Testing with my scanned documents.

Open zokai opened this issue 5 years ago • 8 comments

I would like to try CascadeTabNet with my own images. I used main.py and set the followings path:

image_path xmlPath config_fname

However I don't know what to set for these variables: checkpoint_path = "path to checkpoint directory" epoch = 'epoch_file.name'

Does your github repository come with a pre-trained model based on your dataset? or I have to train your model locally?

Thanks

zokai avatar Sep 30 '20 15:09 zokai

Hi Zokai,

Would suggest point checkpoint to "cascade_mask_rcnn_hrnetv2p_w32_20e.py" and epoch to "epoch_36.pth"

cabudies avatar Oct 07 '20 09:10 cabudies

I agree with cabudies answer and would also recommend you to go through ~[Model Zoo of CascadeTabNet] (https://github.com/DevashishPrasad/CascadeTabNet#6-model-zoo). Model zoo includes different checkpoint files along with dataset on which they were trained.

epoch_36.pth

This model was trained on a highly specific dataset. I would recommend to start with a model trained on more general dataset such as

epoch_24.pth

and try different models whichever suits your need.

Kk-ships avatar Oct 08 '20 02:10 Kk-ships

Thank you.

On Wed, Oct 7, 2020 at 10:54 PM Kaustubh Shirpurkar < [email protected]> wrote:

I agree with cabudies answer and would also recommend you to go through ~[Model Zoo of CascadeTabNet] ( https://github.com/DevashishPrasad/CascadeTabNet#6-model-zoo). Model zoo includes different checkpoint files along with dataset on which they were trained.

epoch_36.pth

This model was trained on a highly specific dataset. I would recommend to start with a model trained on more general dataset such as

epoch_24.pth

and try different models whichever suits your need.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/DevashishPrasad/CascadeTabNet/issues/90#issuecomment-705299139, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACE6S2ZL4TLUHLFRDUGCQQTSJUSWXANCNFSM4R7GTG5Q .

zokai avatar Oct 08 '20 03:10 zokai

Hi, did you succeed in detecting your table structure with the given checkpoint ?

VincentJousse avatar Jan 14 '21 10:01 VincentJousse

Yes I was able to get table structure out using same checkpoint.

Kk-ships avatar Jan 14 '21 12:01 Kk-ships

Was it a border or a borderless table ? Did you use Colab ? Did you tweak anything to get good results ?

VincentJousse avatar Jan 14 '21 16:01 VincentJousse

Could you share your python code ?

VincentJousse avatar Jan 14 '21 16:01 VincentJousse

I used colab version. Here is the link for colab notebook. I think you need to make some changes while using images. Code is pretty self explanatory. https://colab.research.google.com/drive/16GzDZqfWCf3Kt6_EOk7FKZ7sHTYii-_w?usp=sharing

Kk-ships avatar Jan 18 '21 07:01 Kk-ships