Brandon Smock

33 comments by Brandon Smock

I believe the issue in this example is the padding that the original pre-trained model expects around the table. If you're running the inference script from the command line, try...

Hi, the model weights and current optimizer state are saved after every epoch. If training is interrupted after epoch N completes, you can resume training at epoch...

Hi, sometimes the padding around the table can affect the pre-trained model we released. But in that case only the edge rows and edge columns are usually affected. That's fixed...
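To make the padding point concrete, here is a minimal sketch of expanding a detected table box before cropping, so the crop keeps some margin around the table. The function name `pad_bbox` and the 30-pixel value are assumptions for illustration only; the comments above do not state how much padding the released model expects.

```python
def pad_bbox(bbox, padding, image_size):
    """Expand (x0, y0, x1, y1) by `padding` pixels per side, clamped to the image.

    `padding` is a hypothetical value; tune it for your own crops.
    """
    x0, y0, x1, y1 = bbox
    width, height = image_size
    return (
        max(0, x0 - padding),
        max(0, y0 - padding),
        min(width, x1 + padding),
        min(height, y1 + padding),
    )


# Example: a table box inside a 550x800 page image.
padded = pad_bbox((100, 200, 500, 600), padding=30, image_size=(550, 800))
```

The padded box can then be passed to whatever cropping call you already use, for example `Image.crop` in Pillow.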

Hi, that code was written with PyMuPDF in mind, which is a Python library that can be used to extract words from digital-born PDF documents. Every word extracted by PyMuPDF...
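As a rough illustration of the PyMuPDF word format: `page.get_text("words")` returns one tuple per word in the form `(x0, y0, x1, y1, text, block_no, line_no, word_no)`. The sketch below converts such tuples into dictionaries. The helper name `words_to_tokens` and the dictionary keys are assumptions chosen for this example, not necessarily the exact structure the repository code expects.

```python
def words_to_tokens(words):
    """Convert PyMuPDF-style word tuples into a list of token dicts.

    Each input tuple is (x0, y0, x1, y1, text, block_no, line_no, word_no).
    The output keys are illustrative assumptions.
    """
    tokens = []
    for x0, y0, x1, y1, text, block_no, line_no, word_no in words:
        tokens.append({
            "bbox": [x0, y0, x1, y1],
            "text": text,
            "block_num": block_no,
            "line_num": line_no,
            "span_num": word_no,
        })
    return tokens


# With PyMuPDF installed, the tuples would come from a page, e.g.:
#   import fitz  # PyMuPDF
#   page = fitz.open("document.pdf")[0]   # hypothetical file name
#   tokens = words_to_tokens(page.get_text("words"))
sample_words = [(72.0, 90.5, 110.2, 102.5, "Revenue", 0, 0, 0)]
tokens = words_to_tokens(sample_words)
```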

Hi, are you using the model trained only on PubTables-1M? I can see why that model would be confused: it hasn't seen very many tables (if any) where a dollar...

My first thought would be: is it possible for you to create a simple post-processing rule to filter these unwanted tables out when they occur? If you want the model...
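A post-processing rule of this kind might look like the sketch below, which drops detections that score too low or are too small to be a plausible table. The detection format (`"bbox"` and `"score"` keys), the function name `filter_tables`, and all thresholds are assumptions for illustration; the right rule depends on what the unwanted tables look like in your data.

```python
def filter_tables(detections, min_score=0.9, min_width=50, min_height=20):
    """Keep detections that pass a score threshold and a minimum size.

    `detections` is assumed to be a list of dicts with a "bbox" of
    (x0, y0, x1, y1) and a "score" in [0, 1]. Thresholds are hypothetical.
    """
    kept = []
    for det in detections:
        x0, y0, x1, y1 = det["bbox"]
        if det["score"] < min_score:
            continue  # low-confidence detection
        if (x1 - x0) < min_width or (y1 - y0) < min_height:
            continue  # too small to be a table of interest
        kept.append(det)
    return kept


detections = [
    {"bbox": (10, 10, 400, 300), "score": 0.98},  # kept
    {"bbox": (10, 10, 400, 300), "score": 0.55},  # dropped: low score
    {"bbox": (10, 10, 40, 25), "score": 0.99},    # dropped: too small
]
tables = filter_tables(detections)
```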

There is currently no difference between the two. The Hugging Face model was converted directly from the one in this repo. (This could change in the future, though.)

Yes, this is possible when training DETR, but we did not write the data loader to handle empty training samples. I believe it would be just a few lines of...

First of all, congrats on integrating OCR with the model code. This looks very well done and we hope it inspires others to do the same! As far as your...

This should be possible, probably with just a few changes to the code, depending on how closely what you're trying to do matches what we did. You said you have...