Brandon Smock comments

Results 33 comments of


                                            Brandon Smock

Does process_fintabnet.py not output labels for fine-tuning table detection?

We probably should have included extra code to create table detection data, but I believe this was left out because we only do table structure recognition in our latest paper....

How to only compute standard object detection metrics for structure model?

Hi, The code doesn't currently support this because we didn't design TATR to be used without words. But it makes sense to still want to evaluate the model performance. We...

issue with package_area = package_rect.get_area() when adding --words_dir arg

Hi, Since Rect is in the form (x0, y0, x1, y1), in your case we would have y0 > y1, which means technically the box has negative area. Is this...

How to load fine-tuned model into Hugging-Face Table-Transformer

If you're training on the original PubTables-1M and FinTabNet.c (FinTabNet.a6) together then one reason you may see lower numbers during evaluation is we changed how we evaluate on PubTables-1M in...

Postprocessing col/row refinement before alignment!

Hi, thanks for bringing this up, it's a great point. If I understand you correctly, this situation can happen when two predicted columns are non-overlapping but stacked vertically, one above...

Question regarding postprocessing

Great observation, I see the discrepancy you are referring to. The key question is: what do we do when "no-object" is the most probable class prediction? Do we suppress/filter out...

ValueError: y1 must be greater than or equal to y0

Hi, Is this with one of the pre-trained models and one of the released datasets? If so can you share which model and which input image? If not, is there...

How to handle the nesting of tables in PDF?

Hi, I think with anything in document information extraction, the first question I would ask is: what would you like the final result to be? That question comes before deciding...

Inference individual Image for table detection

In the current version of the code, you can find the function that takes the model output and processes it into a table representation here: https://github.com/microsoft/table-transformer/blob/3e1dd0c3cad7956c790765b491ec86817e94ce43/src/grits.py#L727

Inference individual Image for table detection

> How can I get the page_tokens if I will use a new Image input Right now the code is written to be used with the PubTables-1M dataset or any...