
Document-parsing example

Open WaterKnight1998 opened this issue 1 year ago • 1 comments

Good morning,

First of all, thank you very much for open-sourcing this model.

I have been looking at this model as an alternative to Donut for document parsing. I think we will get better performance because OCR data is included.

However, after checking your repository, I only saw scripts for document classification and understanding. An example for document parsing or token classification would be helpful. By document parsing, I mean an example similar to this one for the CORD dataset.

Thanks in advance! @zinengtang @ziyi-yang

Best regards

WaterKnight1998 avatar Mar 11 '23 13:03 WaterKnight1998

Thanks again for your interest in our work. Here's an example of how UDOP works on an actual document: https://github.com/microsoft/i-Code/blob/main/i-Code-Doc/example_io.ipynb

Thanks.

ziyi-yang avatar Mar 27 '23 17:03 ziyi-yang

Hi @ziyi-yang, thank you for your amazing work. I followed the notebook with task_prefix = 'document classification', and it outputs the final result as expected: form. However, when I change it to task_prefix = 'layout analysis', it still outputs form. So, is the model only trained for one downstream task, document classification? (For more information: I am using the model udop-unimodel-large-224 downloaded from https://huggingface.co/ZinengTang/Udop.)

thinh-huynh-re avatar Jun 07 '23 10:06 thinh-huynh-re

I have the same question as @thinh-huynh-re. Could someone help us with that? Also, how is 00070353.json in the examples folder of i-Code-Doc generated?

glahoti6 avatar Sep 12 '23 22:09 glahoti6

Hi folks, we made progress on this, and it now works!

See https://discuss.huggingface.co/t/using-udop-for-layout-analysis/76871.

NielsRogge avatar Mar 12 '24 13:03 NielsRogge