DocBank icon indicating copy to clipboard operation
DocBank copied to clipboard

DocBank: A Benchmark Dataset for Document Layout Analysis

Results 26 DocBank issues
Sort by recently updated
recently updated
newest added

Hi!I am using huggingface to the pretrained_weights of layoutlm_large_500k_epoch_1.But huggingface shows me the errors as below: Traceback (most recent call last): File "D:\SoulCode\PaddleDetection\DocBank\DocBank_infer.py", line 8, in model = LayoutLMForTokenClassification.from_pretrained("D:\Download\layoutlm_large_500k_epoch_1") File...

This lead to a very confused result during training and it's difficult to locate this hidden problem...Need modify the categories order to keep them same. Though this is a small...

PDF process script needs to have requirements needed for it to run as a separate file. In pdfplumber v.0.5.24 reference to Container.figures has been removed and script is not working...

I have been working on DocBank_samples since a month now. Today I downloaded the main dataset from onedrive and I could not see any pdf files! I wanted to request...

Hi, Do you have a plan for releasing faster rcnn weights and code?

Hello, I am trying to use the X101 arch from the Model ZOO as a backbone for one of my experiments with the DOCBANK dataset. I am using the COCO...

Which classes corresponds to which ID in the pretrained network (e.g. author = ID 8)? I tried the three different data subsets, but to me it looked like none of...

pdf_process.py报错, traceback (most recent call last): | 0/42 [00:00

Hi: Thank you for your datasets. Direct download in browser is unstable. Could you offer a method by which we can download the data from the command line, e.g. wget?...

Hi, @liminghao1630 @ranpox @doc-analysis could you please product the direct downloadable link in the onedrive please, so we could download it on the server?