wolfshow comments

Results 40 comments of


wolfshow

Model saving failed after the first epoch

Thanks @jsenellart ! Which sampling method do you recommend to use?

[markuplm] Unable to use with Huggingface

@NielsRogge Any updates for adding MarkupLM to Transformers?

Finetuning description for DocVQA using LayoutLMv3

@allanj You may refer to this https://github.com/anisha2102/docvqa for data pre-processing.

How to annotate the own receipt images for layoutLM

@SandyRSK Basically, you may need pre-process the SROIE dataset into token-level and fed the data into LayoutLM.

How to use layoutlmv3 in industry environment?

@matthew-wei Exporting LayoutLMv3 models into ONNX is not difficult because LayoutLMv3 only used standard operators in Transformers.

How to use layoutlmv3 in industry environment?

You may find information at https://github.com/microsoft/unilm/tree/master/layoutlmv3

Is layloutlmv3 support for SER and RE work for xfund dataset?

@githublsk, LayoutLMv3 supports SER and RE tasks, but it is an English model while XFUND is used to evaluate the LayoutXLM model.

Is layloutlmv3 support for SER and RE work for xfund dataset?

LayoutLMv3 is pre-trained with English documents only. For Chinese tasks, it is better to use LayoutXLM.

TrOCR - Segmentation Issue

@sameearif88 TrOCR is trained for text recognition only. For TrOCR, the input should be token-level or line-level, so you need to use text detection tools for line segmentation. We have...

how to train TrOCR for a new Language

@StephennFernandes Basically, you need to prepare the training data for Kannada. If you have any documents written in Kannada, you may use that. Otherwise, you can generate the training data...