AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

Results 68 AdvancedLiterateMachinery issues

I have not changed anything and am using SynthText for training, but the loss still stays around 3.0 after 2 hours of training?

![issue](https://github.com/AlibabaResearch/AdvancedLiterateMachinery/assets/68524490/5800625d-bebc-4286-a00a-01663b95799d)

    if inputs_embeds is None:
        inputs_embeds = self.word_embeddings(input_ids)
    token_type_embeddings = self.token_type_embeddings(token_type_ids)
    embeddings = inputs_embeds + token_type_embeddings
    if self.position_embedding_type == "absolute":
        position_embeddings = self.position_embeddings(position_ids)
        embeddings += position_embeddings
    if "line_bbox" in kwargs:...

I trained the model without pretrained weights, and the final results are shown below (according to the paper, SER F1 should be 83.39% and RE F1 should be 74.91%): ![image](https://github.com/AlibabaResearch/AdvancedLiterateMachinery/assets/31800397/707e57cd-045c-43be-ba38-0486a6d96a8f) However, when...

I'm reading the paper. For our project, the pretraining part is more helpful, so I wonder whether you have also shared that code somewhere. Also, I have a question...

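Hello, authors! VLPT-STD is great work, but so far only the pretraining code has been released. Could you provide the code for testing the model and for visualizing the cross-attention?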

The data on which we want to fine-tune the GeoLayoutLM model doesn't have key-value pairs, e.g. Name, Address, Dates, etc. Is it possible to ignore the entity linking part and just train...

![issue](https://github.com/AlibabaResearch/AdvancedLiterateMachinery/assets/68524490/6a129631-331a-4885-8fc2-4a62ff8ae063) Thank you very much for your earlier help. I've been working with your model recently, but I'm confused about the calculation of H1, H2, and H3 in the graph,...

Thank you for your excellent work and for releasing the source code. LISTER is very inspiring for solving the corner cases of STR on long text. Could you give more details...