unilm icon indicating copy to clipboard operation
unilm copied to clipboard

Can LayoutLM be used for language generation ?

Open pzdkn opened this issue 2 years ago • 2 comments

I am using LayoutLM2 and LayoutLM3 for Key-Information Extraction. Since the output annotations are normalized, it's difficult to get token-level annotations.

I thought about rephrasing such tasks as a language generation problem instead, similar to Marksend et al, Doc2Dict: Information Extraction as Text Generation. However, is LayoutLM even capable/good at language generation ?

pzdkn avatar Jun 20 '22 13:06 pzdkn

@pzdkn LayoutLM can be used as a general-purpose encoder for downstream tasks. You may need to design the decoder for generation or copy operations for language generation tasks.

wolfshow avatar Jun 22 '22 03:06 wolfshow

@pzdkn any update ?

CheungZeeCn avatar Nov 07 '22 14:11 CheungZeeCn