Akarsh
2 repositories owned by Akarsh
docformer
Stars: 22
Forks: 3
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer-based architecture for the task of Visual Document Understanding (VDU)
latr
Stars: 51
Forks: 7
Implementation of LaTr: Layout-aware transformer for scene-text VQA, a novel multimodal architecture for Scene Text Visual Question Answering (STVQA)