Akarsh

Results: 2 repositories owned by Akarsh

docformer

Stars: 22, Forks: 3

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer-based architecture for Visual Document Understanding (VDU)

latr

Stars: 51, Forks: 7

Implementation of LaTr: Layout-aware transformer for scene-text VQA, a novel multimodal architecture for Scene Text Visual Question Answering (STVQA)