Swathikiran Sudhakaran
I was using the Unimodal 224. However, according to the paper, the performance of the various models varies by at most ±2. Anyway, I will try the other...
Hi, I tried the other two variants (512 and dual) as well. These models also did not result in any significant improvement. So far the best score obtained on DocVQA...
Could you please provide the following details? 1. Which model is used for preprocessing the data (generating memmaps)? Is it the t5-large provided by due-benchmark or the UDOP pretrained model?...
1. I used the T5-Large provided by due-benchmark for preprocessing the data. 2. The recommended transformers version 4.30.0 was giving a `loss does not have a grad function` error. So I...