user1018
user1018
Thanks for your detailed explanation, and the training GPUs of DocScanner is NVIDIA RTX 2080 Ti GPUs and NVIDIA GTX 1080 Ti GPU, which one is used in DocTr?
When writing the training code,I have some confusion. 1)Before training the GeoTr module, the background needs to be removed. Is it handled by the pre-trained model of the Segmentation module?...
Is there any reference code? And what does GT masks represent in the doc3d dataset? 
@fh2019ustc I've written the training code, but the model does not converge. I'vd send the code to your email([email protected]), could you look at the code?Thanks very much.