Geewook Kim
@DYF-AI Yes, during the fine-tuning phase, the vision encoder is trained as well.
Thank you all (@arifoyong, @starchaser49, @dyf102, @sxn2012, @chopin1998, @miomiora) for bringing this issue to our attention, and a special thanks to @ken6078 for the...
Hi @mustaszewski (cc @NielsRogge), apologies for the delayed response. Let me address your question. For documents, a raster scanning order is typically used. However, during the training...
Hello, and thank you for your interest in our research. The synthdog tool we've released is designed to mimic document images with simple text lines rendered in a simple rule-based...