unilm
unilm copied to clipboard
Model I am using LayoutLMv3 on Article Regions dataset
Describe Model I am using LayoutLmv3: I downloaded base model, and use the config file based on Publaynet. I training the model on Article Regions dataset,a little object detection dataset based on PubMed Central,but the result is much lower than faster rcnn(bbox map 65 vs 72). So it's there any sensitive parameters to change for small dataset? Or something important about finetuning on other dataset?
@Howardqlz you may start with DiT models for object detection which is easier for training/testing.