lismin issues

Repositories
Issues
Comments

Results 3 issues of


                                            lismin

Training data

Thank you for your work. Can you disclose how the training data was obtained, whether it was synthesized or scraped?

latex detection dataset

https://github.com/microsoft/ArxivFormula

dataset

Florence-2: How to add custom tokens during fine-tuning training?

This is the official way to add: ``` # processing_florence2.py tokens_to_add = {'additional_special_tokens': \ tokenizer.additional_special_tokens + \ ['', '', '', ''] + \ [f'' for x in range(1000)] + \...