lismin

Results: 3 issues by lismin

Thank you for your work. Could you disclose how the training data was obtained? Was it synthesized or scraped?

https://github.com/microsoft/ArxivFormula

dataset

This is the official way to add:

```
# processing_florence2.py
tokens_to_add = {'additional_special_tokens': \
    tokenizer.additional_special_tokens + \
    ['', '', '', ''] + \
    [f'' for x in range(1000)] + \...
```
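
As a rough illustration of the pattern in that excerpt, the sketch below builds an `additional_special_tokens` dict by concatenating the tokens a tokenizer already has, a few task markers, and 1000 indexed tokens. The token strings in the excerpt did not survive, so `<task_a>`, `<task_b>`, and the `<loc_{x}>` pattern are hypothetical placeholders, and `build_tokens_to_add` is a helper invented for this example. It is not taken from `processing_florence2.py`.

```python
from typing import Dict, List


def build_tokens_to_add(existing: List[str], num_indexed: int = 1000) -> Dict[str, List[str]]:
    """Return a dict shaped like the one in the excerpt (hypothetical helper)."""
    # Hypothetical task markers; the real strings are missing from the excerpt.
    task_tokens = ['<task_a>', '</task_a>', '<task_b>', '</task_b>']
    # Hypothetical indexed tokens, one per value of x in range(num_indexed).
    indexed_tokens = [f'<loc_{x}>' for x in range(num_indexed)]
    # Keep the existing tokens first so nothing already registered is dropped.
    return {'additional_special_tokens': list(existing) + task_tokens + indexed_tokens}


# Stand-in for tokenizer.additional_special_tokens (assumed to be a list of strings).
tokens_to_add = build_tokens_to_add(existing=['<existing_token>'])
print(len(tokens_to_add['additional_special_tokens']))
```

With a Hugging Face tokenizer, a dict of this shape is normally passed to `tokenizer.add_special_tokens(tokens_to_add)`. Whether the rest of the original snippet does exactly that is not visible in the truncated excerpt.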