orkenstein
orkenstein
The [documentation](https://musicbrainz.org/doc/MusicBrainz_Database/Download) states that: >Alternatively, if you are not interested in having a local MusicBrainz website and web service, you can use [mbdata](https://github.com/lalinsky/mbdata) that includes replication without the rest of...
When running on **macOs** the command should be changed from: ```yml "sudo apt install tesseract-ocr -y" ``` to: ```yml "brew install tesseract" ``` Maybe there's a way to detect the...
I'm trying to use `Tokenizer` for ModernBERT like this: ```python model_name = "answerdotai/ModernBERT-large" tokenizer=AutoTokenizer.from_pretrained(model_name) print(tokeniser.model_max_length) # 1000000000000000019884624838656 ``` Which gives `model_max_length == int(1e30)` Shouldn't the `tokenizer` contain a correct max...