character-bert-pretraining
Code for pre-training CharacterBERT models (as well as BERT models).
Hi, I am wondering how I can fine-tune this model on my own dataset?
The training script hangs at [this](https://github.com/helboukkouri/character-bert-pretraining/blob/97160755b685f5970c96c1d6cb1a1c471fef08e9/pretrain_model.py#L420) line and does nothing after that. I looked up this suggestion and tried setting the world size, address, and port before that line, but...
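For anyone hitting the same hang: a minimal sketch of setting those values, assuming the line in question is a `torch.distributed` initialization using the default `env://` method, which reads `MASTER_ADDR`, `MASTER_PORT`, `WORLD_SIZE`, and `RANK` from the environment. The helper name `ensure_distributed_env` and the single-process defaults are hypothetical, not part of this repository.

```python
import os


def ensure_distributed_env(world_size=1, rank=0, addr="127.0.0.1", port=29500, env=None):
    """Fill in the rendezvous variables read by torch.distributed's env:// init.

    Hypothetical helper (not from the repository). Existing values are kept,
    so settings injected by a launcher are not overwritten.
    """
    env = os.environ if env is None else env
    env.setdefault("MASTER_ADDR", addr)       # host of the rank-0 process
    env.setdefault("MASTER_PORT", str(port))  # a free TCP port on that host
    env.setdefault("WORLD_SIZE", str(world_size))  # total number of processes
    env.setdefault("RANK", str(rank))         # index of this process
    keys = ("MASTER_ADDR", "MASTER_PORT", "WORLD_SIZE", "RANK")
    return {key: env[key] for key in keys}


if __name__ == "__main__":
    # Call this before the script initializes the process group.
    print(ensure_distributed_env())
```

If `WORLD_SIZE` is larger than the number of processes actually started, initialization waits for the missing ranks, which can look like a hang, so it is worth checking that value first.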
Hi, do you know how I can download Wikipedia data for all languages?
The download links are no longer available. Changing the paths to "https://dumps.wikimedia.org/enwiki/latest/" may prevent this issue from happening again, but because "latest" points to a different dump over time, results may no longer be exactly reproducible.
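A minimal sketch of building dump URLs per language, assuming the standard Wikimedia layout `https://dumps.wikimedia.org/<db>/<snapshot>/<db>-<snapshot>-pages-articles.xml.bz2`. The function name `wikipedia_dump_url` is hypothetical, the mapping of hyphens to underscores in the database name is an assumption, and the language codes in the usage example are only illustrative.

```python
def wikipedia_dump_url(lang, snapshot="latest"):
    """Return the pages-articles dump URL for one Wikipedia language edition.

    Hypothetical helper (not from the repository). `snapshot` is either
    "latest" or a dump date in YYYYMMDD form; a dated snapshot is
    reproducible only for as long as Wikimedia keeps that dump online.
    """
    # Assumption: dump database names use underscores where language codes use hyphens.
    db = lang.replace("-", "_") + "wiki"
    return f"https://dumps.wikimedia.org/{db}/{snapshot}/{db}-{snapshot}-pages-articles.xml.bz2"


if __name__ == "__main__":
    # Illustrative language codes; extend the list as needed.
    for code in ["en", "fr", "de"]:
        print(wikipedia_dump_url(code))
```

Recording the snapshot actually downloaded alongside the trained model keeps track of which data was used, even when the URL itself points to "latest".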