indonlu
The first large-scale natural language processing benchmark for the Indonesian language. We provide multiple downstream tasks, pre-trained IndoBERT models, and starter code. (AACL-IJCNLP 2020)
Hi, I want to ask about the training diary/behaviour. Do you have notes on it? Or at least the final perplexity for each of the BERT types? It...
Fixes # Update LICENSE file ## Proposed Changes - Add copyright for all collaborators
Hi, thank you, IndoNLU team, for making this IndoBERT model. I'm currently working on a thesis that uses IndoBERT for a BertForMultiLabelClassification task. I have successfully run the "finetune_casa.ipynb" provided in the...
## Expected Behavior The tokenizer vocab size and the BERT word embedding layer dimension should be the same ## Actual Behavior The tokenizer vocab size and...
Refactor: - `run_single_task.sh` - `run_all_tasks.sh` - `run_non_pretrained_no_special_token.sh` by: - Create `scripts/config/model/train.yaml` to easily define the model used - Make `scripts/reproducer.py` and `Makefile` for easy ops *Note: only tested this by...
I read the paper and compared it to this website: https://www.indobenchmark.com/leaderboard.html. It seems that the sequence labelling benchmark is not the same. I also tried my own fine-tuning,...
I am trying to upload a submission through the Codalab competition portal. However, every time I try to submit a new submission, it says "Submission upload has been disabled. See the...
Hi @gentaiscool, thank you very much for the work! I am trying to evaluate a model for Indonesian NER, and would like to evaluate it on a manually annotated dataset. I...
So, where can I find the relevant information?