indonlu icon indicating copy to clipboard operation
indonlu copied to clipboard

The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020)

Results 9 indonlu issues
Sort by recently updated
recently updated
newest added

hi, I wanna ask about the training dairy/behaviour. do you have notes on it? or at least the number of the final proplexity for each of the bert types. It...

Fixes # Update LICENSE file ## Proposed Changes - Add copyright for all collaborators

Hi, thank you IndoNLU team for making this indobert model. I'm currently working on thesis with this IndoBERT for BertForMultiLabelClassification Task. I have successfully run the "finetune_casa.ipynb" provided in the...

## Expected Behavior The length of tokenizer vocab size and the BERT's word embedding layer dimension should be the same ## Actual Behavior The length of tokenizer vocab size and...

Refactor: - `run_single_task.sh` - `run_all_tasks.sh` - `run_non_pretrained_no_special_token.sh` by: - Create `scripts/config/model/train.yaml` to easily define the model used - Make `scripts/reproducer.py` and `Makefile` for easy ops *Note: only tested this by...

documentation

I read the paper and compare it to this website : https://www.indobenchmark.com/leaderboard.html . It seems that the sequence labelling benchmark is not the same. I also tried my own fine-tuning,...

I am trying to upload a submission through the Codalab competition portal. However, everytime i tried to submit a new submission, it said "Submission upload has been disabled. See the...

Hi @gentaiscool, thank you very much for the works! I am trying to evaluate a model for Indonesian NER, and would want to evaluate on a manually annotated dataset. I...

So, where to find a relevant information?