indonlu
indonlu copied to clipboard
Benchmark table is not consistent (am I wrong?)
I read the paper and compare it to this website : https://www.indobenchmark.com/leaderboard.html . It seems that the sequence labelling benchmark is not the same. I also tried my own fine-tuning, and the result is closer to the one on the paper rather than that in the website. is there any explanation regarding this problem?
Thank you for creating the issue. The numbers on the website is outdated. You can refer to the paper for the more updated results.
It seems the result on the web is better than that on the paper. If the one on the paper is the latest, how come the performance is decreased?