CorpusLoaders.jl

Adding GMB dataset

Open tejasvaidhyadev opened this issue 4 years ago • 3 comments

Adding the GMB dataset. The dataset is an extract from the GMB (Groningen Meaning Bank) corpus. It is tagged and annotated specifically for training classifiers to predict named entities such as names, locations, etc.

tejasvaidhyadev, Mar 07 '20 23:03

Thank you. I will implement the suggested changes (including docs and tests) soon.

tejasvaidhyadev, Mar 12 '20 19:03

Hi @oxinabox, I added some test sets, taking examples from the other datasets. I don't know much about testing and I am still learning. Let me know what other tests can be added.

tejasvaidhyadev, Mar 13 '20 22:03

Hi @oxinabox, for now I have added only the POS tags of GMB, as my project only needs POS tags. I will also implement the NER tags soon. Thanks.

tejasvaidhyadev, Mar 14 '20 15:03