DocRED_Bert

A bert baseline for DocRED (https://github.com/thunlp/DocRED)

Use the random undersampling to balance the sample scale of positive and negative (no relation) samples.
Only one relation will be contained in one sample.

Include rel2id.json

Weights in https://github.com/google-research/bert, model and convert in https://github.com/huggingface/pytorch-transformers,

Please use 'convert_feature'.

Function 'convert_feature_multioutput' is used to build datasets as the offical baseline, but it seems that this strategy works badly in bert.

All: Precision:46.336, Recall:78.334, F1-score:58.228

Ignore: Precision:42.544, Recall:76.520, F1-score:54.684

All: Precision:56.772, Recall:70.718, F1-score:62.981

Ignore: Precision:52.836, Recall:68.814, F1-score:59.776

DocRED_Bert
DocRED_Bert copied to clipboard