conlleval
Why do conlleval.py and connlleval_perl.py give different results?
My output file (token, gold tag, predicted tag) looks like this:
a B-LOC B-LOC
b I-LOC E-LOC
c E-LOC S-LOC
The two scripts report different results. conlleval.py prints:
processed 3 tokens with 1 phrases; found: 2 phrases; correct: 0.
accuracy: 33.33%; (non-O)
accuracy: 33.33%; precision: 0.00%; recall: 0.00%; FB1: 0.00
LOC: precision: 0.00%; recall: 0.00%; FB1: 0.00 2
connlleval_perl.py prints:
processed 3 tokens with 1 phrases; found: 1 phrases; correct: 1.
accuracy: 33.33%; precision: 100.00%; recall: 100.00%; FB1: 100.00
LOC: precision: 100.00%; recall: 100.00%; FB1: 100.00 1
Which one is reliable?
As far as I can see, conlleval.py is the correct one here. Did you manage to find out what is causing the difference?
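One possible explanation, which this thread does not confirm, is that the two scripts read the E- and S- prefixes differently. The sketch below is not taken from either script. `extract_chunks` and its `iobes_aware` flag are hypothetical names used only for illustration. With strict IOBES boundaries, the predicted column splits into two phrases and neither matches the gold phrase. If E- and S- are read as plain continuation tags, the predicted column forms one phrase that matches.

```python
def extract_chunks(tags, iobes_aware=True):
    """Return (start, end, label) spans, end inclusive.

    Hypothetical helper for illustration only, not code from either script.
    iobes_aware=True  : E- closes a chunk, S- is a single-token chunk.
    iobes_aware=False : E- and S- are treated like I- (assumed lenient reading).
    """
    chunks = []
    start, label = None, None
    for i, tag in enumerate(tags):
        prefix, _, cur = tag.partition("-")
        if not iobes_aware and prefix in ("E", "S"):
            prefix = "I"
        # A token opens a chunk on B-/S-, or on I-/E- whose label
        # differs from the chunk that is currently open.
        opens = prefix in ("B", "S") or (prefix in ("I", "E") and cur != label)
        # Close the open chunk before a new one starts or on an O tag.
        if start is not None and (opens or prefix == "O"):
            chunks.append((start, i - 1, label))
            start, label = None, None
        if opens:
            start, label = i, cur
        # E- and S- also close the chunk on the current token.
        if start is not None and prefix in ("E", "S"):
            chunks.append((start, i, label))
            start, label = None, None
    if start is not None:
        chunks.append((start, len(tags) - 1, label))
    return chunks


gold = ["B-LOC", "I-LOC", "E-LOC"]
pred = ["B-LOC", "E-LOC", "S-LOC"]

for aware in (True, False):
    g = extract_chunks(gold, iobes_aware=aware)
    p = extract_chunks(pred, iobes_aware=aware)
    correct = len(set(g) & set(p))
    print(f"iobes_aware={aware}: gold={len(g)} found={len(p)} correct={correct}")
```

Under these assumptions the strict reading gives 1 gold phrase, 2 found and 0 correct, and the lenient reading gives 1 gold phrase, 1 found and 1 correct. Those counts match the two outputs quoted above, but that agreement alone does not prove this is what either script does.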
I guess I did not implement the evaluation metrics consistently. I will look into this later. Sorry for any inconvenience.
Has the problem in this issue been resolved?