neuralqa

neuralqa copied to clipboard

Reame
Issues

Revise Scoring and Answer Span Selection Method

Open victordibia opened this issue 5 years ago • 0 comments

[ ] Currently score is a sum of the start and end token probabilities. This might not be optimal.
[ ] Current there is no text token preprocessing (e.g. strip spaces, remove \n etc) which might introduce unexpected behaviour.
[ ] Currently, answer span is selected as highest proabability (start/end). There is opportunity to do better.

Resources

https://github.com/huggingface/transformers/blob/master/src/transformers/pipelines.py#L1395

Jul 26 '20 15:07 victordibia