Aflah
@mattdangerw While we figure this layer out, I think I could work on a different EDA operation that does not require WordNet so that we can get an API design...
@mattdangerw Yup, agreed: a good synonym dataset would alleviate all these language-specific issues. I'll continue sharing anything useful that I find here.
@chenmoneygithub Yup, that's exactly what I'm proposing. Basically, in my experience with annotating span-based data, the annotators aren't expected to tag a sentence as BIO directly; rather, they...
In hindsight, it could be something other than a layer, maybe part of a set of helper functions that we offer and develop over time; however...
@chenmoneygithub One use case for BIO tags is in Named Entity Recognition models. The use case is basically a sequence-to-sequence learning setting where each token is tagged according to...
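To illustrate the per-token tagging described here, the following is a minimal sketch of converting labeled token spans into BIO tags. The function name `spans_to_bio`, the `(start, end, label)` span format with an exclusive end index, and the `PER`/`LOC` labels are all assumptions made for this example; none of them come from the thread or from an existing KerasNLP API.

```python
def spans_to_bio(num_tokens, spans):
    """Convert token-level spans into a list of BIO tags.

    `spans` is a list of (start, end, label) tuples over token indices,
    with `end` exclusive. This span format is an assumption for the sketch.
    """
    # Every token starts as "O" (outside any entity).
    tags = ["O"] * num_tokens
    for start, end, label in spans:
        # The first token of a span gets the "B-" (begin) prefix.
        tags[start] = f"B-{label}"
        # Remaining tokens in the span get the "I-" (inside) prefix.
        for i in range(start + 1, end):
            tags[i] = f"I-{label}"
    return tags


tokens = "Barack Obama visited Paris".split()
tags = spans_to_bio(len(tokens), [(0, 2, "PER"), (3, 4, "LOC")])
print(list(zip(tokens, tags)))
```

The sketch assumes spans do not overlap; overlapping spans would need an explicit policy (for example, rejecting them or keeping the first one).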
@chenmoneygithub Oh, that's nice! I don't think I can speak in general; however, in my experience working with annotators and annotating data myself, the sentence and spans...
As a side note, with the advent of better annotation tools such as [Prodigy](https://prodi.gy/), this is often handled by the tool itself, but I still think there would be labs...
@chenmoneygithub You're welcome! Sure, I'll share a Colab for the same.
@mattdangerw I think we might need that, but I'm not totally sure, as a lot of this (I feel) depends on how the data is being formatted. I've also...
@chenmoneygithub So the API would take in the context, the answer, and a splitting scheme? And the task would be to find the matching subarray of the split answer tokens...
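As a rough sketch of the idea in the question above, the snippet below splits both the context and the answer with the same splitting scheme and searches for the answer tokens as a contiguous subarray of the context tokens. The name `find_answer_span`, the `split_fn` parameter, and the choice to return the first match as a `(start, end)` pair with an exclusive end are assumptions for illustration, not a proposed or existing API.

```python
def find_answer_span(context, answer, split_fn=str.split):
    """Return (start, end) token indices of the first occurrence of the
    answer tokens inside the context tokens, or None if there is no match.

    `split_fn` stands in for the "splitting scheme"; whitespace splitting
    is only a default assumed for this sketch.
    """
    context_tokens = split_fn(context)
    answer_tokens = split_fn(answer)
    n, m = len(context_tokens), len(answer_tokens)
    if m == 0:
        return None
    # Slide a window of length m over the context and compare token lists.
    for start in range(n - m + 1):
        if context_tokens[start:start + m] == answer_tokens:
            return start, start + m
    return None


span = find_answer_span("The Eiffel Tower is in Paris", "Eiffel Tower")
print(span)  # (1, 3)
```

Exact token matching like this is sensitive to punctuation and casing, so a real splitting scheme would likely need to normalize both inputs the same way.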