Aflah comments

Results 125 comments of


                                            Aflah

Synonym Replacement Layer - Data Augmentation

@mattdangerw while we figure this layer out I think I could work on a different EDA operation which does not require WordNet so that we can get an API design...

Synonym Replacement Layer - Data Augmentation

@mattdangerw Yup agreed a good synonym dataset would alleviate all these language specific issues. I'll continue sharing anything useful that I find here

BIO/IOB Tagging Text and Vice-a-Versa

@chenmoneygithub Yup that's exactly what I'm proposing. So basically in my experience with annotating span based data the annotators aren't expected to tag a sentence as BIO directly rather they...

BIO/IOB Tagging Text and Vice-a-Versa

In hindsight it can be something else and not necessarily a layer like maybe part of a set of helper functions which we might offer and develop over time however...

BIO/IOB Tagging Text and Vice-a-Versa

@chenmoneygithub One use case for BIO tags is in Named Entity Recognition Models. So the use case is basically a sequence-to-sequence learning setting where each token is tagged according to...

BIO/IOB Tagging Text and Vice-a-Versa

@chenmoneygithub Oh that's nice! So basically I don't think I can speak in general however in my experience with working with annotators and annotating data myself the sentence and spans...

BIO/IOB Tagging Text and Vice-a-Versa

As a side note with the advent of better annotating tools such as [Prodigy](https://prodi.gy/) this is often handled by the tool itself but i still think there would be labs...

BIO/IOB Tagging Text and Vice-a-Versa

@chenmoneygithub You're Welcome! Sure I'll share a colab for the same

BIO/IOB Tagging Text and Vice-a-Versa

@mattdangerw I think we might need that but I'm not totally sure as a lot of this (I feel) is subjective to how the data is being formatted, I've also...

`char_to_token` in `keras_nlp.tokenizers.Tokenizer`

@chenmoneygithub So the API would take in the context, answer and a splitting scheme? and the task would be to find a the matching subarray of the split answer tokens...