DeepPavlov Finding Cuss Words with NER

What problem are we trying to solve?

Train the existing BERT model to identify cuss words and, in the future, to handle sarcasm as well.

How can we solve it?

By using word semantics, for example Google's word2vec, to create word vectors that are then used to identify cuss words and to train the model.
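A minimal sketch of the word-vector idea, assuming a small seed list of known cuss words and flagging any token whose vector is close to a seed. The three-dimensional vectors, the seed list, the threshold, and the `CUSS` tag name are all made up for illustration; real vectors would come from a pretrained model such as word2vec, and this is not the DeepPavlov implementation.

```python
import math

# Toy 3-dimensional vectors, invented for illustration only.
# Real vectors would be loaded from a pretrained embedding model.
TOY_VECTORS = {
    "damn": [0.9, 0.1, 0.0],
    "darn": [0.8, 0.2, 0.1],
    "hello": [0.0, 0.9, 0.3],
    "table": [0.1, 0.2, 0.9],
}

# Hypothetical seed lexicon of known cuss words.
SEED_CUSS_WORDS = ["damn"]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def is_cuss_like(word, threshold=0.9):
    """True if the word's vector is close to any seed cuss word."""
    vec = TOY_VECTORS.get(word.lower())
    if vec is None:
        # Out-of-vocabulary words are not flagged in this sketch.
        return False
    return any(cosine(vec, TOY_VECTORS[s]) >= threshold for s in SEED_CUSS_WORDS)


def tag_tokens(tokens, threshold=0.9):
    """Attach a NER-style tag to each token: CUSS or O (outside)."""
    return [(t, "CUSS" if is_cuss_like(t, threshold) else "O") for t in tokens]


print(tag_tokens(["hello", "darn", "table"]))
```

Tags produced this way could serve as weak labels for training a token classifier, though a fixed similarity threshold will miss context-dependent usage.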

Are there other issues that block this solution?

Cuss words are easy to identify in English, but the same word might mean something different in other languages. We need to train a model that takes this into account.
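To make the cross-language problem concrete, here is a small sketch that keys the lexicon by language code, so the same surface form can be flagged in one language and ignored in another. The `LEXICONS` table and the `flag_tokens` helper are hypothetical; the example relies on "fart" being mildly rude in English while it is the ordinary Swedish word for "speed".

```python
# Hypothetical per-language lexicons, for illustration only.
LEXICONS = {
    "en": {"fart"},  # mildly rude in English
    "sv": set(),     # "fart" means "speed" in Swedish, so it is not listed
}


def flag_tokens(tokens, lang):
    """Return the tokens that appear in the lexicon for the given language."""
    lexicon = LEXICONS.get(lang, set())
    return [t for t in tokens if t.lower() in lexicon]


print(flag_tokens(["Fart", "limit"], "en"))  # flagged in English
print(flag_tokens(["full", "fart"], "sv"))   # not flagged in Swedish
```

A trained model would need the language (or enough context to infer it) as part of its input for the same reason.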

Mar 11 '21 13:03 potato-patata

Hey @potato-patata, I would like to work on this issue. Can we handle the translation with the googletrans library and then apply word semantics to train the model?
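The translate-then-check idea could be sketched roughly as below. The translator is passed in as a plain function so that the sketch does not depend on any particular library's API; `fake_translate`, `ENGLISH_LEXICON`, and `find_cuss_words` are hypothetical names, and a real setup would plug in a wrapper around googletrans or another translation service instead.

```python
from typing import Callable, List

# Hypothetical English lexicon, for illustration only.
ENGLISH_LEXICON = {"damn"}


def find_cuss_words(text: str, translate: Callable[[str], str]) -> List[str]:
    """Translate text to English, then return tokens found in the lexicon."""
    english = translate(text)
    # Naive whitespace tokenisation with basic punctuation stripping.
    tokens = [t.strip(".,!?").lower() for t in english.split()]
    return [t for t in tokens if t in ENGLISH_LEXICON]


def fake_translate(text: str) -> str:
    """Stand-in translator with one hard-coded German phrase (an assumption)."""
    table = {"verdammt kalt": "damn cold"}
    return table.get(text.lower(), text)


print(find_cuss_words("Verdammt kalt", fake_translate))
print(find_cuss_words("Hello there!", fake_translate))
```

One caveat with this design: machine translation may soften or drop profanity, so a word that is offensive in the source language might not survive into the English text.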

Mar 18 '21 21:03 rashmiprabhat567

Hey @rashmiprabhat567, we can definitely use it, but we need to find a way to integrate it with the DeepPavlov (dp) embeddings. So it is better to look into the dp embeddings first and then proceed.

Mar 18 '21 21:03 potato-patata

Got it, @potato-patata. I'm looking into this issue and will let you know if I make any progress.

Mar 19 '21 04:03 rashmiprabhat567

Hey @potato-patata! Is this still open to work on?

Apr 07 '21 18:04 rushilpatra

Hi, this issue is not a GSoC task. It is just an enhancement suggestion.

Apr 08 '21 01:04 potato-patata

Moreover, I am not the mentor, so I apologise if I have misled you 😄

Apr 08 '21 01:04 potato-patata

Hi, @potato-patata! Sorry for the late response. We already have a model that detects toxicity in text, and there is also a pull request that adds a new emotion classifier.

Jul 07 '23 12:07 Kolpnick
