
Added train_model.py and made the necessary modifications

Open menkotoglou opened this issue 4 years ago • 4 comments

  1. Reverse engineered the training script (a minimal sketch of what it could look like follows this list)
  2. Updated the scikit-learn version and included it in setup.py and requirements.txt
  3. Retrained the model on the training data with the updated scikit-learn version from (2)
  4. Dropped support for Python 3.5 (see the Travis configuration), because it is no longer compatible with the current scikit-learn version
  5. Fixed a test, as one test case now falls below the threshold

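For reference, here is a minimal sketch of what such a training script could look like, assuming the usual profanity-check setup (a CountVectorizer feeding a calibrated LinearSVC). The dataset path, column names, and hyperparameters below are assumptions for illustration, not the exact values from this PR:

```python
# Sketch of a possible train_model.py; not the exact script from this PR.
# Assumes a CSV with "text" and "is_offensive" columns (an assumption).
import joblib
import pandas as pd
from sklearn.calibration import CalibratedClassifierCV
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.svm import LinearSVC

# Load the labelled training data.
data = pd.read_csv("clean_data.csv")
texts = data["text"].astype(str)
labels = data["is_offensive"]

# Bag-of-words features over the raw text.
vectorizer = CountVectorizer()
X = vectorizer.fit_transform(texts)

# Linear SVM wrapped in calibration so probability estimates are available.
model = CalibratedClassifierCV(LinearSVC(dual=False), cv=3)
model.fit(X, labels)

# Persist both artifacts; the library loads them at import time.
joblib.dump(vectorizer, "vectorizer.joblib")
joblib.dump(model, "model.joblib")
```
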
We tested these changes on a private dataset with the following results:

Before:

| Actual \ Predicted | Not Profane (0) | Profane (1) |
| --- | --- | --- |
| Not Profane (0) | 703 | 14 |
| Profane (1) | 93 | 39 |

Accuracy Score: 87.4%

After:

| Actual \ Predicted | Not Profane (0) | Profane (1) |
| --- | --- | --- |
| Not Profane (0) | 697 | 20 |
| Profane (1) | 87 | 45 |

Accuracy Score: 87.4%
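
As a sanity check, the accuracy scores follow directly from the confusion matrices above, since correct predictions sit on the diagonal:

```python
import numpy as np

# Confusion matrices from the tables above (rows = actual, columns = predicted).
before = np.array([[703, 14], [93, 39]])
after = np.array([[697, 20], [87, 45]])

for name, cm in (("Before", before), ("After", after)):
    accuracy = cm.trace() / cm.sum()   # diagonal (correct) over all predictions
    print(f"{name}: {accuracy:.1%}")   # both print 87.4%
```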

Used in production as committed here: https://github.com/dimitrismistriotis/profanity-check

menkotoglou · Jun 30 '20

@koti How do I use this build?

ieshaan12 · Sep 01 '20

> @koti How do I use this build?

By referencing the other repository. For pip with a requirements.txt, use the following line instead of "profanity-check":

    -e git+https://github.com/dimitrismistriotis/profanity-check.git#egg=profanity-check
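
After installing from the fork, usage stays the same as the original package; a quick example, assuming the fork keeps the original module name and API:

```python
from profanity_check import predict, predict_prob

texts = ["have a nice day", "go away, you idiot"]
print(predict(texts))        # array of 0/1 labels, one per input string
print(predict_prob(texts))   # array of probabilities that each string is profane
```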

Also check this issue here: if @vzhou842 accepts it, you can switch back to profanity-check.

dimitrismistriotis · Sep 01 '20

@dimitrismistriotis Thanks! I was wondering if we could implement a function that censors content, like the profanity package?

ieshaan12 · Sep 01 '20

> @dimitrismistriotis Thanks! I was wondering if we could implement a function that censors content, like the profanity package?

"Censor" is a very broad concept. I also didn't get the "like the profanity package" part: profanity-check detects profanity, it does not censor it.
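
For illustration only, a naive word-level censor could be layered on top of predict; this helper is hypothetical and not part of either package:

```python
from profanity_check import predict

def censor(text: str, mask: str = "****") -> str:
    """Hypothetical helper: mask each word the classifier flags as profane."""
    words = text.split()
    flags = predict(words)  # one 0/1 label per word
    return " ".join(mask if flag else word for word, flag in zip(words, flags))
```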

dimitrismistriotis · Sep 15 '20