soft404
Building a classifier from scratch
The current training dataset is too big to put in the repo or to host on S3 indefinitely. It was created with a crawler that is in the repo, but it would still be nice to have a way to re-train the classifier from scratch. See the discussion in #3.