stylometry
Added some language options (just French) and matrix output
Hey man! Thanks for the library. I forked it because I wanted two extra things that your code did not have:
- Handle the language in the constructor and support a French word list alongside the English one. I thought it would be interesting to use as stopwords the French translations of the default English words (and, but, if, that, etc.). This of course strays from the paper you based the library on.
- Access to the feature matrix that gets created. Maybe this was already implemented, but I could not find it. It is handy that you have your own sklearn methods, but some people may just want access to the feature data matrix.
- I also put the English words in a list and I changed some things related to that...
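
To make the first two points concrete, here is a minimal sketch of the idea, not the code in the fork. The class name `FunctionWordFeatures`, the method `feature_matrix`, and the short word lists are all hypothetical and only stand in for the real ones.

```python
import re

# Hypothetical word lists: the French entries are rough translations of a
# few English defaults, not the lists actually used in the library or fork.
FUNCTION_WORDS = {
    "english": ["and", "but", "if", "that"],
    "french": ["et", "mais", "si", "que"],
}


class FunctionWordFeatures:
    """Illustrative extractor that picks its word list from the constructor."""

    def __init__(self, language="english"):
        if language not in FUNCTION_WORDS:
            raise ValueError("unsupported language: " + language)
        self.language = language
        self.words = FUNCTION_WORDS[language]

    def feature_matrix(self, documents):
        """Return one row per document, one column per function word.

        Each cell is the relative frequency of that word in the document.
        """
        matrix = []
        for text in documents:
            tokens = re.findall(r"\w+", text.lower())
            total = len(tokens) or 1  # avoid dividing by zero on empty text
            matrix.append([tokens.count(w) / total for w in self.words])
        return matrix


docs = ["Et si tu viens, mais que fais-tu ?", "Le chat dort."]
extractor = FunctionWordFeatures(language="french")
matrix = extractor.feature_matrix(docs)
```

The matrix is returned as a plain list of rows, so it could be passed to sklearn or anything else without going through the library's own wrappers.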
Cheers! Pavel