Soham Kar
Is it okay if I work on this issue during Hacktoberfest?
I have a few questions regarding the implementation:
- Will there be a corpus of documents provided against which to run the TF-IDF algorithm? (Something like `get_keywords(document, documents, stopwords)`?)
-...
tf-idf is a corpus-based algorithm, whereas RAKE can work on single documents. For example, if I have a collection of documents, I can figure out the tf-idf scores for words...
Oh, okay. So do I change the function signature to be something like `def keywords(docs, selected_doc, stopwords)`?
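For reference, here is a rough sketch of what a corpus-based function with that signature might look like. It is only an illustration under a few assumptions that are not settled in this thread: `selected_doc` is taken to be an index into `docs`, tokenization is a plain lowercase whitespace split, the `top_n` parameter is hypothetical, and the weighting is the basic `tf * log(N / df)` form without smoothing.

```python
import math
from collections import Counter


def keywords(docs, selected_doc, stopwords, top_n=5):
    """Rank the words of docs[selected_doc] by TF-IDF computed over docs.

    Assumptions (not from the thread): selected_doc is an index into docs,
    and top_n is a hypothetical convenience parameter.
    """
    # Naive tokenization: lowercase, split on whitespace, drop stopwords.
    tokenized = [
        [word for word in doc.lower().split() if word not in stopwords]
        for doc in docs
    ]
    n_docs = len(tokenized)

    # Document frequency: the number of documents each word appears in.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    target = tokenized[selected_doc]
    if not target:
        return []

    # Term frequency within the selected document, scaled by IDF.
    term_counts = Counter(target)
    scores = {
        word: (count / len(target)) * math.log(n_docs / doc_freq[word])
        for word, count in term_counts.items()
    }

    # Highest score first; ties are broken alphabetically for stable output.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))[:top_n]


docs = ["the cat sat on the mat", "the dog sat on the log", "cats and dogs"]
print(keywords(docs, 0, stopwords={"the", "on", "and"}))
```

With this weighting, a word that occurs in every document scores zero, which is why the scores only make sense relative to a corpus and not for a single document in isolation.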
Alright, thank you so much!
The code for keyword extraction using TF-IDF goes into `tfidf_vectorizer/extract_keywords_tfidf_scratch.py`. Where do the tests of RAKE vs TF-IDF go? I noticed there's a `tests` folder, does it go there?
So far, I've implemented tf-idf and written a __main.py__ file for testing its usage. I've also created a file in `tests` which runs both RAKE and tf-idf against a document...
@BALaka-18 I've sent a PR, could you please take a look?