Soham Kar

Results 8 comments of Soham Kar

Is it okay if I work on this issue during Hacktoberfest?

I have a few questions regarding the implementation: - Will there be a corpus of documents provided against which to run the TF-IDF algorithm? (Something like `get_keywords(document, documents, stopwords)`?) -...

tf-idf is a corpus based algorithm whereas RAKE can work on single documents. Like, if I have a collection of documents, I can figure out the tf-idf scores for words...

Oh, okay. So do I change the function signature to be something like `def keywords(docs, selected_doc, stopwords)`?

The code for keyword extraction using TF-IDF goes into `tfidf_vectorizer/extract_keywords_tfidf_scratch.py`. Where do the tests of RAKE vs TF-IDF go? I noticed there's a `tests` folder, does it go there?

So far, I've implemented tf-idf and written a __main.py__ file for testing its usage. I've also created a file in `tests` which runs both RAKE and tf-idf against a document...

@BALaka-18 I've sent a PR, could you please take a look?