Soham Kar comments

Results: 8 comments by Soham Kar

Comparison against TF-IDF Vectorizer (from scratch)

Is it okay if I work on this issue during Hacktoberfest?

Comparison against TF-IDF Vectorizer (from scratch)

I have a few questions regarding the implementation:
- Will there be a corpus of documents provided against which to run the TF-IDF algorithm? (Something like `get_keywords(document, documents, stopwords)`?)
-...

Comparison against TF-IDF Vectorizer (from scratch)

TF-IDF is a corpus-based algorithm, whereas RAKE can work on single documents. For example, if I have a collection of documents, I can figure out the TF-IDF scores for words...
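For reference, the corpus dependence shows up directly in the usual definition. The formula below is one common unsmoothed variant, not necessarily the one used in this project; the symbols $t$ (term), $d$ (document), $D$ (corpus), and $N = |D|$ are notation assumed here for illustration:

$$
\mathrm{tfidf}(t, d, D) = \underbrace{\frac{\mathrm{count}(t, d)}{|d|}}_{\mathrm{tf}(t,\,d)} \cdot \underbrace{\log \frac{N}{|\{d' \in D : t \in d'\}|}}_{\mathrm{idf}(t,\,D)}
$$

The idf factor cannot be computed from a single document, which is why TF-IDF needs the whole collection while RAKE does not.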

Comparison against TF-IDF Vectorizer (from scratch)

Oh, okay. So do I change the function signature to be something like `def keywords(docs, selected_doc, stopwords)`?
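A minimal sketch of what a function with that signature might look like, written from scratch with only the standard library. It is an illustration under assumptions, not the code from the PR: `selected_doc` is assumed to be an index into `docs`, and the `tokenize` helper and the `top_n` parameter are hypothetical additions. The scoring uses plain term frequency times unsmoothed `log(N / df)`.

```python
import math
import re
from collections import Counter


def tokenize(text, stopwords):
    """Lowercase the text, split on non-letters, and drop stopwords (hypothetical helper)."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in stopwords]


def keywords(docs, selected_doc, stopwords, top_n=5):
    """Rank the words of docs[selected_doc] by TF-IDF against the whole corpus.

    Assumes selected_doc is an integer index into docs; top_n is an
    illustrative parameter that is not part of the signature discussed above.
    """
    tokenized = [tokenize(d, stopwords) for d in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents does each word appear?
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    target = tokenized[selected_doc]
    if not target:
        return []

    counts = Counter(target)
    scores = {
        word: (count / len(target)) * math.log(n_docs / df[word])
        for word, count in counts.items()
    }
    # Highest score first; ties are broken alphabetically for stable output.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]


docs = [
    "the cat sat on the mat",
    "the dog sat on the log",
    "the cat chased the dog",
]
result = keywords(docs, 0, stopwords={"the", "on"})
print([word for word, _ in result])
```

With this unsmoothed idf, a word that occurs in every document scores exactly zero, so a real implementation may prefer a smoothed variant.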

Comparison against TF-IDF Vectorizer (from scratch)

Alright, thank you so much!

Comparison against TF-IDF Vectorizer (from scratch)

The code for keyword extraction using TF-IDF goes into `tfidf_vectorizer/extract_keywords_tfidf_scratch.py`. Where do the tests of RAKE vs TF-IDF go? I noticed there's a `tests` folder; do they go there?

Comparison against TF-IDF Vectorizer (from scratch)

So far, I've implemented tf-idf and written a `__main__.py` file for testing its usage. I've also created a file in `tests` which runs both RAKE and tf-idf against a document...

Comparison against TF-IDF Vectorizer (from scratch)

@BALaka-18 I've sent a PR, could you please take a look?