Balaka Biswas comments

Results: 38 comments of Balaka Biswas

Enable GitHub Actions for code coverage.

@jawsvk, make sure your PR comes in on 1st October, because code coverage is vital before the code-based PRs start pouring in.

Comparison against TF-IDF Vectorizer (from scratch)

> Is it okay if I work on this issue during Hacktoberfest?

Yes, sure @2bit-hack. Just make sure the PR is made after Oct 1. Assigning it to you.

Comparison against TF-IDF Vectorizer (from scratch)

> I have a few questions regarding the implementation:
>
> * Will there be a corpus of documents provided against which to run the TF-IDF algorithm?
>   (Something like...

Comparison against TF-IDF Vectorizer (from scratch)

> TF-IDF is a corpus-based algorithm, whereas RAKE can work on single documents. Like, if I have a collection of documents, I can figure out the TF-IDF scores for...

Comparison against TF-IDF Vectorizer (from scratch)

Oh yes, thanks for pointing that out. Yes, you can change the function to include the corpus. Make sure you include the `max_num` parameter.
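For anyone following the thread, here is a minimal sketch of what a from-scratch TF-IDF keyword extractor that takes a corpus might look like. Only the corpus argument and the `max_num` parameter come from the discussion above; the function name `extract_keywords_tfidf`, the regex tokenizer, and the smoothed IDF formula are assumptions made for illustration, not the repository's actual code.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def extract_keywords_tfidf(document, corpus, max_num=5):
    """Return up to max_num (word, score) pairs for document, ranked by TF-IDF.

    Hypothetical helper: the name and signature are illustrative only.
    """
    doc_tokens = tokenize(document)
    if not doc_tokens:
        return []

    # Document frequency: number of corpus documents containing each word.
    doc_freq = Counter()
    for text in corpus:
        doc_freq.update(set(tokenize(text)))

    n_docs = len(corpus)
    term_counts = Counter(doc_tokens)
    scores = {}
    for word, count in term_counts.items():
        tf = count / len(doc_tokens)
        # Smoothed IDF (an assumption) so unseen words do not divide by zero.
        idf = math.log((1 + n_docs) / (1 + doc_freq[word])) + 1
        scores[word] = tf * idf

    # Highest score first; ties are broken alphabetically for stable output.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:max_num]
```

Calling `extract_keywords_tfidf(text, corpus, max_num=10)` would then return at most ten scored words. Note that this scores single words only, whereas RAKE produces multi-word phrases, so a comparison would need to account for that difference.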

Comparison against TF-IDF Vectorizer (from scratch)

> The code for keyword extraction using TF-IDF goes into `tfidf_vectorizer/extract_keywords_tfidf_scratch.py`. Where do the tests of RAKE vs TF-IDF go? I noticed there's a `tests` folder, does it go there?...

Comparison against TF-IDF Vectorizer (from scratch)

@2bit-hack Basically, you have to toil a bit. In the paper, they marked out keywords manually. Then they ran the algorithms and checked...
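The comment above is cut off, so exactly what was checked is not stated here. One common way to score extracted keywords against a manually marked set is precision, recall, and F1; the sketch below assumes that approach, and the function name `evaluate_keywords` is hypothetical.

```python
def evaluate_keywords(extracted, reference):
    """Compare extracted keywords with a manually marked reference set.

    Assumes exact (case-insensitive) matching; this is an illustrative
    choice, not necessarily the procedure used in the paper.
    """
    extracted_set = {k.lower().strip() for k in extracted}
    reference_set = {k.lower().strip() for k in reference}
    correct = extracted_set & reference_set

    # Guard against empty inputs to avoid division by zero.
    precision = len(correct) / len(extracted_set) if extracted_set else 0.0
    recall = len(correct) / len(reference_set) if reference_set else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}
```

Running this once on the RAKE output and once on the TF-IDF output, with the same manually marked keywords as `reference`, would give comparable numbers for the two algorithms.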

UI design

@johnnthough the black one's better. Also, resolve the branch conflicts.