yake
Single-document unsupervised keyword extraction
The range of valid values for the score, what it means, and how it is computed should be stated explicitly in the README.
I was checking whether I can port Yake to iOS/macOS. I am a novice in Python and data science 😿. I am assuming that the main logic is written in pke->Yake.py, but then...
Uses the same order of keyword & score regardless of value provided for `dedupLim`
I've noticed the following behavior of the `.extract_keywords` function: When using a deduplication threshold (`dedupLim`) lower than 1, the response tuples are of the form `(word, score)`. e.g.: ```python ('non-profit',...
Hi, can you please explain what [leve|jaro|seqm] are and how they work? Is there some documentation link I could use to know how to choose among...
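For readers unfamiliar with similarity-based deduplication in general, here is a minimal sketch of the idea in plain Python. It is not YAKE's code: the function names `levenshtein_distance`, `levenshtein_similarity` and `dedup` are hypothetical, and the `threshold` argument only plays a role similar to a deduplication limit such as `dedupLim`. It assumes a candidate is dropped when its similarity to an already kept candidate exceeds the threshold.

```python
def levenshtein_distance(a: str, b: str) -> int:
    """Edit distance between two strings (insert, delete, substitute)."""
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def levenshtein_similarity(a: str, b: str) -> float:
    """Similarity in [0, 1]; 1.0 means the strings are identical."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein_distance(a, b) / longest


def dedup(candidates, threshold):
    """Keep a candidate only if it is not too similar to any kept one.

    Assumption for this sketch: candidates arrive ordered from best to
    worst, so the first of two near-duplicates wins.
    """
    kept = []
    for cand in candidates:
        if all(levenshtein_similarity(cand, k) <= threshold for k in kept):
            kept.append(cand)
    return kept
```

With a threshold of 1.0 nothing is ever dropped in this sketch, since no similarity can exceed 1.0; lower thresholds remove progressively more near-duplicates.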
I see that we cannot change the deduplication parameters when using the API: https://github.com/LIAAD/yake/blob/master/docker/Dockerfiles/yake-server/yake-rest-api.py#L117 So it just uses the defaults: https://github.com/LIAAD/yake/blob/d2fc406c52d08843c2cb511d31d859980a7291e4/yake/yake.py#L14 Could we provide those when invoking...
I am using the project, but I miss having a single file that combines all the stopwords, since some articles mix languages.
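One possible workaround, sketched under assumptions: build the combined stopword set yourself from the per-language lists. The helper `merge_stopwords` below is hypothetical and works on any iterables of words; how the per-language lists are loaded is left open.

```python
def merge_stopwords(*stopword_lists):
    """Union several stopword collections into one lowercase set.

    Blank entries are skipped and surrounding whitespace is stripped,
    so lines read straight from text files can be passed in.
    """
    merged = set()
    for words in stopword_lists:
        merged.update(w.strip().lower() for w in words if w.strip())
    return merged


# Tiny illustrative lists; real ones would come from per-language files.
english = ["the", "a", "of"]
portuguese = ["o", "a", "de"]
combined = merge_stopwords(english, portuguese)
```

Whether the extractor accepts a custom stopword collection is an assumption to check against the library's documentation before relying on this.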
After reading the paper, I thought that numbers would be discarded when extracting keywords, however when running the keyword extractor on the 20newsgroup dataset, specifically the document 20news_home/20news-bydate-train/misc.forsale/75935.txt The following...
## Summary I recommend changing the README to either: 1. Suggest `pip install yake` instead of `pip install git+https://github.com/LIAAD/yake` 2. Explain that pip installing the git repo is preferred to...
Hi, I found out that when using YAKE for long documents, it can be advantageous to truncate them in advance. We have a test set of theses and dissertations (766...
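A minimal sketch of such a truncation step, assuming truncation by whitespace-separated word count (the issue text shown here does not say which unit or limit was used). The helper name `truncate_words` and the limit in the example are hypothetical.

```python
def truncate_words(text: str, max_words: int) -> str:
    """Return at most the first `max_words` whitespace-separated words.

    Note: runs of whitespace (including newlines) are collapsed to
    single spaces as a side effect of splitting and re-joining.
    """
    if max_words < 0:
        raise ValueError("max_words must be non-negative")
    words = text.split()
    return " ".join(words[:max_words])


# Example: keep only the opening part of a long document before
# handing it to a keyword extractor.
document = "Keyword extraction   on very long\ndocuments can be slow"
shortened = truncate_words(document, 4)
```

The shortened text can then be passed to the extractor in place of the full document; a suitable limit would have to be found empirically for a given collection.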