yake icon indicating copy to clipboard operation
yake copied to clipboard

Single-document unsupervised keyword extraction

Results 35 yake issues
Sort by recently updated
recently updated
newest added

The range of valid values for the score, what it means, and how it is computed should be stated explicitly in the README.

I was looking if I can port Yake to iOS/MAC. I am novice in python and data science 😿. I am assuming that the main logic written pke->Yake.py. but then...

Uses the same order of keyword & score regardless of value provided for `dedupLim`

I've noticed the following behavior of the `.extract_keywords` function: When using a deduplication threshold (`dedupLim`) lower than 1, the response tuples are of the form `(word, score)`. e.g.: ```python ('non-profit',...

Hi Can you please explain how what are the [leve|jaro|seqm] and how do they work ? Is there some documentation link I could use to know how to choose among...

I see that we cannot change the deduplication parameters when using the API: https://github.com/LIAAD/yake/blob/master/docker/Dockerfiles/yake-server/yake-rest-api.py#L117 So it just uses the defaults: https://github.com/LIAAD/yake/blob/d2fc406c52d08843c2cb511d31d859980a7291e4/yake/yake.py#L14 Could we be able to provide those when invoking...

Estou usando o projeto mas sinto falta de um arquivo que junte todas as stopwords em um pois existe artigos que mistura idiomas.

After reading the paper, I thought that numbers would be discarded when extracting keywords, however when running the keyword extractor on the 20newsgroup dataset, specifically the document 20news_home/20news-bydate-train/misc.forsale/75935.txt The following...

## Summary I recommend changing the README to either: 1. Suggest `pip install yake` instead of `pip install git+https://github.com/LIAAD/yake` 2. Explain that pip installing the git repo is preferred to...

Hi, I found out that when using YAKE for long documents, it can be advantageous to truncate them in advance. We have a test set of theses and dissertations (766...