KeyBERT
Minimal keyword extraction with BERT
Refer: [YAKE: Usage (Command line)](https://github.com/LIAAD/yake?tab=readme-ov-file#usage-command-line)
When considering [Maximal Marginal Relevance (MMR)](https://github.com/MaartenGr/KeyBERT?tab=readme-ov-file#24-maximal-marginal-relevance) to diversify the results, it turns out that the order of the keywords or keyphrases is based on the cosine similarity scores instead of the MMR....
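To illustrate why ordering by MMR can differ from ordering by cosine similarity, here is a minimal pure-Python sketch of greedy MMR selection. It is not KeyBERT's implementation: the function names `cosine` and `mmr`, the toy two-dimensional vectors, and the candidate phrases are all assumptions made up for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def mmr(doc_vec, cand_vecs, candidates, top_n=3, diversity=0.5):
    """Greedy MMR: return (candidate, doc_similarity) pairs in selection order."""
    doc_sim = [cosine(doc_vec, v) for v in cand_vecs]
    # Start with the candidate most similar to the document.
    selected = [max(range(len(candidates)), key=lambda i: doc_sim[i])]
    remaining = [i for i in range(len(candidates)) if i not in selected]

    while remaining and len(selected) < top_n:
        def score(i):
            # Penalise similarity to anything already selected.
            redundancy = max(cosine(cand_vecs[i], cand_vecs[j]) for j in selected)
            return (1 - diversity) * doc_sim[i] - diversity * redundancy

        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)

    return [(candidates[i], doc_sim[i]) for i in selected]


# Hypothetical toy data: the first two candidates are near-duplicates.
doc = [1.0, 0.0]
phrases = ["keyword extraction", "keyphrase extraction", "sentence embeddings"]
vectors = [[1.0, 0.1], [1.0, 0.2], [0.6, -0.8]]

mmr_order = [p for p, _ in mmr(doc, vectors, phrases, top_n=3, diversity=0.7)]
cosine_order = [p for p, _ in sorted(
    mmr(doc, vectors, phrases, top_n=3, diversity=0.7),
    key=lambda pair: pair[1],
    reverse=True,
)]
```

With this toy data, `mmr_order` places the near-duplicate phrase last, while `cosine_order` (the same items re-sorted by their document similarity) places it second. Re-sorting the selected items by cosine score, as in the second list, is the kind of step that would produce the ordering the issue describes.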
This is also in preparation for replacing flake8 and black with ruff (https://github.com/MaartenGr/KeyBERT/issues/232). There are a few issues with the code base regarding its own pre-commit config. `black` and the...
Hi, thank you again for your great tool. How can I limit the maximum number of tokens?
Would you like to change `black` and `flake8` to ruff-format and ruff-lint in your pre-commit-config.yaml? See https://github.com/MaartenGr/BERTopic/pull/2033
Just my reciprocal contribution to a great library. I appreciate you creating this...here's my contribution: https://github.com/BBC-Esq/KeyBERT_GUI
Is it possible to load a local embedding model instead of downloading one from Hugging Face?
I have been playing with KeyBERT and KeyLLM for a while now, and here is something I would like to achieve. If I have a text "CO2 emissions are high these...
I'm using jieba for tokenization for my Chinese documents, as suggested here in the issues and in the documentation. It also says in the documentation that if I use a...
I use spaCy's transformer model for other purposes (such as NER), so re-using the same model made sense. It looks like spaCy made some tweaks to their syntax which are breaking...