KeyphraseVectorizers icon indicating copy to clipboard operation
KeyphraseVectorizers copied to clipboard

Cannot use this for japanese text

Open MARUD84 opened this issue 2 years ago • 0 comments

Dear Tim Schopf, Thank you for your effort in creating this library. It has been really useful for extracting key phrases for English, German and Chinese text, but i am having trouble applying it to Japanese text. As suggested i have changed the spacy_pipeline to ja_core_news_sm and have a list of Japanese stop words, but i am getting the following error: ValueError: Empty keyphrases. Perhaps the documents do not contain keyphrases that match the 'pos_pattern' parameter or only contain stop words. Do you happen to know why this might be?

MARUD84 avatar Nov 17 '22 07:11 MARUD84