TextRank4ZH issues

在我的项目中引用了您的工作

这是我的repo中引用了您的项目的部分：https://github.com/PolarisRisingWar/text_summarization_chinese/tree/master/models/textrank 我的项目是希望集合一些经典文本摘要模型在中文文本数据上的解决方案，所以引用了您的工作。如果有引用有误或侵犯版权的情况请指出。

PolarisRisingWar

Fix un-closed file when opening stop_words_file.

Warning detail: ResourceWarning: unclosed file for word in codecs.open(self.stop_words_file, 'r', 'utf-8', 'ignore'): What I have done: Simply called close to stop_words_file.

RUI-LONG

提高词的权重

文章里有几个词不经常出现，但只要出现一次往往就是关键词，请问有没有办法提高这些词的权重？

suckjiba

請問是否能套用 word2Vec?

個人理解，目前的作法為 BoW 還是我只要把 `get_similarity` 改寫即可?

frankShih

是否能夠帶入 supervised 資訊，進而影響 ranking?

您好我所處理的文章，通常在正式的內容中還會夾雜一些與文章主題本身相關性較小的"廢文" 主要是用來吸引讀者，確保他們能夠看到最後然而，這樣的文章直接套您的工具的話，會導致一些無關緊要的字句排到很前面因此想請教一下，是否能狗透過一些手段，提供 model 一點 guide 達到類似於 semi-supervised 的效果?

frankShih

AttributeError

2

我用的networkx-3.1 如果出现错误：AttributeError: module 'networkx' has no attribute 'from_numpy_matrix'，可以按如下方法解决问题：将utils中的nx_graph = nx.from_numpy_matrix(graph)，改为nx_graph = nx.from_numpy_array(graph)

ZaiYu411

**错误如下:** AttributeError: module 'networkx' has no attribute 'from_numpy_matrix' ![image](https://github.com/letiantian/TextRank4ZH/assets/99784648/6a6eba64-6d21-4070-ac1d-fbec6afa4c86) **高版本python环境难以降级networkx，所以使用nx.from_numpy_array 代替 from_numpy_matrix** 修复后效果: ![image](https://github.com/letiantian/TextRank4ZH/assets/99784648/6f4b715d-e12c-4586-92ea-93dbf2817a5e)

Gracdjd

TextRank4ZH
TextRank4ZH copied to clipboard

Metadata

在我的项目中引用了您的工作

Fix un-closed file when opening stop_words_file.

如何训练

textrank4zh每次抽取的关键短语不一样

提高词的权重

jieba分词

請問是否能套用 word2Vec?

是否能夠帶入 supervised 資訊，進而影響 ranking?

AttributeError

高版本的python环境无法运行此代码

← Metadata

Owner

Metadata

TextRank4ZH TextRank4ZH copied to clipboard

Metadata

← Metadata

Owner

Metadata

TextRank4ZH
TextRank4ZH copied to clipboard