pytextrank Ignore tokens and enrich the lemma graph

Ignore tokens and enrich the lemma graph

Open Albertobegue opened this issue 5 years ago • 1 comments

trafficstars

Hi everyone!

It is mentioned in the project's description that enriching the lemma graph would improve TextRank's performance. I saw that showing examples of this was in the todo list of the project but I was wondering if it worked by simply adding entities to the doc before summarising? Or is it more complicated? I am particularly interested in adding hyponymy.

And what about ignoring tokens? Some tokens are ignored depending on their POS tag in your implementation. Is it possible to ignore tokens specific to our application by tagging them? With what?

Thanks in advance for your answers!!

And thank you for this project, it is great!

May 12 '20 11:05 Albertobegue

Thank you @Albertobegue we're working toward these capabilities with the kglab integration.

Feb 15 '21 17:02 ceteri

pytextrank pytextrank copied to clipboard

Ignore tokens and enrich the lemma graph

pytextrank
pytextrank copied to clipboard