entity-fishing
Support acronyms
Detect acronyms introduced (explicitly or not) in a document, and maintain them as possible mentions in the current document.
Example: acronyms are frequent in species names (C. lupus, C. n. gregoryi) and in phrases such as "Cigarette smoke (CS)-induced".
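To make the request concrete, here is a minimal Python sketch of the explicit case only, where the acronym follows its long form in parentheses. It is an illustration under stated assumptions, not the entity-fishing implementation: the helper names `find_explicit_acronyms` and `find_acronym_mentions` are hypothetical, and it only handles all-upper-case acronyms built from the initials of the preceding words.

```python
import re

# Hypothetical pattern (assumption): 2 to 10 upper-case letters in parentheses.
ACRONYM_IN_PARENS = re.compile(r"\(([A-Z]{2,10})\)")


def find_explicit_acronyms(text):
    """Return {acronym: long_form} for patterns like 'Cigarette smoke (CS)'."""
    found = {}
    for match in ACRONYM_IN_PARENS.finditer(text):
        acronym = match.group(1)
        # Take as many preceding words as the acronym has letters.
        words = text[:match.start()].split()[-len(acronym):]
        if len(words) != len(acronym):
            continue
        # Keep the pair only if the word initials spell the acronym.
        initials = "".join(word[0].upper() for word in words)
        if initials == acronym:
            found[acronym] = " ".join(words)
    return found


def find_acronym_mentions(text, acronyms):
    """Return sorted (start, end, acronym) spans for each standalone occurrence."""
    mentions = []
    for acronym in acronyms:
        for m in re.finditer(r"\b" + re.escape(acronym) + r"\b", text):
            mentions.append((m.start(), m.end(), acronym))
    return sorted(mentions)


text = ("Cigarette smoke (CS)-induced inflammation was measured. "
        "CS exposure lasted 4 weeks.")
acronyms = find_explicit_acronyms(text)
mentions = find_acronym_mentions(text, acronyms)
```

Here `acronyms` maps `CS` to `Cigarette smoke`, and `mentions` holds the spans of both occurrences of `CS`, which could then be kept as candidate mentions for the rest of the document. Implicitly introduced acronyms and abbreviated species names such as "C. lupus" would need a different strategy.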
Done, but it needs further testing.
Acronym recognition was tested for both the text and the PDF disambiguation processes.
- For the text disambiguation process, using the text of PubMed_2, the results show that all explicitly introduced acronyms are recognized, for example:
Cigarette smoke (CS)
There is still a problem, however: the type of the acronym can be different.

A separate issue will be opened for this problem.
- For the PDF disambiguation service, this issue is also related to issue #69, about generating layout tokens for acronyms.
This issue is closed, and another issue, #74, has been opened.
I am reopening this issue because quite a lot of cases are still not supported (mixed upper/lower case in acronyms, special symbols such as -), and issue #74 is not specific to acronyms.
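As an illustration of the unsupported cases, the following Python sketch accepts short forms with mixed case and hyphens (for example "mRNA" or "IL-6") and checks them against the preceding words with a simplified right-to-left character match. Everything here is an assumption made for discussion: the pattern, the window size, and the function names are hypothetical and do not describe the current entity-fishing code.

```python
import re

# Hypothetical pattern (assumption): starts with a letter, then 1 to 9
# letters, digits, or hyphens, all inside parentheses.
SHORT_FORM = re.compile(r"\(([A-Za-z][A-Za-z0-9-]{1,9})\)")


def is_acronym_like(short):
    """Accept short forms with at least one upper-case letter."""
    return any(c.isupper() for c in short)


def matches_long_form(short, long_form):
    """Scan right to left: each alphanumeric character of `short` must occur
    in `long_form` in order, and the first one must start a word."""
    chars = [c.lower() for c in short if c.isalnum()]
    lf = long_form.lower()
    pos = len(lf)
    for i in range(len(chars) - 1, -1, -1):
        pos = lf.rfind(chars[i], 0, pos)
        if i == 0:
            # Move left until the match sits at the beginning of a word.
            while pos > 0 and lf[pos - 1].isalnum():
                pos = lf.rfind(chars[i], 0, pos)
        if pos < 0:
            return False
    return True


def find_short_forms(text):
    """Return short forms in parentheses that match the words before them."""
    results = []
    for m in SHORT_FORM.finditer(text):
        short = m.group(1)
        if not is_acronym_like(short):
            continue
        n = sum(c.isalnum() for c in short)
        # Candidate long form: a bounded window of preceding words.
        window = " ".join(text[:m.start()].split()[-min(n + 5, 2 * n):])
        if matches_long_form(short, window):
            results.append(short)
    return results


sample = ("Levels of interleukin 6 (IL-6) and messenger ribonucleic acid "
          "(mRNA) were measured (see below).")
short_forms = find_short_forms(sample)
```

The hyphen is ignored during matching and the comparison is case-insensitive, so both "IL-6" and "mRNA" are accepted while "(see below)" is not. The match is deliberately permissive and would need a stricter check to avoid false positives on real documents.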