pyliterature
pyliterature copied to clipboard
A Pythonic wrapper for the scientific journal, including Nature, Science, ScienceDirect, Wiley, ACS publication, RSC publication and so on.
pyliterature
A Python web wrapper and text mining package for the scientific journal, including Nature, Science, ScienceDirect, Wiley, ACS publication, RSC publication and so on.
Author
- Xing Wang [email protected]
Dependencies
- Python3
- selenium
- beautifulsoup4
- nltk
Examples
from pyliterature import Pyliterature
url = 'http://www.nature.com/nature/journal/v541/n7635/full/nature20782.html'
keyword = 'DFT'
liter = Pyliterature(url, keyword)
liter.parser()
for keysent in liter.keysents:
print(keysent)
print('\n')
If you want to add features/improvement or report issues, feel free to send a pull request!
TODO
- read bib list
- determine sentence from which article