ChemDataExtractor
ChemDataExtractor copied to clipboard
Automatically extract chemical information from scientific documents
ChemDataExtractor
.. image:: http://img.shields.io/pypi/v/ChemDataExtractor.svg?style=flat-square :target: https://pypi.python.org/pypi/ChemDataExtractor
.. image:: http://img.shields.io/pypi/l/ChemDataExtractor.svg?style=flat-square :target: https://github.com/mcs07/ChemDataExtractor/blob/master/LICENSE
.. image:: http://img.shields.io/travis/mcs07/ChemDataExtractor.svg?style=flat-square :target: https://travis-ci.org/mcs07/ChemDataExtractor
ChemDataExtractor is a toolkit for extracting chemical information from the scientific literature.
Features
- HTML, XML and PDF document readers
- Chemistry-aware natural language processing pipeline
- Chemical named entity recognition
- Rule-based parsing grammars for property and spectra extraction
- Table parser for extracting tabulated data
- Document processing to resolve data interdependencies
Installation
To install ChemDataExtractor, simply run::
pip install chemdataextractor
Or if you are an Anaconda user, run::
conda install -c chemdataextractor chemdataextractor
Alternatively, try one of the other installation options
_.
Documentation
Full documentation is available at http://chemdataextractor.org/docs
License
ChemDataExtractor is licensed under the MIT license
_, a permissive, business-friendly license for open source
software.
.. _installation options
: http://chemdataextractor.org/docs/install
.. _MIT license
: https://github.com/mcs07/ChemDataExtractor/blob/master/LICENSE