PyCharlockHolmes
PyCharlockHolmes copied to clipboard
Character encoding detecting library for Python using ICU and libmagic.
Charlock Holmes
Character encoding detecting library for Python using ICU and libmagic. Inspired by Charlock Holmes
Dependency
- icu
- file(libmagic)
Gentoo
emerge -av dev-libs/icu
emerge -av sys-apps/file
Ubuntu
apt-get install libicu-dev
apt-get install libmagic-dev
Brew
brew install icu4c
brew install libmagic
export ICUI18N="/usr/local/Cellar/icu4c/xx" # Replace "xx" as the version of your icu
export MAGIC="/usr/local/Cellar/libmagic/xx" # Replace "xx" as the version of your libmagic
Install
python setup build
python setup install
Usage
from charlockholmes import detect
file = open('test.txt')
content = file.read()
print detect(content)
CHANGELOG
- 0.0.3
- Add support for python 3.X
License
Modified BSD License
