polymath
polymath copied to clipboard
Figure out text cleaning
While importing text, text can be gnarly. We need a consistent way to clean it.
There are two places where that happens currently:
- in
chunker.py: https://github.com/dglazkov/polymath/blob/main/convert/chunker.py#L43 - In
main.py: https://github.com/dglazkov/polymath/blob/main/convert/main.py#L58
Let's figure out a single way.