bazaar
bazaar copied to clipboard
Bazaar/Parser doesn't correctly escape some characters in TSV
trafficstars
For example, carriage returns should also be escaped properly but not, hence causing troubles like HazyResearch/deepdive#523.
I think this part of the code needs more careful work to conform to Postgres' TSV format or some other stricter standard: https://github.com/HazyResearch/bazaar/blob/c09dce20f16a90c359f804f9e83d6107547d442c/parser/src/main/scala/com/clearcut/nlp/DocumentParser.scala#L98