bazaar icon indicating copy to clipboard operation
bazaar copied to clipboard

Bazaar/Parser doesn't correctly escape some characters in TSV

Open netj opened this issue 9 years ago • 2 comments
trafficstars

For example, carriage returns should also be escaped properly but not, hence causing troubles like HazyResearch/deepdive#523.

I think this part of the code needs more careful work to conform to Postgres' TSV format or some other stricter standard: https://github.com/HazyResearch/bazaar/blob/c09dce20f16a90c359f804f9e83d6107547d442c/parser/src/main/scala/com/clearcut/nlp/DocumentParser.scala#L98

netj avatar Mar 16 '16 03:03 netj