kse icon indicating copy to clipboard operation
kse copied to clipboard

Broken string parsing in Byte consumers

Open Ichoran opened this issue 7 years ago • 0 comments

As reported by Andriy Plokhotnyuk, both the ByteBuffer and the ByteArrayInputStream (i.e. RecyclingParser) versions fail on the following input:

""""倒排索引(英语:Inverted index),也常被称为反向索引、置入档案或反向档案,是一种索引方法,被用来存储在全文搜索下某个单词在一个文档或者一组文档中的存储位置的映射。它是文档检索系统中最常用的数据结构。"""".getBytes

JsonRecyclingParser has the worse error in that it fails to even parse the string; JsonByteBufferParser parses successfully but encodes all the content as \uffXX where XX is the raw UTF-8 byte value.

Ichoran avatar Dec 13 '17 15:12 Ichoran