Roy Guo
Hi, I'm trying to use `Jieba.Cut(text, result)` here, but the result shows that it counts `offset`s in bytes, not Unicode characters. My text content has Chinese and English characters mixed,...
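For illustration, a byte offset can be mapped to a character offset by counting UTF-8 lead bytes. This is only a minimal sketch: it assumes the text is valid UTF-8 and that the reported offsets fall on code point boundaries. `ByteOffsetToCodePointOffset` is a hypothetical helper, not part of any Jieba API.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper (not a Jieba function): returns the number of Unicode
// code points that start before `byte_offset` in a valid UTF-8 string.
std::size_t ByteOffsetToCodePointOffset(const std::string& utf8,
                                        std::size_t byte_offset) {
    // Clamp out-of-range offsets to the end of the string.
    if (byte_offset > utf8.size()) {
        byte_offset = utf8.size();
    }
    std::size_t code_points = 0;
    for (std::size_t i = 0; i < byte_offset; ++i) {
        // UTF-8 continuation bytes have the form 10xxxxxx; any other byte
        // begins a new code point.
        if ((static_cast<unsigned char>(utf8[i]) & 0xC0) != 0x80) {
            ++code_points;
        }
    }
    return code_points;
}
```

Each call scans from the start of the string, so converting many offsets for one text would likely be better done in a single pass that records the character index at each word boundary.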
No need to allocate a new ArrayList here; this reduces text extraction time from 16 seconds to 14 seconds on a 4.2M PDF.
A quick fix for the stats log (this bug rarely happens). Signed-off-by: Kuankuan Guo
Currently we don't have enough UTs to cover all functions; I think we should start working on a better UT pipeline.
``` typedef struct raxNode { uint32_t iskey:1; /* Does this node contain a key? */ uint32_t isnull:1; /* Associated value is NULL (don't store it). */ uint32_t iscompr:1; /* Node...
`m_keys.remove(iter->second);`