cjj

Results 5 issues of cjj

如何对待切分可能有歧义的拼音? xian = 先/西安, linan = 李楠/临安 是否有可能把所有切分都列出来?

如平翘舌音,前后鼻音等在严格匹配的会被认为是错误,但是有没有办法吧相近的拼音也给召回,但是匹配的分数可以给一个惩罚(比如*0.3之类的)

我使用首字母搜索的时候发现翘舌音(z/c/s+h)会在一起导致搜索异常。 比如库中有“中华人民共和国”: curl -XGET 'localhost:9200/news/_search' -d '{"query":{"match_phrase":{"name":"zhonghua"}}}' curl -XGET 'localhost:9200/news/_search' -d '{"query":{"match_phrase":{"name":"rm"}}}' 均能正确搜索结果,但是 curl -XGET 'localhost:9200/news/_search' -d '{"query":{"match_phrase":{"name":"zh"}}}' curl -XGET 'localhost:9200/news/_search' -d '{"query":{"match_phrase":{"name":"zhrm"}}}' 却不行,应该是因为z+h认为是一个字导致无法识别 我的setting和mapping分别是 setting: `"index" : { "analysis"...

Does Texar support multi-gpus?

enhancement
help wanted
question

Error Log message":"UnpicklingError: invalid load key, 'x'.\n At:\n /usr/local/python-3.7/lib/python3.7/site-packages/whoosh/filedb/structfile.py(245): read_pickle /usr/local/python-3.7/lib/python3.7/site-packages/whoosh/codec/whoosh3.py(941): _goto /usr/local/python-3.7/lib/python3.7/site-packages/whoosh/codec/whoosh3.py(961): _next_block /usr/local/python-3.7/lib/python3.7/site-packages/whoosh/codec/whoosh3.py(1009): next /usr/local/python-3.7/lib/python3.7/site-packages/whoosh/matching/mcore.py(215): all_ids When I use whoosh in a concurrent scenario with multiple processes, the...