exjieba icon indicating copy to clipboard operation
exjieba copied to clipboard

Elixir verison of Jieba, a Chinese word segmentation module.

** Exjieba [[https://travis-ci.org/falood/exjieba/][file:https://api.travis-ci.org/falood/exjieba.png?branch=master]] Elixir verison of Jieba Base on [[https://github.com/aszxqw/libcppjieba/][libcppjieba]].

** Usage #+BEGIN_SRC elixir Erlang/OTP 22 [erts-10.4.4] [source] [64-bit] [smp:8:8] [ds:8:8:10] [async-threads:1] [hipe]

rm -rf priv/_segment. git submodule update --init mkdir -p priv &&
cc -g -fPIC -O3 -I/home/falood/.asdf/installs/erlang/22.0.7/erts-10.4.4/include -Ipriv/libcppjieba/include -shared -lstdc++ -DLOGGER_LEVEL=LL_ERROR src/mp_segment.cpp -o priv/mp_segment.so 2>&1 >/dev/null mkdir -p priv &&
cc -g -fPIC -O3 -I/home/falood/.asdf/installs/erlang/22.0.7/erts-10.4.4/include -Ipriv/libcppjieba/include -shared -lstdc++ -DLOGGER_LEVEL=LL_ERROR src/hmm_segment.cpp -o priv/hmm_segment.so 2>&1 >/dev/null mkdir -p priv &&
cc -g -fPIC -O3 -I/home/falood/.asdf/installs/erlang/22.0.7/erts-10.4.4/include -Ipriv/libcppjieba/include -shared -lstdc++ -DLOGGER_LEVEL=LL_ERROR src/mix_segment.cpp -o priv/mix_segment.so 2>&1 >/dev/null mkdir -p priv &&
cc -g -fPIC -O3 -I/home/falood/.asdf/installs/erlang/22.0.7/erts-10.4.4/include -Ipriv/libcppjieba/include -shared -lstdc++ -DLOGGER_LEVEL=LL_ERROR src/query_segment.cpp -o priv/query_segment.so 2>&1 >/dev/null Interactive Elixir (1.9.1) - press Ctrl+C to exit (type h() ENTER for help) iex(1)> ExJieba.MixSegment.cut "工信处女干事每月经过下属科室都要亲口交代24口交换机等技术性器件的安装工作" ["工信处", "女干事", "每月", "经过", "下属", "科室", "都", "要", "亲口", "交代", "24", "口", "交换机", "等", "技术性", "器件", "的", "安装", "工作"] #+END_SRC

** TODO_LIST

  • [ ] custom dict
  • [X] MPSegment
  • [X] MixSegment
  • [X] FullSegment
  • [X] QuerySegment

** THANKS

  • [[https://github.com/aszxqw/libcppjieba/][libcppjieb]] by aszxqw
  • [[https://github.com/fxsjy/jieba][jieba]] by SunJunyi