Tookit-Sihui
Tookit-Sihui copied to clipboard
Tookit-Sihui, a tool of some common algorithm, AI文本混合科学计算器(calculator-sihui), 句子词频-逆文本频率(TF-IDF),搜索BM25, 前缀树搜索关键词(trietree), 模板匹配-递归函数(func_recursive),中文数字转阿...
Tookit-Sihui
tookit_sihui(代码主体,未完待续...)
- ml_common
- BM25
- TF-IDF
- Trie-Tree
- func_recursive
- chinese_and_number
- task
- calculate_sihui
run(运行)
- 1. 进入tookit_sihui/ml_common/tf_idf/目录,
python tf_idf_freq.py
项目说明
- ml_common
- BM25(似乎有点问题)
- TF-IDF(tf-idf, 可设置-保存tf和idf的文件)
- Trie-Tree(前缀树,可实现人名-影视名等实体快速搜索)
- func_recursive(递归,规则遍历生成句子)
- chinese_and_number(中文汉字转阿拉伯数字,或者是阿拉伯数字转汉语数字,支持小数)
- task
- calculate_sihui(思慧计算器,AI智能文本计算器,支持从文本到计算结果的混合运算,还有指数运算,对数运算,阶乘等)
感谢|参考
- 时间提取项目zhanzecheng/Time_NLP: https://github.com/zhanzecheng/Time_NLP
- 中文阿拉伯数字转化项目tyong920/a2c: https://github.com/tyong920/a2c