NLP-KG-Projects
NLP-KG-Projects copied to clipboard
自己之前收集的自然语言处理和知识图谱相关的一些开源项目
trafficstars
NLP-KG-Projects
自己之前收集的自然语言处理和知识图谱相关的一些开源项目(star怪),有的内容详尽,有的内容稀少,都是以todolist的形式记录的。
其中各深度学习项目的实现框架基本都是Pytorch。
例如,
自然语言处理任务实现:
文本摘要
文本分类
- [ ] 649453932/Chinese-Text-Classification-Pytorch: 中文文本分类,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。
- [ ] Tencent/NeuralNLP-NeuralClassifier: An Open-source Neural Hierarchical Multi-label Text Classification Toolkit
- [ ] songyingxin/TextClassification-Pytorch: Pytorch + NLP, 一份友好的项目实践仓库
- [ ] prakashpandey9/Text-Classification-Pytorch: Text classification using deep learning models in Pytorch
- [ ] EdGENetworks/attention-networks-for-classification: Hierarchical Attention Networks for Document Classification in PyTorch
- [ ] Shawn1993/cnn-text-classification-pytorch: CNNs for Sentence Classification in PyTorch
- [ ] ZephyrChenzf/text-classification-pytorch: 使用pytorch和京东某商品评价数据集,采用不同模型对文本进行分类
- [ ] wabyking/TextClassificationBenchmark: A Benchmark of Text Classification in PyTorch
- [ ] bigboNed3/chinese_text_cnn: TextCNN Pytorch实现 中文文本分类 情感分析
- [ ] Cheneng/TextCNN: Convolutional Neural Networks for Sentence Classification in PyTorch
- [ ] real-brilliant/bert_chinese_pytorch: bert for chinese text classification
- [ ] brightmart/text_classification: all kinds of text classification models and more with deep learning
- [ ] gaussic/text-classification-cnn-rnn: CNN-RNN中文文本分类,基于TensorFlow
中文分词
- [ ] Moonshile/ChineseWordSegmentation: Chinese word segmentation algorithm without corpus(无需语料库的中文分词)
- [ ] fxsjy/jieba: 结巴中文分词
- [ ] Chinese Word Segmentation by Iterated Dilated Convolutions
- [ ] ZephyrChenzf/participle: 三个分词器,一个使用bilstm+viterbi,一个使用n-gram,一个使用cnn+bilstm+crf
关键词提取
- [ ] ArtistScript/FastTextRank: 中文文本摘要/关键词提取
- [ ] aneesha/RAKE: A python implementation of the Rapid Automatic Keyword Extraction
主题模型
聊天机器人
- [ ] zhaoyingjun/chatbot: 一个可以自己进行训练的中文聊天机器人, 根据自己的语料训练出自己想要的聊天机器人,可以用于智能客服、在线问答、智能聊天等场景。目前包含seq2seq、seqGAN版本和tf2.0版本。
- [ ] ywk991112/pytorch-chatbot: Pytorch seq2seq chatbot
- [ ] jinfagang/pytorch_chatbot: A Marvelous ChatBot implement using PyTorch.
句法分析
- [ ] nikitakit/self-attentive-parser: High-accuracy NLP parser with models for 11 languages.
- [ ] baoy-nlp/FAParser: A Fast(er) and Accurate Syntactic Parsing by Exacter Searching.
依存分析
- [ ] Samurais/text-dependency-parser: 自然语言处理之依存关系分析
- [ ] akashrajkn/dependency-parser: Neural graph-based dependency parser
- [ ] daandouwe/biaffine-dependency-parser: A graph based dependency parser in PyTorch.
- [ ] EelcovdW/Biaffine-Parser: Biaffine Dependency Parser, implemented in PyTorch.
- [ ] zysite/biaffine-parser: An implementation of "Deep Biaffine Attention for Neural Dependency Parsing"
- [ ] elikip/bist-parser: Graph-based and Transition-based dependency parsers based on BiLSTMs
- [ ] wddabc/bist-parser: Graph-based and Transition-based dependency parsers based on BiLSTMs
语义相似度
- [ ] liuhuanyong/SiameseSentenceSimilarity: SiameseSentenceSimilarity,个人实现的基于Siamese bilstm模型的相似句子判定模型,提供训练数据集和测试数据集.
- [ ] buxcon/siamese_lstm: A PyTorch implementation for 'Siamese Recurrent Architectures for Learning Sentence Similarity'.
- [ ] RandolphVI/Text-Pairs-Relation-Classification: About Text Pairs (Sentence Level) Classification (Similarity Modeling) Based on Neural Network.
- [ ] demelin/Sentence-similarity-classifier-for-pyTorch: Re-implementation of Mueller's et al., "Siamese Recurrent Architectures for Learning Sentence Similarity." (AAAI, 2016)
- [ ] adambielski/siamese-triplet: Siamese and triplet networks with online pair/triplet mining in PyTorch
- [ ] yanqiangmiffy/sentence-similarity: 问题句子相似度计算,即给定客服里用户描述的两句话,用算法来判断是否表示了相同的语义。
- [ ] tlatkowski/multihead-siamese-nets: Implementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task.
序列标注
- [ ] LiyuanLucasLiu/LM-LSTM-CRF: Empower Sequence Labeling with Task-Aware Language Model
- [ ] sgrvinod/a-PyTorch-Tutorial-to-Sequence-Labeling: Empower Sequence Labeling with Task-Aware Neural Language Model | a PyTorch Tutorial to Sequence Labeling
新词发现
语义角色标注
- [ ] weiyuhan/chinese_srl_birnn: Chinese Semantic Role Labeling with Bidirectional Recurrent Neural Networks
- [ ] Nrgeup/chinese_semantic_role_labeling: 基于 Bi-LSTM 和 CRF 的中文语义角色标注
- [ ] luheng/deep_srl: Code and pre-trained model for: Deep Semantic Role Labeling: What Works and What's Next
- [ ] phueb/DLSRL: Deep Bidirection LSTM for Semantic Role Labeling
- [ ] sanjaymeena/semantic_role_labeling_deep_learning: SRL deep learning model is based on DB-LSTM which is described in this paper : 《End-to-end learning of semantic role labeling using recurrent neural networks》
- [ ] XMUNLP/Tagger: Deep Semantic Role Labeling with Self-Attention
词性标注
- [ ] Joyce94/Joint-model-for-Word-Segmentation-and-POS: Implementation for paper:"A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging".
- [ ] bamtercelboo/pytorch_Joint-Word-Segmentation-and-POS-Tagging: Paper: A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging
词向量
- [ ] weberrr/pytorch_word2vec: pytorch word2vec Four implementations : skip gram / CBOW on hierarchical softmax / negative sampling
- [ ] ksang/word2vec: A word2vec CBOW and Skip-gram implementation in PyTorch
- [ ] fanglanting/skip-gram-pytorch: A complete pytorch implementation of skip-gram
- [ ] jeffchy/pytorch-word-embedding: Pytorch Collection and Implementation of Various Word Embedding
- [ ] liuhuanyong/ChineseEmbedding: Chinese Embedding collection incling token ,postag ,pinyin,dependency,word embedding.中文自然语言处理向量合集,包括字向量,拼音向量,词向量,词性向量,依存关系向量.共5种类型的向量
- [ ] stanfordnlp/GloVe: GloVe model for distributed word representation
句子向量
文本生成
- [ ] ClownW/Char-RNN-Pytorch: Char-RNN building by Pytorch to generating Chinese text
- [ ] stardut/Text-Generate-RNN: 中国古诗生成(文本生成)
- [ ] wandouduoduo/SunRnn: 一个基于最新版本TensorFlow的Char RNN实现。可以实现生成英文、写诗、歌词、小说、生成代码、生成日文等功能。
- [ ] halxp1/lstm-text-generation: 文本生成(Word2Vec + RNN/LSTM)
- [ ] Kevinwenya/GenerateText: 文本生成--从原理到实现
- [ ] yxtay/char-rnn-text-generation: Character Embeddings Recurrent Neural Network Text Generation Models
- [ ] hzy46/Char-RNN-TensorFlow: Multi-language Char RNN for TensorFlow >= 1.2.
机器翻译
知识图谱任务实现
表示学习
- [ ] Lapis-Hong/TransE-Knowledge-Graph-Embedding: TensorFlow implementation of TransE and its extended models for Knowledge Representation Learning
- [ ] ZichaoHuang/TransE: A TensorFlow implementation of TransE model
- [ ] thunlp/OpenKE: An Open-Source Package for Knowledge Embedding (KE)
- [ ] ShulinCao/OpenKE-PyTorch
- [ ] wencolani/TransE: An implementation of TransE with tensorflow
命名实体识别
- [ ] Chinese Named Entity Recognition by IDCNN/biLSTM with CRF layer
- [ ] liuwei1206/CCW-NER: The source code of Chinese Character-Word NER.
- [ ] buppt/ChineseNER: 中文命名实体识别,实体抽取,tensorflow,pytorch,BiLSTM+CRF
- [ ] LiyuanLucasLiu/LightNER: Inference with state-of-the-art models (pre-trained by LD-Net / AutoNER / VanillaNER / ...)
- [ ] glample/tagger: Named Entity Recognition Tool
- [ ] AdolHong/sequence_tagging: Named Entity Recognition (biLSTM + CRF) - PyTorch
- [ ] ZhixiuYe/NER-pytorch: LSTM+CRF NER
- [ ] synalp/NER
- [ ] ZephyrChenzf/NER-Sequence-labeling--Textcnn-bilstm-crf-pytorch: pytorch用Textcnn-bilstm-crf模型实现命名实体识别
- [ ] sberbank-ai/ner-bert: BERT-NER (nert-bert) with google bert https://github.com/google-research.
- [ ] macanv/BERT-BiLSTM-CRF-NER: Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services
- [ ] zjy-ucas/ChineseNER: A neural network model for Chinese named entity recognition
- [ ] guillaumegenthial/sequence_tagging: Named Entity Recognition (LSTM + CRF) - Tensorflow
- [ ] Determined22/zh-NER-TF: A very simple BiLSTM-CRF model for Chinese Named Entity Recognition 中文命名实体识别 (TensorFlow)
关系抽取
- [ ] thunlp/OpenNRE: An Open-Source Package for Neural Relation Extraction (NRE)
- [ ] zjunlp/deepke: 基于深度学习的开源中文关系抽取框架
- [ ] HazyResearch/deepdive: DeepDive
- [ ] xiaolalala/Distant-Supervised-Chinese-Relation-Extraction: 基于远监督的中文关系抽取
- [ ] Chinese Relation Extraction by biGRU with Character and Sentence Attentions
- [ ] buppt/ChineseNRE: 中文实体关系抽取,pytorch,bilstm+attention
- [ ] ShomyLiu/pytorch-pcnn: supervised relation extraction for PCNN (Zeng 2014) in pytorch 关系抽取
- [ ] qipeng/gcn-over-pruned-trees: Graph Convolution over Pruned Dependency Trees Improves Relation Extraction (authors' PyTorch implementation)
- [ ] ZhixiuYe/Intra-Bag-and-Inter-Bag-Attentions: Code for NAACL 2019 paper: Distant Supervision Relation Extraction with Intra-Bag and Inter-Bag Attentions
- [ ] ShulinCao/OpenNRE-PyTorch: Neural Relation Extraction implemented in PyTorch
- [ ] ShomyLiu/pytorch-relation-extraction: distant supervised relation extraction models: PCNN MIL (Zeng 2015), PCNN+ATT(Lin 2016). 关系抽取
- [ ] ybch14/RelationExtraction-NIS-PyTorch: Code for the NLPCC 2018 paper: Distant Supervision for Relation Extraction with Neural Instance Selector
事件抽取
- [ ] benkang-chen/Event-Extraction: 事件抽取相关算法汇总
- [ ] liuhuanyong/EventTriplesExtraction: An experiment and demo-level tool for text information extraction (event-triples extraction), which can be a route to the event chain and topic graph, 基于依存句法与语义角色标注的事件三元组抽取,可用于文本理解如文档主题链,事件线等应用。
实体链指
- [ ] facebookresearch/BLINK: Entity Linker solution
- [ ] generall/OneShotNLP: PyTorch text matching models implementation for One-Shot Named Entity Linking
- [ ] UKPLab/starsem2018-entity-linking: Accompanying code for our *SEM 2018 @ NAACL 2018 paper "Mixing Context Granularities for Improved Entity Linking on Question Answering Data across Entity Categories"
实体消岐
共指消解
- [ ] kentonl/e2e-coref: End-to-end Neural Coreference Resolution
- [ ] shayneobrien/coreference-resolution: Efficient and clean PyTorch reimplementation of "End-to-end Neural Coreference Resolution" (Lee et al., EMNLP 2017).