sentence-embeddings topic

List of sentence-embeddings repositories

inltk

813 Stars, 165 Forks

Natural Language Toolkit for Indic Languages aims to provide out-of-the-box support for various NLP tasks that an application developer might need.

DiffCSE

287 Stars, 26 Forks

Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"

BioSentVec

556 Stars, 96 Forks

BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences

xmnlp

1.2k Stars, 189 Forks

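xmnlp: provides Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, text correction, text-to-pinyin conversion, text summarization, character radical extraction, sentence representation, and text similarity computation.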

text2vec

4.1k Stars, 378 Forks

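text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.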

SimCSE

3.3k Stars, 499 Forks

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

easy-bert

167 Stars, 44 Forks

A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)

sent2vec

131 Stars, 12 Forks

How to encode sentences in a high-dimensional vector space, a.k.a. sentence embedding.