sentence-embeddings topic
inltk
Natural Language Toolkit for Indic Languages aims to provide out-of-the-box support for various NLP tasks that an application developer might need.
DiffCSE
Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"
BioSentVec
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
xmnlp
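xmnlp: provides Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, text correction, text-to-pinyin conversion, text summarization, character radical lookup, sentence representation, and text similarity computation.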
text2vec
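text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.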
SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
Structured-Self-Attention
A Structured Self-attentive Sentence Embedding
Spherical-Text-Embedding
[NeurIPS 2019] Spherical Text Embedding
easy-bert
A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
sent2vec
How to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding.