python-datascientist icon indicating copy to clipboard operation
python-datascientist copied to clipboard

Ressources à garder en tête pour la révision de la partie NLP

Open linogaliana opened this issue 8 months ago • 0 comments

  • https://towardsdatascience.com/lda2vec-word-embeddings-in-topic-models-4ee3fc4b2843
  • https://www.saltdatalabs.com/blog/word2vec-vs-bert
  • parler du classifieur naive bayes quelque part
model_name = "sentence-transformers/all-mpnet-base-v2"
model_kwargs = {'device': 'cpu'}
encode_kwargs = {'normalize_embeddings': False}
hf = HuggingFaceEmbeddings(
    model_name=model_name,
    model_kwargs=model_kwargs,
    encode_kwargs=encode_kwargs
)

linogaliana avatar Jun 24 '24 09:06 linogaliana