python-datascientist
Resources to keep in mind when revising the NLP section:
- https://towardsdatascience.com/lda2vec-word-embeddings-in-topic-models-4ee3fc4b2843
- https://www.saltdatalabs.com/blog/word2vec-vs-bert
- Mention the naive Bayes classifier somewhere
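# Import path is an assumption: it depends on the installed LangChain version
# (newer releases expose the same class in the langchain_huggingface package).
from langchain_community.embeddings import HuggingFaceEmbeddings

model_name = "sentence-transformers/all-mpnet-base-v2"
model_kwargs = {'device': 'cpu'}  # run the model on CPU
encode_kwargs = {'normalize_embeddings': False}  # keep raw, non-unit-length vectors
hf = HuggingFaceEmbeddings(
    model_name=model_name,
    model_kwargs=model_kwargs,
    encode_kwargs=encode_kwargs,
)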
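
A note on the `normalize_embeddings` flag in the snippet above: when embeddings are scaled to unit length, their dot product equals their cosine similarity, which is why the setting matters for similarity search. The sketch below illustrates this with plain Python and made-up toy vectors. The helper names (`l2_normalize`, `dot`, `cosine_similarity`) are hypothetical and not part of any library, and no model is loaded.

```python
import math


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def l2_normalize(vec):
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(dot(vec, vec))
    return [x / norm for x in vec]


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors."""
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))


# Toy vectors standing in for embeddings (invented for illustration).
u = [3.0, 4.0, 0.0]
v = [1.0, 2.0, 2.0]

raw_dot = dot(u, v)                # depends on vector lengths
cos = cosine_similarity(u, v)      # length-independent
normalized_dot = dot(l2_normalize(u), l2_normalize(v))

print(raw_dot, cos, normalized_dot)
```

With `normalize_embeddings` set to `False`, as in the snippet, a raw dot product would also reflect vector length, so cosine similarity is likely the safer comparison unless the vectors are normalized downstream.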