doc2vec
doc2vec copied to clipboard
:notebook: Long(er) text representation and classification using Doc2Vec embeddings
Bumps [numpy](https://github.com/numpy/numpy) from 1.16.0 to 1.22.0. Release notes Sourced from numpy's releases. v1.22.0 NumPy 1.22.0 Release Notes NumPy 1.22.0 is a big release featuring the work of 153 contributors spread...
Add CLI support for the following commands: - Pass a dataset to the model for training - Pass a dataset to the model for testing given a trained model path...
Add support to train the model on other text classification datasets, which will make it suitable for more use cases.
Add unit tests to trigger the model training and inference then make sure the performance doesn't worsen below a certain threshold (80%?) Depends on #14
The script now only supports training and testing given an input dataset, we need to add a new function to support prediction given a new example. - Save the trained...
Add a Jupyter notebook with the following modifications (more ideas are welcomed!) to better understand what is going on under the hood: - Exploratory analysis of the input dataset(s). -...
Use PyUP bot for automated updating of model dependencies. Depends on #15