Jason Nance
Jason Nance
## Feature Implement best-effort support for some descriptive stats commonly applied to text data -- keyword/n-gram counts, typical document length, distribution of classes/labels, etc. ## Motivation Helpful for people exploring...
## Feature Support some best-effort methods for some of the following: clustering, topic modeling, similarity calculation, and/or visualization (via ex. UMAP). These would be helper functions that operate directly on...
## Feature Write the DLTK version number in the model metadata and either warn or raise an error if an incompatibility is detected when reloading an old model. ## Motivation...
## Feature Implement some functionality to make downloading and caching files more robust. ## Motivation There are some potential pitfalls related to filename collisions, partial downloads, and bookkeeping with the...
## Feature Tasks that take input can accept an in-memory list **or** a path to a file with some specified format with optional validation. ## Motivation Prevents users' datasets from...
## Feature Support multilabel classification instead of just multiclass classification. ## Motivation This would allow a single model to share information about multiple labels instead of training multiple models to...