gobbli icon indicating copy to clipboard operation
gobbli copied to clipboard

Add helper module for exploratory descriptives

Open jasonnance opened this issue 5 years ago • 0 comments

Feature

Implement best-effort support for some descriptive stats commonly applied to text data -- keyword/n-gram counts, typical document length, distribution of classes/labels, etc.

Motivation

Helpful for people exploring a new dataset before deciding what they want to do with it or determining what kind of domain-specific preprocessing may be necessary.

Additional Details

jasonnance avatar Sep 06 '19 14:09 jasonnance