pinot
pinot copied to clipboard
Add functions for statistical analysis in SQL
As discussed with @siddharthteotia, consider adding some common statistical analysis methods SQL language.
Few examples:
- Pearson's coefficient
- Sampling (bernoulli/stratified)
- Histogram
- Entropy
- Linear regression
- Logistic regression
- SVM
Designing the one request - multiple (sequential) queries model for statistical functions. Planning to use mini-batch stochastic gradient descent for regression algorithms 2. 3. 4.
Supporting histogram
, entropy
like computations could also be potentially useful
Anyone working for supporting Sampling? Do we know how much effort is it going to be ? will it be few days or weeks?
@jasperjiaguo @siddharthteotia