Simon Dedman

Results 169 comments of Simon Dedman

- Context to AUC values: https://stats.stackexchange.com/questions/71946/overfitting-a-logistic-regression-model - Training Vs CV/prediction AUC = overfitting: https://stats.stackexchange.com/questions/220807/is-overfitted-model-with-higher-auc-on-test-sample-better-than-not-overfitted-on - Similar, me cited: https://stats.stackexchange.com/questions/136182/training-auc-and-cv-auc-in-boosted-regression-tree - ML models overfit SDMs due to unavoidable spatial colinearity? See...

Lit review started here: https://docs.google.com/document/d/1DybTZs6j4rUWaIBlbIcobN8839kK53nP873-mi3wt9k/edit?usp=sharing Please add your stuff. I'll finish adding papers then populate out their sections.

[2019.10.25 Lies, Damned Lies, and Accuracy Metrics in Machine Learning.odt](https://github.com/SimonDedman/gbm.auto/files/7392293/2019.10.25.Lies.Damned.Lies.and.Accuracy.Metrics.in.Machine.Learning.odt) [2021-03-02_Ashley_Jester_Stats_Consultation.odt](https://github.com/SimonDedman/gbm.auto/files/7392304/2021-03-02_Ashley_Jester_Stats_Consultation.odt) evaluating machine learning models: https://learning.oreilly.com/library/view/evaluating-machine-learning/9781492048756/

Conventional machine vs deep learning: classifying goliath grouper inertial measurement unit data into behaviours. [Matthew’s Correlation Coefficient, Cohen’s Kappa]. 25 https://bls.econference.io/public//main/sessions/3679 Lauran Brewster. Ask Lauran about this.

![image](https://user-images.githubusercontent.com/4599748/139292798-65ddf28d-7eb9-4ed2-8f93-71788552091d.png) my image from 2021-03-02_Ashley_Jester_Stats_Consultation.odt above

https://www.statology.org/balanced-accuracy/

The correlation between predicted habitat values (averaged to 5° x 5° spatial resolution) and CPUE was calculated using a modified t-test from the R package ‘SpatialPack’ (Osorio et al., 2020)...

https://mlr.mlr-org.com/articles/tutorial/measures.html

https://scikit-learn.org/stable/modules/model_evaluation.html see also https://scikit-learn.org/stable/model_selection.html#model-selection & generally https://scikit-learn.org/stable/index.html

Elith 2008: “Predictive performance should not be estimated on training data, but results are provided in Table 3 to show that BRT overfits the data, regardless of careful model development....