Statistics-and-Econometrics-for-Data-Science
Statistics-and-Econometrics-for-Data-Science copied to clipboard
Multicollinearity notebook
Interested to work on this project! Could you tell more about this, I am new to open source! Though I have done some linear regression and logistic regression problems on kaggle.
Multicollinearity occurs when there is a high degree of correlation between two or more explanatory variables. Due to multicollinearity, you might get a very high accuracy (or more accurately high R^2) but the errors for the coefficients happen to be so high that we cannot say with any variable actually contributes to accuracy. For identifying multicollinearity, we run a regression considering one of the explanatory variables as Y and the others as X and run a regression to get R2 and then calculate a variance inflation factor.
I want to contribute to this @PetalsOnWind . I am a participant in GSSoc 2021
Hey, I'm a GSSoc'21 participant who's quite interested in statistics for data science. @PetalsOnWind Could you please assign this to me?
If no updates than I would like to address this issue..GSSoC'21 participant