Statistics-and-Econometrics-for-Data-Science

Multicollinearity notebook

Open PetalsOnWind opened this issue 4 years ago • 5 comments

PetalsOnWind avatar Dec 04 '20 10:12 PetalsOnWind

I'm interested in working on this project! Could you tell me more about it? I'm new to open source, though I have done some linear regression and logistic regression problems on Kaggle.

dp-iitkgp avatar Dec 04 '20 15:12 dp-iitkgp

Multicollinearity occurs when there is a high degree of correlation between two or more explanatory variables. With multicollinearity, you might get a very good fit (a high R^2), but the standard errors of the coefficients can be so large that we cannot say with any confidence which variable actually contributes to that fit. To identify multicollinearity, we take one of the explanatory variables as Y, regress it on the other explanatory variables as X, and get the R^2 of that auxiliary regression. From it we calculate the variance inflation factor, VIF = 1 / (1 - R^2), and repeat this for each explanatory variable.
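
To make the procedure above concrete, here is a minimal sketch in Python with NumPy. It is not code from the repository's notebook: the function name `vif` and the synthetic columns `x1`, `x2`, `x3` are assumptions made up for illustration. Each column is regressed on the others (plus an intercept) by ordinary least squares, and the resulting R^2 is turned into a VIF.

```python
import numpy as np

def vif(X):
    """Variance inflation factor for each column of X (hypothetical helper)."""
    X = np.asarray(X, dtype=float)
    n, k = X.shape
    out = np.empty(k)
    for j in range(k):
        y = X[:, j]                                # this variable plays the role of Y
        others = np.delete(X, j, axis=1)           # the remaining variables are X
        A = np.column_stack([np.ones(n), others])  # add an intercept column
        coef, *_ = np.linalg.lstsq(A, y, rcond=None)
        resid = y - A @ coef
        r2 = 1.0 - (resid @ resid) / ((y - y.mean()) ** 2).sum()
        out[j] = 1.0 / (1.0 - r2)                  # VIF = 1 / (1 - R^2)
    return out

# Synthetic data (assumed for illustration): x2 is almost a copy of x1,
# x3 is unrelated to both.
rng = np.random.default_rng(0)
x1 = rng.normal(size=500)
x2 = x1 + 0.1 * rng.normal(size=500)
x3 = rng.normal(size=500)

vifs = vif(np.column_stack([x1, x2, x3]))
print(vifs)
```

In this setup the VIFs for `x1` and `x2` should come out large and the one for `x3` close to 1. A commonly cited rule of thumb treats a VIF above roughly 5 to 10 as a sign of problematic multicollinearity, though the cutoff is a convention rather than a hard rule. Note that with perfectly collinear columns R^2 equals 1 and the division blows up, so a real notebook would probably want to guard against that case.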

PetalsOnWind avatar Dec 04 '20 16:12 PetalsOnWind

I want to contribute to this, @PetalsOnWind. I am a participant in GSSoC 2021.

Anam118 avatar Mar 10 '21 10:03 Anam118

Hey, I'm a GSSoC'21 participant who's quite interested in statistics for data science. @PetalsOnWind, could you please assign this to me?

lekshmissunil avatar Mar 27 '21 09:03 lekshmissunil

If there are no updates, then I would like to address this issue. I'm a GSSoC'21 participant.

Dishita2602 avatar Apr 05 '21 17:04 Dishita2602