ML-ProjectKart icon indicating copy to clipboard operation
ML-ProjectKart copied to clipboard

CodePeak 2025 : Pnemonia Disease Prediction

Open prathimacode-hub opened this issue 1 year ago • 12 comments
trafficstars

Define You:

  • [x] CodePeak 2025 Participant
  • [ ] Contributor

Hey OS participants, good to see you.

This project will help us in predicting the pneumonia disease using the dataset given

Dataset : https://www.kaggle.com/broach/weekly-cdc-pneumonia-cases

If you like to work on this issue, follow the given guidelines for code as well as for the README. You can compare it using different algorithms for better modelling practices.

prathimacode-hub avatar Dec 31 '23 11:12 prathimacode-hub

I would like to work on this issue. Let us use linear support vector machines to perform this classification.Let x(p dimensional vector implying p umptoms),y(1 or -1 depending on pneumonia affected or not repectively) be a et of n points. Our job is to draw a hyperplane through them andthen figure out if the n+1 th point is pneumonia affeted or not

Annie-1-code avatar Dec 31 '23 14:12 Annie-1-code

You have to use alteast 3-4 algorithms and compare the results for any issue taken for. Never go with just 1 algorithm. It isn't sufficient and not a right way to get model creation done. @Annie-1-code

prathimacode-hub avatar Dec 31 '23 15:12 prathimacode-hub

I have explored the dataset. I now want to clean the dataseti.e. remove all the non-null values. I tried looking at a simillar project Black Friday Sales- Analysis and Prediction and looked into the data cleaning part of the file ML-ProjectKart/Black Friday Sales- Analysis and Prediction/Model /black_friday_sales_analysis_and_prediction.ipynb

I could not understand why we were fill na value with 9, 12 and 9300 numbers. Could you help me undertand those or guide me toward a path to remove null values

Annie-1-code avatar Jan 01 '24 08:01 Annie-1-code

It depends on what we are trying to achieve. Either you have remove categorical columns if it's not useful for modelling, else have to replace with values. Values vary according to data in dataset. @Annie-1-code

prathimacode-hub avatar Jan 01 '24 08:01 prathimacode-hub

Follow this links for better reference : https://www.analyticsvidhya.com/blog/2021/05/dealing-with-missing-values-in-python-a-complete-guide/ https://towardsdatascience.com/6-different-ways-to-compensate-for-missing-values-data-imputation-with-examples-6022d9ca0779?gi=bb21e3e05253 https://www.freecodecamp.org/news/how-to-handle-missing-data-in-a-dataset/ @Annie-1-code

prathimacode-hub avatar Jan 01 '24 15:01 prathimacode-hub

@Annie-1-code, let me know if you need a meet call for discussion?

prathimacode-hub avatar Jan 06 '24 15:01 prathimacode-hub

Sure, what about 11am tomorrow? If that i ok with you, I can set up a meet call.

Annie-1-code avatar Jan 06 '24 16:01 Annie-1-code

3pm works for me. Once you give you a confirmation on mentioned time, I shall share out to the meet link. @Annie-1-code

prathimacode-hub avatar Jan 06 '24 18:01 prathimacode-hub

Sure. 3 pm works for me too

Annie-1-code avatar Jan 07 '24 01:01 Annie-1-code

Just few hours left for the deadline until tomorrow. @Annie-1-code

prathimacode-hub avatar Jan 08 '24 08:01 prathimacode-hub

Is your issue sorted? @Annie-1-code

prathimacode-hub avatar Jan 10 '24 09:01 prathimacode-hub

Can we meet via Google meet at 9 pm today?

Annie-1-code avatar Jan 10 '24 09:01 Annie-1-code