GSoC-Data
GSoC-Data copied to clipboard
Prospective Visualisation and Analysis Ideas
These are some of things that can be done/ I would like to see, I could think of the top of my head right now. Will keep on adding.
- Total stipend given away by Google.
- Total number of distinct organization that participated until now
- Total number of distinct projects carried out since its inception
- Total number of distinct students who participated
- Number of participating organization vs Year - Bar Chart
- Number of Students Accepted vs Year - Line Plot
- Gender proportion of accepted students (Overall and at the year level) - Grouped Bar Chart
- Word Cloud at year level as well as Overall- Use title and description of the project to get -
- Most popular programming language
- Database, tech stack etc.
- Trend of ML, NLP etc.
- Find Orgs selected most number of times (Arrange in Descending Order)
- Find the person who did GSoC most number of times. Find him!!!
- Find the person who mentored most number of times?
- Find organizations that got most number of projects selected under GSoC
- Avg. project selected for each org over years.
For #7, we can use this or a similar/better library i guess?
Damn, most of these are pretty legit!
Find the person who did GSoC most number of times. Find him!!!
Yes!
Copying from readme (these have already been said above, but still)
Who did the most number of GSoCs? under which org?
Which org has the highest sutdent-to-mentor conversion rate? (students who first did GSoC under the org, and then became mentors)
Run some magic on the descriptions of projects over the years to find out if there is a trend of ML related projects.