GSoC Organizations is a basic static site that lists the open-source organizations that have taken part in Google Summer of Code in past years.
For each organization, it shows every year the organization appeared, with a link to its Google Summer of Code archive page for that year.
Currently, data is available for only four years, 2016 to 2019.
The data was scraped from the official GSoC archive using Scrapy.
Why did I make this?
While preparing for GSoC, selecting which organizations to target is a time-consuming process. This is because the official GSoC site offers no direct way to see how many times an organization has appeared in GSoC. I think this is an important factor, as people would want to contribute to organizations that appear more often in GSoC.
- Scrapy was used to scrape data from the official GSoC archive. The link to the repository of the scraper can be found in this post itself.
- Jekyll was used to build a static site to display all the data. I used the Materialize Jekyll theme.
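As a rough illustration of how per-year scraped records can be turned into the per-organization listing the site shows, here is a minimal Python sketch. The record fields (`name`, `year`, `url`), the function name, and the sample data are assumptions made for illustration; they are not taken from the actual scraper or site.

```python
from collections import defaultdict


def group_by_organization(records):
    """Group records into (name, {year: archive_url}) pairs, most appearances first."""
    appearances = defaultdict(dict)
    for record in records:
        appearances[record["name"]][record["year"]] = record["url"]
    # Sort by number of appearances (descending), then by name for a stable order.
    return sorted(appearances.items(), key=lambda item: (-len(item[1]), item[0]))


# Hypothetical sample records; real data would come from the scraper's output.
sample_records = [
    {"name": "Example Org A", "year": 2016, "url": "https://example.org/2016/a"},
    {"name": "Example Org B", "year": 2017, "url": "https://example.org/2017/b"},
    {"name": "Example Org A", "year": 2018, "url": "https://example.org/2018/a"},
    {"name": "Example Org A", "year": 2019, "url": "https://example.org/2019/a"},
]

grouped = group_by_organization(sample_records)
for name, years in grouped:
    print(name, sorted(years))
```

Sorting by appearance count puts the organizations that show up most often at the top, which is the factor this site is meant to make visible.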
There are some things that I am planning to modify in this project.
Currently, this site has data for only four years. I am planning to scrape the data for all the available years in the GSoC archive.
The new version will also scrape the tags and other relevant info, so that users will be able to filter the orgs by tags, number of accepted projects, etc. I am also planning to improve the UI of the site.
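The planned filtering could look something like the following Python sketch. This is only one possible shape for it: the fields `tags` and `accepted_projects`, the function name, and the sample organizations are all hypothetical, since the new version has not been built yet.

```python
def filter_organizations(orgs, tag=None, min_projects=0):
    """Keep orgs that carry `tag` (if given) and have at least `min_projects` accepted projects."""
    return [
        org
        for org in orgs
        if (tag is None or tag in org["tags"])
        and org["accepted_projects"] >= min_projects
    ]


# Hypothetical sample data; the field names are assumptions for illustration.
sample_orgs = [
    {"name": "Example Org A", "tags": ["python", "web"], "accepted_projects": 12},
    {"name": "Example Org B", "tags": ["c", "kernel"], "accepted_projects": 4},
    {"name": "Example Org C", "tags": ["python"], "accepted_projects": 3},
]

python_orgs = filter_organizations(sample_orgs, tag="python", min_projects=5)
print([org["name"] for org in python_orgs])
```

Leaving `tag` as `None` skips the tag check, so the same function can filter on the project count alone.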