Centralize Voting Results of the German Bundesrat

Open jfilter opened this issue 6 years ago • 2 comments

The Bundesrat publishes its voting results on 16 different websites. Let's centralize them!

https://www.bundesrat.de/DE/plenum/abstimmung/abstimmung-node.html

So first, you have to extract the text from the text-based PDFs. Tools such as https://github.com/pdfminer/pdfminer.six could help. In the next step, you have to aggregate the results for each "Drucksache". Not sure how accurate the extracted text will be. Maybe some manual data cleaning is necessary. To scrape the results automatically, you could put the scrapers on https://morph.io/.
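A rough sketch of those two steps in Python, in case it helps someone get started. It assumes pdfminer.six is installed (`pdfminer.high_level.extract_text` is its high-level entry point). Everything else is hypothetical: the regex assumes Drucksache numbers look like `123/18`, and the `(drucksache, state, vote)` record shape and the vote labels are made up for illustration. The real PDFs will probably need per-state parsing and manual cleanup.

```python
import re
from collections import defaultdict

# Assumption: Drucksache numbers look like "123/18" (number/two-digit year).
# The real documents may use other formats, so treat this pattern as a guess.
DRUCKSACHE_RE = re.compile(r"\b(\d{1,4}/\d{2})\b")


def extract_pdf_text(path):
    """Return the text of a text-based PDF using pdfminer.six."""
    # Imported lazily so the helpers below also work without pdfminer.six.
    from pdfminer.high_level import extract_text
    return extract_text(path)


def find_drucksachen(text):
    """Return Drucksache-like numbers in order of first appearance."""
    return list(dict.fromkeys(DRUCKSACHE_RE.findall(text)))


def aggregate(records):
    """Group (drucksache, state, vote) records into {drucksache: {state: vote}}."""
    by_drucksache = defaultdict(dict)
    for drucksache, state, vote in records:
        by_drucksache[drucksache][state] = vote
    return dict(by_drucksache)


# Hypothetical records, as a per-state parser might produce them.
sample_records = [
    ("123/18", "Berlin", "Zustimmung"),
    ("123/18", "Bayern", "Enthaltung"),
    ("45/18", "Berlin", "Ablehnung"),
]
sample_result = aggregate(sample_records)
```

Keeping the per-state parsing separate from `aggregate` means each of the 16 sites can get its own scraper while the final table is built the same way.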

Context: https://fragdenstaat.de/blog/2018/bundesrat-transparenz-klage/

Contact person: @jfilter

jfilter, Nov 12 '18 22:11

Also important: finding out how the voting works in detail. Is there always a Drucksache (hint: no)? Which process, and what part of it, actually appears on the websites? Is it the same for all states? But yes, this is a great project! Voting in the Bundesrat has been a sort-of secret, so you could be the first to find patterns!

stefanw, Nov 13 '18 08:11

I created a separate repo for scraping this:

https://github.com/okfde/bundesrat-scraper

stefanw, Dec 07 '18 09:12