psData
Summer Hackathon
Sorry everyone, I've been kind of overwhelmed with other projects the past couple of weeks.
I was thinking that to get this thing started up again it might be good for interested parties to think of a few days to a week in the summer that they would be available to work together on this close to full time.
Any interest? Preferred times?
I may be coming to Berlin OKFest. Will decide by June. If yes, that could be an ideal time. Otherwise I am not sure I will have a chance to contribute to this particular project, as my hands are full with running rOpenGov infra issues.
I'll be in Berlin for OKFest, so this might work very nicely :+1:
That's great. I'll be in Berlin as well. So we should definitely meet up to cover some ground on this.
Dear all,
OKFest 2014 Berlin (July 15-17) has published its provisional programme, and I'm thinking of buying train tickets as soon as possible to avoid the typically high fares on Franco-German lines.
May I ask who will be in Berlin this July, and when?
I'll be in Berlin from 13 July. I'm clearing time in my schedule to work on it that week.
I'm thinking of submitting a talk to the CSVConf which is a fringe event of the OKFest.
Any thoughts?
I will let you know soon about my possible participation; it seems likely, and it would be great to meet.
I need to focus on pending issues with rOpenGov but we can hack together and strengthen the connections across these activities.
CSVconf talk is an option, too.
I agree, CSVconf sounds like a cool event.
Great.
I quickly put together a talk description based on the original blog post I made introducing the package (It's supposed to be about a paragraph). Any comments are of course very welcome (especially for suggestions to slim it down). The deadline for submissions is 31 May:
Improving access to panel-series political science data with psData
There are many commonly used, electronically available panel-series data sets in political science. However, downloading, cleaning, and merging them together is time-consuming. For example, accessing and combining Reinhart and Rogoff's fiscal costs of financial crisis data involves downloading, cleaning, and merging 4 Excel files with over 70 individual sheets, one for each country’s data.
Researchers also regularly use variables that are combinations and/or transformations of indicators in regularly updated data sets, but which themselves aren’t regularly updated. For example, Bueno de Mesquita et al. (2003) devised two variables that they called the ‘winset’ and the ‘selectorate’. These are basically specific combinations of data in two other regularly maintained data sets. However, the winset and selectorate variables haven’t been updated alongside updates to the underlying data.
In this talk we introduce the psData R package developed under rOpenGov to solve two problems:
- Time wasted by political scientists (and their RAs) downloading, cleaning, and transforming commonly used data sets for their own research.
- Errors introduced each time custom data importation/transformation scripts are written to do what are in fact routine tasks across the community.
The psData package aims to address these problems by distributing easy to use R functions for downloading, cleaning, and merging political science panel-series data. The package is hosted on GitHub and can be easily added to and modified by the community. When an error is found in a data importation/transformation function it can be fixed and the patch distributed to all users simultaneously, improving data quality across the entire community with minimal extra effort.
Hello,
Here's a remix of the abstract. I have removed the selectorate/winset example to trim it down, and tried to formulate the project in the most generic terms possible.
I like it a lot!
If everyone else is ok with it, all we need are the number of rOpenGov OKFest attendees and we should be set.
Hi, yes, cool! I think we can only give estimates of participant numbers so far, and I guess the exact numbers are not necessary, so how about this for the concluding sentence (feel free to modify in any way you see fit): "The team behind the rOpenGov/psData project currently includes contributors from universities in three [Finland, Germany, ...?] countries, and will be present in Berlin at the time of the conference."
From the Github members list it looks like there are at least people based in: Finland, Germany, Denmark, US. @briatte where are you based these days?
This will be during my summer holiday and I'll be travelling. Wish I could be there, though.
I'm based in France (Paris and Lille). This project is almost eligible for EU funding :)
Great, how about:
The team behind the rOpenGov/psData project currently includes contributors from universities in five countries, and many will be present in Berlin at the time of the conference.
Sorry we'll miss you @leeper. Have good travels!
Just submitted the talk idea.
Fingers crossed!
Yess, good luck!
@christophergandrud I will come to Berlin, and it seems some other Finnish rOpenGov contributors will too. We should certainly meet and discuss all ideas - see you soon!
@antagomir Great! I'm all signed up for the conference
Maybe we all should set a time to meet up. When would be good for you guys?
Great! I'm only 90% certain that I'll be coming, so I didn't submit anything. csv,conf does look good and I'll probably attend that, but otherwise I haven't checked the programme in detail yet. I haven't booked any flights yet either, so plus or minus 1 day from the conference might be doable as well.
I'll most certainly be there too on July 15-17, possibly +/- a day. During the conference most times are fine, I think; I have not really checked the program yet. Perhaps we should meet already on the first conference day, July 15 (over lunch perhaps?), and then continue during the conference when more discussion topics pop up?
Hi all,
I'll be in Berlin from July 15 to July 20, with a French mobile number that should work for texting. It will be a true pleasure to meet you all and get back to work on psData.
@antagomir unfortunately, my flight won't get me into central Berlin before 3-4pm on July 15th. Any chance we could meet slightly later that day? I land at TXL around 2:30pm and can be near Alexanderplatz one hour later.
See you there!
I'm fine meeting later on the 15th if that works better. I live in the general area of the conference/Alexanderplatz, so I'm pretty flexible.
Hi everyone
We got the CSVconf proposal accepted! The presentation is on the 15th. I'm happy to give it/coordinate with anyone interested.
Congratz! If you'd like comments on a draft or anything, just send a msg.
Great. I think I'll just do a slide deck laying out our motivation, goals, and what we've done so far.
Great news!
Unfortunately, my plane probably lands too late for me to attend on the 15th.
Shall we pick a time and place for all of us to meet on that day?
I'm afraid I'll have to skip the CSV,conf after all. I just booked my flights and I'll land in Berlin 19:35 on the 15th. I'm guessing I could make it to around Alexanderplatz ~21:00 which is pretty late. Let me know if you're up for a latish dinner/beer etc. I'll be flying off afternoon of Friday the 18th, so I'm free that morning as well.