psData icon indicating copy to clipboard operation
psData copied to clipboard

Summer Hackathon

Open christophergandrud opened this issue 10 years ago • 50 comments

Sorry everyone, I've been kind of overwhelmed with other projects the past couple of weeks.

I was thinking that to get this thing started up again it might be good for interested parties to think of a few days to a week in the summer that they would be available to work together on this close to full time.

Any interest? Preferred times?

christophergandrud avatar Apr 17 '14 15:04 christophergandrud

I may be coming to Berlin OKFest. Will decide by June. If yes, that could be ideal time. Otherwise I am not sure if I have a chance to contribute on this particular project as the hands are full with running rOpenGov infra issues.

antagomir avatar Apr 17 '14 17:04 antagomir

I'll be in Berlin for OKFest, so this might work very nicely :+1:

briatte avatar Apr 17 '14 21:04 briatte

That's great. I'll be in Berlin as well. So we should definitely meet up to cover some ground on this.

christophergandrud avatar Apr 18 '14 05:04 christophergandrud

Dear all,

OKFest 2014 Berlin (July 15-17) has published its provisional programme, and I'm thinking of buying train tickets as soon as possible to avoid the typical high fares on France-German lines.

May I ask who will be in Berlin this July, and when?

briatte avatar May 27 '14 16:05 briatte

I'll be in Berlin from 13 July. I'm clearing time in my schedule to work on it that week.

christophergandrud avatar May 27 '14 16:05 christophergandrud

I'm thinking of submitting a talk to the CSVConf which is a fringe event of the OKFest.

Any thoughts?

christophergandrud avatar May 28 '14 06:05 christophergandrud

I will inform you soon about my possible participation, seems likely and would be great to meet.

I need to focus on pending issues with rOpenGov but we can hack together and strengthen the connections across these activities.

CSVconf talk is an option, too.

antagomir avatar May 28 '14 10:05 antagomir

I agree, CSVconf sounds like a cool event.

briatte avatar May 28 '14 13:05 briatte

Great.

I quickly put together a talk description based on the original blog post I made introducing the package (It's supposed to be about a paragraph). Any comments are of course very welcome (especially for suggestions to slim it down). The deadline for submissions is 31 May:


Improving access to panel-series political science data with psData

There are many commonly used, electronically available panel-series data sets in political science. However, downloading, cleaning, and merging them together is time consuming. For example, accessing and combining Reinhart and Rogoff's fiscal costs of financial crisis data, involves downloading, cleaning, and merging 4 Excel files with over 70 individual sheets, one for each country’s data.

Researchers also regularly use variables that are combinations and/or transformations of indicators in regularly updated data sets, but which themselves aren’t regularly updated. For example, Bueno de Mesquita et al. (2003) devised two variables that they called the ‘winset’ and the ‘selectorate’. These are basically specific combinations of data in two other regularly maintained data sets. However, the winset and selectorate variables haven’t been updated alongside updates to the underlying data.

In this talk we introduce the psData R package developed under rOpenGov to solve two problems:

  1. Time wasted by political scientists (and their RAs) downloading, cleaning, and transforming commonly used data sets for their own research.
  2. Errors introduced each time custom data importation/transformation scripts are written to do what are in fact routine tasks across the community.

The psData package aims to address these problems by distributing easy to use R functions for downloading, cleaning, and merging political science panel-series data. The package is hosted on GitHub and can be easily added to and modified by the community. When an error is found in a data importation/transformation function it can be fixed and the patch distributed to all users simultaneously, improving data quality across the entire community with minimal extra effort.

christophergandrud avatar May 28 '14 15:05 christophergandrud

Hello,

Here's a remix of the abstract. I have removed the selectorate/winset example to trim it down, and tried to formulate the project in the most possible generic terms.

briatte avatar May 29 '14 09:05 briatte

I like it a lot!

If everyone else is ok with it, all we need are the number of rOpenGov OKFest attendees and we should be set.

christophergandrud avatar May 29 '14 09:05 christophergandrud

Hi, yes cool ! I think we can only give estimates of participant number so far, and I guess the extact numbers are not necessary so how about this for the concluding sentence (feel free to modify in any way you see fit): "The team behind the rOpenGov/psData project currently includes contributors from universities in three [Finland, Germany, ...?] countries, and will be present in Berlin at the time of the conference."

antagomir avatar May 29 '14 10:05 antagomir

From the Github members list it looks like there are at least people based in: Finland, Germany, Denmark, US. @briatte where are you based these days?

christophergandrud avatar May 29 '14 10:05 christophergandrud

This will be during my summer holiday and I'll be travelling. Wish I could be there, though.

On Thu, May 29, 2014 at 12:16 PM, Christopher Gandrud < [email protected]> wrote:

From the Github members list it looks like there are at least people based in: Finland, Germany, Denmark, US. @briatte https://github.com/briattewhere are you based these days?

— Reply to this email directly or view it on GitHubhttps://github.com/rOpenGov/psData/issues/12#issuecomment-44517975 .

leeper avatar May 29 '14 11:05 leeper

I'm based in France (Paris and Lille). This project is almost eligible for EU funding :) On May 29, 2014 1:12 PM, "Thomas J. Leeper" [email protected] wrote:

This will be during my summer holiday and I'll be travelling. Wish I could be there, though.

On Thu, May 29, 2014 at 12:16 PM, Christopher Gandrud < [email protected]> wrote:

From the Github members list it looks like there are at least people based in: Finland, Germany, Denmark, US. @briatte https://github.com/briattewhere are you based these days?

— Reply to this email directly or view it on GitHub< https://github.com/rOpenGov/psData/issues/12#issuecomment-44517975> .

— Reply to this email directly or view it on GitHub https://github.com/rOpenGov/psData/issues/12#issuecomment-44521745.

briatte avatar May 29 '14 15:05 briatte

Great, how about:

The team behind the rOpenGov/psData project currently includes contributors from universities in five countries, and many will be present in Berlin at the time of the conference.

christophergandrud avatar May 29 '14 15:05 christophergandrud

Sorry we'll miss you @leeper. Have good travels!

christophergandrud avatar May 29 '14 15:05 christophergandrud

Just submitted the talk idea.

Fingers crossed!

christophergandrud avatar May 30 '14 13:05 christophergandrud

Yess, good luck!

antagomir avatar May 31 '14 10:05 antagomir

@christophergandrud I will come to Berlin, and it seems also some other Finnish rOpenGov contributors. We should certainly meet and discuss all ideas - see you soon!

antagomir avatar Jun 08 '14 23:06 antagomir

@antagomir Great! I'm all signed up for the conference

Maybe we all should set a time to meet up. When would be good for you guys?

christophergandrud avatar Jun 09 '14 07:06 christophergandrud

Great! I'm only 90% certain that I'll be coming, so I didn't submit anything. csv,conf does look good and I'll probably attend that, but otherwise I haven't checked to programme in detail yet. I haven't booked any flights yet either, so plus-minus 1 day from the conference might be doable as well.

jlehtoma avatar Jun 09 '14 11:06 jlehtoma

I'm also there most certainly on July 15-17, possibly +/- day. During the conference most times are fine I think, I did not really check the program yet. Perhaps we should meet already on the first conference day July 15 (over lunch perhaps?), then we can continue during the conference when more discussion topics pop up?

antagomir avatar Jun 09 '14 12:06 antagomir

Hi all,

I'll be in Berlin from July 15 to July 20, with a French mobile number that should work for texting. It will be a true pleasure to meet you all and get back to work on psData.

screen shot 2014-06-11 at 10 18 41 am

@antagomir unfortunately, my flight won't get me in central Berlin before 3-4pm on July 15th. Any chance we could meet slightly later that day? I land in TXL round 2.30pm, can be near Alexanderplatz one hour later.

See you there!

briatte avatar Jun 11 '14 08:06 briatte

I'm fine meeting later on the 15 if that works better. I live in the general area of the conference/Alexanderplatz so am pretty flexible.

christophergandrud avatar Jun 11 '14 08:06 christophergandrud

Hi everyone

We got the CSVconf proposal accepted! The presentation is on the 15th. I'm happy to give it/coordinate with anyone interested.

christophergandrud avatar Jun 17 '14 06:06 christophergandrud

Congratz! If you wish comments to a draft or anything just send a msg.

antagomir avatar Jun 17 '14 08:06 antagomir

Great. I think I'll just do a slide deck laying out our motivation, goals, and what we've done so far.

christophergandrud avatar Jun 17 '14 09:06 christophergandrud

Great news!

Unfortunately, my plane probably lands too late for me to attend on the 15th.

Shall we pick up a time and place to all meet on that day?

briatte avatar Jun 17 '14 10:06 briatte

I'm afraid I'll have to skip the CSV,conf after all. I just booked my flights and I'll land in Berlin 19:35 on the 15th. I'm guessing I could make it to around Alexanderplatz ~21:00 which is pretty late. Let me know if you're up for a latish dinner/beer etc. I'll be flying off afternoon of Friday the 18th, so I'm free that morning as well.

jlehtoma avatar Jun 19 '14 16:06 jlehtoma