airlines icon indicating copy to clipboard operation
airlines copied to clipboard

convert all times to timestamps

Open beanumber opened this issue 9 years ago • 10 comments

This would be "cleaner". Would it improve performance? Should they be indexed?

beanumber avatar Aug 27 '15 19:08 beanumber

If so, this would help to clarify #5

beanumber avatar Aug 28 '15 20:08 beanumber

Note that the original data contains FlightDate.

beanumber avatar Aug 31 '15 18:08 beanumber

Does this affect my installation? Does it impact the pending request at Amherst? If so, can you please reply to my earlier note to Aaron Coburn?

Thanks in advance,

Nick

On Aug 31, 2015, at 2:28 PM, Ben Baumer [email protected] wrote:

Note that the original data contains FlightDate.

— Reply to this email directly or view it on GitHub.

nicholasjhorton avatar Aug 31 '15 18:08 nicholasjhorton

This doesn't (yet), but the addition of cancelled does. The good news is I fixed the create bug #17.

beanumber avatar Aug 31 '15 19:08 beanumber

See (https://github.com/hadley/nycflights13/issues/11)

beanumber avatar Jan 06 '16 05:01 beanumber

Note that hadley has made this change to nycflights13

beanumber avatar Apr 29 '16 12:04 beanumber

This is done.

beanumber avatar May 10 '16 18:05 beanumber

So now I am thinking that we do this differently. I am thinking:

  • store dep_time and arr_time as POSIXct timestamps
  • store dep_delay and arr_delay as integers
  • drop year, month, day, sched_dep_time, sched_arr_time, hour, minute, and time_hour

That might actually reduce the file size quite a bit. It would break compatibility with @hadley 's nycflights13, but nycflights13 would still be recoverable from airlines -- it would just require a little extra wrangling.

beanumber avatar May 12 '16 20:05 beanumber

@nicholasjhorton what do you think of this idea?

beanumber avatar Jun 09 '16 13:06 beanumber

Sounds reasonable to me.

Nick

On Jun 9, 2016, at 9:40 AM, Ben Baumer [email protected] wrote:

@nicholasjhorton what do you think of this idea?

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or mute the thread.

Nicholas Horton Professor of Statistics Department of Mathematics and Statistics, Amherst College Box 2239, 31 Quadrangle Dr Amherst, MA 01002-5000 https://www.amherst.edu/people/facstaff/nhorton

nicholasjhorton avatar Jun 09 '16 14:06 nicholasjhorton