airlines
airlines copied to clipboard
convert all times to timestamps
This would be "cleaner". Would it improve performance? Should they be indexed?
If so, this would help to clarify #5
Note that the original data contains FlightDate
.
Does this affect my installation? Does it impact the pending request at Amherst? If so, can you please reply to my earlier note to Aaron Coburn?
Thanks in advance,
Nick
On Aug 31, 2015, at 2:28 PM, Ben Baumer [email protected] wrote:
Note that the original data contains FlightDate.
— Reply to this email directly or view it on GitHub.
This doesn't (yet), but the addition of cancelled
does. The good news is I fixed the create
bug #17.
See (https://github.com/hadley/nycflights13/issues/11)
Note that hadley has made this change to nycflights13
This is done.
So now I am thinking that we do this differently. I am thinking:
- store
dep_time
andarr_time
asPOSIXct
timestamps - store
dep_delay
andarr_delay
as integers - drop
year
,month
,day
,sched_dep_time
,sched_arr_time
,hour
,minute
, andtime_hour
That might actually reduce the file size quite a bit. It would break compatibility with @hadley 's nycflights13
, but nycflights13
would still be recoverable from airlines
-- it would just require a little extra wrangling.
@nicholasjhorton what do you think of this idea?
Sounds reasonable to me.
Nick
On Jun 9, 2016, at 9:40 AM, Ben Baumer [email protected] wrote:
@nicholasjhorton what do you think of this idea?
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or mute the thread.
Nicholas Horton Professor of Statistics Department of Mathematics and Statistics, Amherst College Box 2239, 31 Quadrangle Dr Amherst, MA 01002-5000 https://www.amherst.edu/people/facstaff/nhorton