dione
dione copied to clipboard
Switch filesDF to HadoopRDD
current filesDF is both ugly and inefficient in terms of data locality. we should try to switch to something like HadoopRDD/NewHadoopRDD or something more natural to leverage the preferred locations functionality.