datascience-sp14
datascience-sp14 copied to clipboard
Repository for data science course Spring 14
Hello! I have read your excellent material on the lab8. I have appreciated the pictures that you have added explaining the partitioning of the data across the worker nodes. I...
Hello, I'm a new in big data field. I'm interesting learning spark, but Taiwan where I live is a few people about it. So I want to try to translate...
``` python half_hour_grouped = log_df.groupby(lambda row: pd.to_datetime(str(log_df['date'][row].hour) + ":" + str((log_df['date'][row].minute / 30)*30))) ``` is really smart but maybe better to highlight the nature of the integer division, ``` python...