GoodreadsScraper
The data parsing step adds spurious values
If the `dateutil.parser.parse` function cannot find a component of the timestamp (any of day, month or year), it fills that component in from its `default` argument, which falls back to the current date's components when not set.
This can cause problems in later steps of the analysis, where the filled-in values show up as spurious patterns in time series. It can be fixed by either:
- Collecting day, month and year in separate fields, using NaN where applicable
- Using NaN if the entire date cannot be captured.
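A minimal sketch of the first option, not taken from the scraper's code: the helper name `parse_date_components` and the two sentinel dates are assumptions made for illustration. It parses each string twice with different `default` values. Any component that changes between the two results must have come from the default rather than from the text, so it is reported as NaN.

```python
from datetime import datetime

from dateutil import parser

# Hypothetical sentinels: they differ in year, month and day, and both
# years are leap years so that "Feb 29" without a year still parses.
_DEFAULT_A = datetime(2000, 1, 1)
_DEFAULT_B = datetime(2004, 2, 2)


def parse_date_components(text):
    """Return (year, month, day) as floats, with NaN for missing components."""
    nan = float("nan")
    try:
        a = parser.parse(text, default=_DEFAULT_A)
        b = parser.parse(text, default=_DEFAULT_B)
    except (ValueError, OverflowError, TypeError):
        # Nothing usable could be parsed: the whole date is missing.
        return (nan, nan, nan)
    pairs = ((a.year, b.year), (a.month, b.month), (a.day, b.day))
    # A component present in the text is identical in both parses.
    return tuple(float(x) if x == y else nan for x, y in pairs)
```

For example, `parse_date_components("March 2015")` should yield the year and month with NaN for the day, instead of silently borrowing today's day of the month.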