Nick Doty
Is this the same as #352?
What can we pull from the IETF Datatracker? Or from RFCs which list affiliations of authors/editors in the credits? Or from some other data source? And then drop that into...
I'm also running into this problem. I thought we could fix it by setting the `errors='coerce'` option (which would create `NaT` for every instance where the datetime can't be figured...
The work on "tenure" can help with answering research questions about churn, but I don't know if we have results at this level of detail. My notebook from 2018...
I have produced notebooks on attendance and tenure, but they aren't integrated into the main repository yet. There is also definitely more work to be done on churn and factors...
Apologies that I got behind on this for quite a while! Anticipate getting this done by early next week.
Are we agreed on this? I also prefer using full URLs, but I thought that @davidberra was trying to switch the sample notebooks to shortnames, maybe for ease of use...
I would expect effective entity resolution will need to be O(n^2), because I don't think the similarity of names can be reduced to a single dimension along which you can...
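To illustrate the quadratic cost, here is a minimal sketch of all-pairs name comparison using only the standard library. The function name `similar_pairs`, the 0.85 threshold, and the sample names are assumptions made for illustration, not anything from the project's code.

```python
from difflib import SequenceMatcher
from itertools import combinations


def similar_pairs(names, threshold=0.85):
    """Compare every pair of names; return (matches, number of comparisons)."""
    matches = []
    comparisons = 0
    # combinations(names, 2) yields n * (n - 1) / 2 pairs, hence O(n^2).
    for a, b in combinations(names, 2):
        comparisons += 1
        score = SequenceMatcher(None, a.lower(), b.lower()).ratio()
        if score >= threshold:
            matches.append((a, b, score))
    return matches, comparisons


# Hypothetical sample data, not drawn from any real archive.
names = ["Alice Example", "Alice  Example", "A. Example", "Bob Sample"]
matches, comparisons = similar_pairs(names)
```

With 4 names this performs 6 comparisons; the count grows as n(n-1)/2. Techniques such as blocking can cut the number of pairs, but only if there is some key on which likely matches reliably agree.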
w3crawl has some particularly bad error handling on HTTP errors during mail collection, where it can stop the whole script or an entire archive's collection based on a transient network...
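One possible shape for more tolerant handling, as a sketch only: retry each request a few times with backoff, and treat a final failure as "skip this URL" rather than stopping the run. The names `fetch_with_retry` and `collect` are hypothetical and this is not w3crawl's actual code; the `fetch` and `sleep` parameters exist only so the behavior can be tried without a network.

```python
import time
import urllib.error
import urllib.request


def fetch_with_retry(url, attempts=3, delay=1.0, fetch=None, sleep=time.sleep):
    """Return the response body, or None if every attempt fails."""
    if fetch is None:
        fetch = lambda u: urllib.request.urlopen(u, timeout=30).read()
    for attempt in range(1, attempts + 1):
        try:
            return fetch(url)
        except (urllib.error.URLError, TimeoutError) as err:
            # HTTPError is a subclass of URLError, so HTTP status errors land here too.
            if attempt == attempts:
                print(f"giving up on {url}: {err}")
                return None
            # Exponential backoff before the next attempt.
            sleep(delay * 2 ** (attempt - 1))


def collect(urls, **kwargs):
    """Fetch each URL independently so one failure does not stop the rest."""
    results = {}
    for url in urls:
        body = fetch_with_retry(url, **kwargs)
        if body is not None:
            results[url] = body
    return results
```

A real fix would probably want to distinguish permanent errors (such as a 404) from transient ones instead of retrying everything the same way.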