Benjamin Schmidt

Results 85 comments of Benjamin Schmidt

Just want to +1 this, as an ideal version of a feature that would also be partially realized by https://github.com/observablehq/feedback/issues/209. I outlined my current use case on [Observable Forums](https://talk.observablehq.com/t/is-it-possible-to-import-a-notebook-from-a-file/6116/4) and...

Thanks James! This is as good a place to discuss as any. 5% error rate is just slightly higher than I'd have expected, but sounds right with name-merging issues. The...

I just want to note a couple classes of error I see on skimming the wiki edits. * Plenty digit errors (83 -> 33, etc.). When I've checked these on...

On merging: what I've done, and I believe @sergiocorreia has done as well, is to primarily identify a city as its census populations over time. [Matching code here](https://github.com/CreatingData/Historical-Populations/blob/master/merging_functions.py#L151-L174). So if...

Confirmed. In some cases this might be connected to the other issue I filed today; a foundation date might be better than nothing for back-extrapolation, possibly in concert with township...

Yeah, I also noticed 'platted' and saw this would require a bit of text mining to build up a list. I imagine this would have to be a hierarchy of...

debugging trace: ``` Nomic project: ```

Could you explain the notion of a "loading script"? I don't think I understand how the huggingface model--which seems to basically organized hierarchically--works with something like this. Especially around what...

It seems like it would be possible to create the dataset according to reasonable chunkings and then afterwards write any post-hoc loading scripts that seemed like they'd be especially important?...

As mentioned in the call, I'm slapping my parsing code online. As mentioned in the blog post this is all throwaway notebooks I wrote primarily just to get the Neue...