David Roher

Results 16 comments of David Roher

Second part is done, still need to do the first.

Create a running Jupyter notebook server inside the container and open up a port, so you could just navigate to `localhost:xxxx` and get a running notebook.

@prrapo You're welcome! This is another good reason to [decouple the data from the images](https://github.com/droher/boxball/issues/49) -- the `load` Dockerfiles all require an upstream image to load data from, and all...

Hmm, I put in some protection against this problem here, but looks like it's not working: https://github.com/droher/boxball/blob/72c7bc05993968b0897c1bcf9f662ed1e82b2776/extract/parsers/retrosheet.py#L61 I'll try to patch. Adding a general source column across all of these...

This hasn't been resolved in the code, but I've manually removed the duplicated games from my Retresheet fork, so the newly published version should be free of this bug.

Unlikely to get to it myself, but absolutely, would be great to have if you wanted to take a shot at it. The only requirement from my end would be...

1) See this issue here - https://github.com/droher/boxball/issues/68. Game type was not included in the raw files until this summer, and there's no released version of the game parsing software that...

The readme has a link to the schema doc, perhaps not prominently enough, where you can find the [schedule doc](https://github.com/droher/boxball/blob/0df9d3f68f1cb287ada6aafed18f2c3e3b04e441/transform/src/schemas/retrosheet.py#L84). I just pulled the `schedule.parquet` file down from OneDrive and...

Yep, that's definitely a bug - this time from gaps in the data that have since been corrected by Retrosheet. Those years will be filled on the next release (which...

This is awesome, thanks so much! With respect to style, it would probably easier to format using a package like https://github.com/Python-Markdown/markdown instead of raw string formatting. If you wanted to...