Joe Walsh

Results 9 issues of Joe Walsh

For the standard vs temporal cross validation notebook: - Add another common approach: do standard CV up to a point and then a holdout set at the end

the linked repos are private [https://github.com/dssg/hitchhikers-guide/tree/master/curriculum/reproducible-ETL](https://github.com/dssg/hitchhikers-guide/tree/master/curriculum/reproducible-ETL)

In the githubflow, needs a readme -- explanations of the empty files and github-flow.*

`testing.postgresql` runs `initdb` locally. The postgres service doesn't get used.

trivial

The postgres service in the github test suite is unnecessary: https://github.com/dssg/triage/blob/ab052a7a09a430e905823fb5cf4b47cb2ffbc361/.github/workflows/test.yaml#L9-L22 `testing.postgresql` initializes a local database. The service is created but goes unused. I ran `test.yaml` without the service and...

We've been using SQLAlchemy text type for UUID columns rather than Postgres's native UUID column type. (See [here](https://github.com/dssg/triage/blob/f15b86a15484663e948255a5b2251f1c41a38595/src/triage/component/results_schema/schema.py#L169) for an example.) The latter is 1) more efficient and 2) more...

Existing implementation only uses a few fields. Expand that.

e.g. we create the same ID for Joe Walsh every time we run the pipeline, even if the training data differ (so providers can create the same IDs using only...