Dazhong Xia

Results: 26 comments by Dazhong Xia

**TL;DR: apply-diffs is the best short-term solution. The medium-term step after that is moving the XBRL deduplication logic into our transformation framework so we can apply spot fixes. In...

We could in theory do this now that we can trigger builds on Batch. Doing this automatically sounds expensive, though - and the `workflow_dispatch` option already works. Do you think...

Hello! Resurrecting this issue since we're running into something similar with data.catalyst.coop as our database files have ballooned up to several GB. Our Cloud Run revisions now require huge amounts...

I was frustrated by this recently and did some more digging. Here's what I found:

- this appears to be happening *after* the pytest run completes, and, correspondingly, after the...

Sweet, thanks for digging into the bulk data @davidmudrauskas - hope finding all these quirks was fun in some way :) As for next steps, following in the footsteps of...

I think it's probably better to have our Resource contain references to frictionless classes - roughly corresponding to the idea of preferring "composition over inheritance." The benefit of this, to...
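To make the "composition over inheritance" idea concrete, here is a minimal sketch. It does not use the real frictionless API: `DescriptorStandIn` is a hypothetical stand-in for whichever third-party class would be wrapped, and the fields on `Resource` are assumptions chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class DescriptorStandIn:
    """Hypothetical stand-in for a third-party class (NOT the frictionless API)."""

    name: str
    path: str

    def to_dict(self) -> dict:
        return {"name": self.name, "path": self.path}


@dataclass
class Resource:
    """Our own class: it holds a reference to the descriptor rather than subclassing it."""

    descriptor: DescriptorStandIn
    description: str = ""

    @property
    def name(self) -> str:
        # Delegate to the wrapped object for the attributes we choose to expose.
        return self.descriptor.name

    def to_dict(self) -> dict:
        # Start from the wrapped object's output, then add our own metadata.
        out = self.descriptor.to_dict()
        out["description"] = self.description
        return out


resource = Resource(
    DescriptorStandIn(name="plants", path="plants.parquet"),
    description="Example table",
)
print(resource.to_dict())
```

In this arrangement `Resource` is not an instance of the wrapped class, so only the attributes it deliberately forwards become part of its interface.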

Tangible outcome here is:

* replicate being able to run ATB with bogus settings, then figure out why the bogus settings aren't breaking the ATB run. should have failed on...

@e-belfer might deal with this incidentally as part of integrating the new ATB.

FYI @aesharpe @zschira and I just spent some time looking into these issues - broadly we see three areas of changes: archival, extraction, and downstream transforms.

### archival

These changes...