joshua-oss
joshua-oss
Convert full rewritten SQL AST to yarrow analysis graph to run whole query in-memory
Can implement with cross join and right outer join, or by caching distinct values and post-processing. If post-processing, unsorted results should be shuffled
https://github.com/opendp/smartnoise-samples/blob/master/whitepaper-demos/5-ml-synthetic-data.ipynb In the cell with `def QuailSynth(epsilon)`: Complains because epsilon is now a required param. Adding epsilon to the `PyTorchDPSynthesizer` and the gan clears that error, but then the code...
data:image/s3,"s3://crabby-images/f495c/f495cd91a70963b91357cf8bf944b104df0b7fd3" alt="patectgan" PATECTGAN performs poorly with categorical values, and seems to have been broken since at least 0.2.1. Continuous values work OK in PATECTGAN, and categorical and continuous both work OK...
Transforms are now capable of generating metadata, which introduces a dependency between smartnoise-sql and smartnoise-synth. These packages should be able to be independently installed. A possible solution is to move...
Tabular queries and tabular synthetic data are amenable to attack by the "Leave One Out" idealized attack in [1]. This can be used in conjunction with the Bayesian empirical privacy...
There are common cases where categorical columns have strong dependencies which are considered public and should be preserved. For example, a table that has State and ZIP code columns should...
`pip install opendp` into a fresh Conda environment succeeds, but attempting to import the library throws exception: "Expected exactly one binary to be present". **To Reproduce** (device is Intel x64)...