QUIPP-pipeline
Plausible deniability synthesiser implementation
(See further discussion in #60 from QUIPP-collab)
Implement the Plausible Deniability metric (code here) as a "privacy metric" in the pipeline.
- [ ] include the implementation directly as both a method and a privacy measure
- [ ] customize for any method that depends on a seed (e.g. synthpop - #81)
Context: the implementation above needs a few preparatory steps, plus some additional metadata that can be computed from the input data. The implementation does not compute this metadata itself.
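For illustration, here is a minimal sketch of how this kind of metadata could be derived from the input data: a reproducible split of the rows into two disjoint parts, and the set of distinct values per column. The function names `split_rows` and `column_value_sets` are hypothetical, the rows are assumed to be already loaded as a list of dicts, and the output is not in the format sgf requires (that format is not reproduced here).

```python
import random


def split_rows(rows, stats_fraction=0.5, seed=0):
    """Shuffle rows reproducibly and split them into two disjoint parts.

    Hypothetical helper: the first part could serve as the "stats"
    (training) set and the second as the "records" (generating) set.
    """
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * stats_fraction)
    return shuffled[:cut], shuffled[cut:]


def column_value_sets(rows):
    """Map each column name to the sorted list of distinct values it takes.

    Assumes the values within a column are mutually comparable,
    otherwise sorted() raises TypeError.
    """
    values = {}
    for row in rows:
        for column, value in row.items():
            values.setdefault(column, set()).add(value)
    return {column: sorted(vals) for column, vals in values.items()}


if __name__ == "__main__":
    rows = [
        {"age": 30, "sex": "F"},
        {"age": 41, "sex": "M"},
        {"age": 30, "sex": "M"},
        {"age": 25, "sex": "F"},
    ]
    stats, records = split_rows(rows, stats_fraction=0.5, seed=1)
    print(len(stats), len(records))
    print(column_value_sets(rows))
```

The fixed seed keeps the split reproducible between pipeline runs. The even split is only a placeholder, since the right proportion depends on the method's parameters.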
Steps:
- [x] make example parameter json file for the method
- [ ] script to generate additional metadata from the input data, in the format required by sgf
- [x] create configuration files (from parameter json and data) (this is "my.cfg" in the sgf example)
- [x] split data ("stats" - training, "records" - generating)
- [x] "attrs" - set of values in a column
- [x] "grps" - bins: treat as binwidth of 1, for now (this will be the same as "attrs", without the label)
- [ ] DAG - a function to compute the thresholded covariance matrix, and a function to compute the merit score from it (see paper). The DAG format needs edge heads in vertex order and a separate traversal order, which must be a topological order. See README.pdf
- [ ] Write "run" for the method (for Makefile/pipeline)
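As a starting point for the DAG step, here is a rough sketch under stated assumptions. It computes a sample covariance matrix, keeps the pairs whose absolute covariance exceeds a threshold, and derives a topological traversal order with Kahn's algorithm. Orienting each kept edge from the lower to the higher column index is an assumption made here only to guarantee acyclicity; it is not the structure-learning procedure from the paper. The merit score and the sgf DAG file format are not covered. All function names are hypothetical.

```python
from collections import deque


def covariance_matrix(columns):
    """Sample covariance between every pair of equal-length numeric columns."""
    n = len(columns[0])
    k = len(columns)
    means = [sum(col) / n for col in columns]
    cov = [[0.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(k):
            cov[i][j] = sum(
                (a - means[i]) * (b - means[j])
                for a, b in zip(columns[i], columns[j])
            ) / (n - 1)
    return cov


def threshold_edges(cov, threshold):
    """Keep pairs i < j with |cov| above the threshold, oriented i -> j.

    The i -> j orientation is an illustrative assumption that keeps the
    graph acyclic; it is not taken from the paper.
    """
    k = len(cov)
    return [
        (i, j)
        for i in range(k)
        for j in range(i + 1, k)
        if abs(cov[i][j]) > threshold
    ]


def topological_order(num_vertices, edges):
    """Kahn's algorithm; raises ValueError if the graph has a cycle."""
    children = [[] for _ in range(num_vertices)]
    indegree = [0] * num_vertices
    for tail, head in edges:
        children[tail].append(head)
        indegree[head] += 1
    ready = deque(v for v in range(num_vertices) if indegree[v] == 0)
    order = []
    while ready:
        vertex = ready.popleft()
        order.append(vertex)
        for child in children[vertex]:
            indegree[child] -= 1
            if indegree[child] == 0:
                ready.append(child)
    if len(order) != num_vertices:
        raise ValueError("graph has a cycle")
    return order


if __name__ == "__main__":
    columns = [[1, 2, 3, 4], [2, 4, 6, 8], [1, -1, 1, -1]]
    edges = threshold_edges(covariance_matrix(columns), threshold=1.0)
    print(edges)
    print(topological_order(len(columns), edges))
```

Whichever rule ends up orienting the edges, checking the result with `topological_order` is a cheap way to confirm the graph is a valid DAG before writing it out.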