QUIPP-pipeline icon indicating copy to clipboard operation
QUIPP-pipeline copied to clipboard

Plausible deniability synthesiser implementation

Open crangelsmith opened this issue 4 years ago • 1 comments

(See further discussion in #60 from QUIPP-collab)

Implement (as a "privacy metric" in the pipeline), the Plausible Deniability metric (code here)

  • [ ] include the implementation directly as both a method and a privacy measure
  • [ ] customize for any method that depends on a seed (e.g. synthpop - #81)

crangelsmith avatar Feb 28 '20 14:02 crangelsmith

Context: the implementation above needs a few steps, and some additional metadata that can be computed from the input data (which the implementation doesn't do).

Steps:

  • [x] make example parameter json file for the method
  • [ ] script to generate additional metadata from input data and in the format required by sgf
    • [x] create configuration files (from parameter json and data) (this is "my.cfg" in the sgf example)
    • [x] split data ("stats" - training, "records" - generating)
    • [x] "attrs" - set of values in a column
    • [x] "grps" - bins: treat as binwidth of 1, for now (this will be the same as "attrs", without the label)
    • [ ] DAG - function to compute thresholded covariance matrix, function to compute merit score from this (see paper). DAG format needs: edge heads in vertex order, separate traversal order (must be topological order). See README.pdf
  • [ ] Write "run" for the method (for Makefile/pipeline)

crangelsmith avatar Feb 28 '20 14:02 crangelsmith