QUIPP-pipeline
QUIPP-pipeline copied to clipboard
Privacy preserving synthetic data generation workflows
As part of the [Synthetic Data and Privacy Preservation - Turing/ONS partnership project 3](https://github.com/alan-turing-institute/synData-ONS), we're trying out the QUIPP pipeline on this dataset. Note: may or may not need to...
The workflow simpop_ci.yml is referencing action actions/checkout using references v1. However this reference is missing the commit [a6747255bd19d7a757dbdda8c654a9f84db19839](https://github.com/actions/checkout/commits/a6747255bd19d7a757dbdda8c654a9f84db19839) which may contain fix to the some vulnerability. The vulnerability fix that...
To get the CTGAN method to work I had to amend the import statement Perhaps there was a reason for it being that way though? **Edit:** Actually this may be...
- Refactor examples - Add markdown document on how to reproduce experiments
I'm running make with `enabled` set to true on `ctgan-example-0.json` but getting the following error despite CTGAN library being installed: ``` (py38) MAC-ATI0379:QUIPP-pipeline echalstrey$ make mkdir -p $(dirname synth-output/2011-census-test-1/synthetic_data_1.csv) &&...
Remove PATE-GAN https://github.com/alan-turing-institute/QUIPP-pipeline/tree/feature/dataset-modifications/synth-methods/PATE-GAN - [ ] Check if this is used in any examples - [ ] Remove code - [ ] Remove dependencies from README
This PR adds support for Jenning's and Sebastian's correlated rank similarity metric. Changes: - Adds three new methods to the `RankingSimilarity` class of `rbo.py`. These implement the correlated rank metric,...
## Summary (WIP) **Warning** ⚠️ ☢️ Because I only have implemented this for CTGAN merging to develop will break the whole pipeline. So should only merge once/if every input type...