R-Palazzo
R-Palazzo
First pull request, Romain Palazzo for issue 119. Resolves #119 Thank you for your help :)
CU-86azzkknb Resolve #552 Resolve #555
### Error Description The minimum version workflow crashes when a library points to a Github branch. The fix should allow us to run the minimum version workflow with libraries pointing...
The goal is to propose a new column-pair metric between one numerical and one categorical column. ### Current behavior The Quality report has to discretize the numerical column and do...
The goal here is to make the `NewRowSynthesis` metric more fault-tolerant and make it faster/more efficient to run.
### Problem Description Have a method in `rdt/transformers/utils.py` `generate_seed_value` that could be used to generate seed value for the transformers that use `Faker`.
### Problem Description Currently, the `LabelEncoder(add_noise = True)` does not support 'category' pandas dtype. For instance, if I run the following code: ``` from rdt.transformers import LabelEncoder data_test = pd.DataFrame({'A':...
CU-86azvqpqe Resolve #1877 A few considerations regarding this PR: 1 - `NaNs handling`: Currently, I don't drop NaN `foreign keys` 2 - `Randomness`: For reproducibility, I set a seed, is...
### Problem Description Both `BaseSynthesizer` and `BaseMultiTableSynthesizer` contain a method called `_check_metadata_updated`. This method is intended to raise a warning if the metadata has been updated before the synthesizer definition...
CU-86b07h9tx Resolve #294