Srini Kadamati
Srini Kadamati
## Problem Description Inspired by this Alexandra Fogg's issue (https://github.com/sdv-dev/Copulas/issues/387), academic journals strongly prefer using DOI's (digital object identifiers) when referencing external work. ## Solution To make it easier for...
### Environment Details Happening in Mac & Colab in both SDV 1.11 and 1.12 (haven't tried other versions) ### Error Description When using PARSynthesizer, supplying a `sequence_key` but not a...
**Problem Description** A SDV public user trained a synthesizer in SDV 1.11, saved the model PKL file, and then tried to import it into SDV 1.12. During import, SDV was...
### Environment Details SDV 1.12 ### Problem Description In our current implementation, conditional sampling thousands of rows generates thousands of pandas `FutureWarning` messages. This can actually crash Jupyter Notebook /...
### Problem Description Every column that the SDV synthesizes falls into 1 of 2 buckets: - **Modeled Columns:** The data in these columns are modeled, eg. numerical, datetime, boolean or...
### Problem Description When training synthesizer models, especially when using GAN based approaches on large datasets, it can take many hours for the computation to finish and the progress bar...
Added XetHub to the list
**Problem Description** I want to improve my ability to evaluate synthesizers with different parameters, in different environments, and against each other. **Expected behavior** As a user, I'd like the Synthesizer...
Hey there! I work at XetHub and we built a GitHub app that scales GitHub repos to handle large files (upto 100 TB). No cost for public repos. Here's an...
### Problem Description In some sequential datasets, categorical columns might follow a reliable pattern that I want my synthetic data to also follow. One example is event stream data, where...