Srini Kadamati comments

Results 111 comments of


                                            Srini Kadamati

Error when using a datetime column as a context column with PAR Synthesizer

I reproduced the error internally in this Colab Notebook: https://colab.research.google.com/drive/1SW5WxJgU5Y2ykmP0t793a5OE-LxKsw5H?authuser=1#scrollTo=sHSODwrsjwZ9

PARSynthesizer samples uniformly distributed time series data

Hi @ardulat without metadata, this might be challenging to debug but let's try! - Are the synthetic distributions uniform for context columns and non-context columns? - How many columns fall...

PARSynthesizer samples uniformly distributed time series data

Hi @ardulat apologize for the delay! It seems like there are a few issues here to discuss: **Uniform distribution for some of the time series columns** I will attempt to...

PARSynthesizer samples uniformly distributed time series data

Hi @ardulat unfortunately I wasn't able to recreate any of these issues with my own fake dataset that has matching metadata as yours. It's likely my fake dataset is too...

PARSynthesizer samples uniformly distributed time series data

Awesome @ardulat I'll take a look today and circle back! Full disclosure though, some (or all) of these might just be issues we need to open, track, and eventually address.

PARSynthesizer samples uniformly distributed time series data

Hi @ardulat when I tried the PARSynthesizer workflow with the `train.csv` dataset you shared with me, I noticed that the `steps` column seemed to be float values (e.g. 6979.0, 5104.0,...

PARSynthesizer samples uniformly distributed time series data

@ardulat I see what you mean now! Even if a column contains only integer values, if it has missing values then usually pandas will assign the float dtype to it....

PARSynthesizer samples uniformly distributed time series data

@ardulat after chatting internally, it looks like the experience around float values in the synthetic data is our intended experience. In pandas, integer columns that contain missing values are converted...

Synthetic data from PARSynthesizer does not follow original data distribution

Hi @PaudGS 👋 At the moment, our single table and multi table synthesizers are definitely a bit more mature than PARSynthesizer, our sequential synthesizer. So this difference alone might be...

Synthetic data from PARSynthesizer does not follow original data distribution

Thanks for sharing this @PaudGS I'll leave this thread open for now so we can collect more examples here from the community and hopefully invest in improving the experience over...