Support for pyspark
Problem Description
Expected behavior
<Replace this a clear and concise description of what you would expect SDV with regards with the described problem. If possible, explain how you would like to interact with SDV and what the outcome of this interaction would be.>
Additional context
<Please provide any additional context that may be relevant to the issue here. If none, please remove this section.>
Hi @sm823zw, nice to meet you.
We prioritize feature requests based on demand and usage needs. Would you be able to provide some more context about this request?
- Which synthesizer(s) are you using? Presumably, you need pyspark because the process was taking a long time. How long was it taking?
- What is the size of the training dataset? What does the dataset represent?
- What is the overall project that you're using SDV for?
- How much synthetic data do you want to sample?
- Once the synthetic data is made, how are you planning to use it?
And anything else that would help us to prioritize. Thanks.
I think this has been discussed in #573. I can expand some more if necessary. Cheers!