[TableGAN] ValueError: cannot reindex from a duplicate axis

Open tejuafonja opened this issue 3 years ago • 0 comments

Environment Details

Please indicate the following details about the environment in which you found the bug:

  • SDGym version: 0.5.0
  • Python version: 3.7.13
  • Operating System: Linux (run on colab)

Error Description

Running sdgym.run(synthesizers=[TableGAN], datasets=["adult"], show_progress=True, iterations=1) raises a ValueError (cannot reindex from a duplicate axis) for the TableGAN synthesizer. The error seems to be triggered by synthetic_data.update(ht.reverse_transform(sampled_data)) on line 183 of the base.py file.
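For context, here is a minimal sketch of how pandas produces this kind of error on its own, independent of SDGym. It assumes the reverse-transformed frame ends up with repeated index labels, which has not been confirmed for this bug. The frame name `sampled` and its values are made up for illustration, and the exact wording of the message depends on the pandas version.

```python
import pandas as pd

# Hypothetical stand-in for sampled output: index label 0 appears twice.
sampled = pd.DataFrame({"age": [39, 50, 38]}, index=[0, 0, 1])

# Reindexing a frame whose index holds duplicate labels is ambiguous,
# so pandas refuses with a ValueError.
try:
    sampled.reindex([0, 1, 2])
    error = None
except ValueError as exc:
    error = exc

print(type(error).__name__, error)

# After replacing the index with a fresh unique one, the same call succeeds.
fixed = sampled.reset_index(drop=True).reindex([0, 1, 2])
print(fixed)
```

If the assumption holds, checking `ht.reverse_transform(sampled_data).index.is_unique` just before the update call would show whether duplicate labels are involved.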

Steps to reproduce

import sdgym
from sdgym.synthesizers import TableGAN
sdgym.run(synthesizers=[TableGAN], datasets=["adult"], show_progress=True, iterations=1)

You can also run this colab notebook to reproduce the error.

tejuafonja • Mar 29 '22 14:03