[BUG-REPORT] cannot request duplicate column names before export to arrow or parquet

Open Ben-Epstein opened this issue 1 year ago • 0 comments

Description

Selecting a list of columns that contains duplicate names (for example ["x", "x", "y", "z", "y"]) and then exporting the DataFrame to Arrow or Parquet fails. Exporting the same DataFrame to CSV works, and exporting to Arrow or Parquet also works once the column list has been deduplicated.

Software information

  • Vaex version (import vaex; vaex.__version__): 4.12.10
  • Vaex was installed via: pip
  • OS: Mac/Linux

Additional information

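Failing case (duplicate names in the column selection):

import vaex

df = vaex.example()

# Select columns with repeated names
cols = ["x", "x", "y", "z", "y"]
df = df[cols]
df.export("file.arrow")  # fails
df.export("file.parquet")  # fails
df.export("file.csv")  # works

Working case (column names deduplicated before export):

import vaex

df = vaex.example()

cols = ["x", "x", "y", "z", "y"]
# set() drops the repeated names (note: it does not preserve order)
df = df[list(set(cols))]
df.export("file.arrow")  # works
df.export("file.parquet")  # works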
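
One caveat with the list(set(cols)) workaround: a set does not preserve order, so the exported columns may come out in a different order than requested. A minimal sketch of an order-preserving alternative, assuming the goal is only to drop the repeated names before export (the helper name unique_in_order is hypothetical and not part of Vaex):

```python
def unique_in_order(names):
    """Return names with duplicates removed, keeping first-seen order.

    Hypothetical helper for illustration; not part of the Vaex API.
    dict.fromkeys keeps insertion order (guaranteed since Python 3.7).
    """
    return list(dict.fromkeys(names))


cols = ["x", "x", "y", "z", "y"]
unique_cols = unique_in_order(cols)  # ["x", "y", "z"]
```

The result can then be used in place of list(set(cols)), i.e. df = df[unique_cols] before calling df.export(...). This only sidesteps the failure; it does not make an export with repeated column names possible.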

Ben-Epstein, Sep 02 '22 21:09