[Feature Request]: Implement 'DataFrame.get_dummies()' for deferred DataFrame
What would you like to happen?
We would like to use the DataFrame API to perform one-hot encoding on categorical columns. In pandas this can be done with the pd.get_dummies() function, but it is not currently supported for deferred DataFrames.
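For reference, here is a minimal sketch of the plain pandas behavior being requested. The column names and values are made up for illustration and do not come from any real pipeline. Note that the output columns depend on the values present in the data, which is presumably part of what makes this hard to support on a deferred DataFrame.

```python
import pandas as pd

# Hypothetical input: one categorical column and one numeric column.
df = pd.DataFrame({"color": ["red", "green", "red"], "size": [1, 2, 3]})

# One-hot encode only the categorical column; "size" is passed through unchanged.
encoded = pd.get_dummies(df, columns=["color"])

# The new column names are derived from the distinct values found in "color".
print(list(encoded.columns))
```

The request is for the equivalent call to work when `df` is a Beam deferred DataFrame rather than an in-memory pandas one.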
Issue Priority
Priority: 2
Issue Component
Component: dsl-dataframe
Let's track this as part of https://github.com/apache/beam/issues/20958