Jason Dai comments

Results: 106 comments of Jason Dai

convert sparkdf to pdf within arrow

You may refer to the Pandas UDF implementation in Spark, which uses Arrow to convert between Spark DataFrames and pandas DataFrames.
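As a rough illustration of the Arrow-based conversion mentioned above, PySpark exposes it through a session config. The sketch below assumes Spark 3.x, where the key is `spark.sql.execution.arrow.pyspark.enabled` (Spark 2.x used `spark.sql.execution.arrow.enabled`). The helper names `spark_to_pandas` and `pandas_to_spark` are hypothetical and are not part of BigDL or Spark.

```python
# Config key that turns on Arrow for DataFrame conversion (Spark 3.x name).
ARROW_CONF = "spark.sql.execution.arrow.pyspark.enabled"


def spark_to_pandas(spark, sdf):
    """Hypothetical helper: Spark DataFrame -> pandas DataFrame via Arrow.

    `spark` is expected to be a SparkSession and `sdf` a Spark DataFrame.
    """
    # Enable Arrow so toPandas() transfers columnar batches
    # instead of serializing rows one by one.
    spark.conf.set(ARROW_CONF, "true")
    return sdf.toPandas()


def pandas_to_spark(spark, pdf):
    """Hypothetical helper: pandas DataFrame -> Spark DataFrame via Arrow."""
    spark.conf.set(ARROW_CONF, "true")
    return spark.createDataFrame(pdf)
```

With a real session, this would be called as `spark_to_pandas(spark, spark.range(10))`. If Arrow cannot handle a column type, Spark's behavior depends on its fallback setting, so it is worth checking the result on real data.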

classification model tutorial

> > > XShards does not currently support 1) shuffling a dataframe, 2) astype (data type change), 3) train_test_split, 4) duplicating the whole dataframe according to one column. > > > >...

Exception happened if using orca estimator train tensorflow.keras model with xshards of pandas dataframe

@sgwhat please take a look

Exception happened if using orca estimator train tensorflow.keras model with xshards of pandas dataframe

See https://github.com/intel-analytics/BigDL/issues/4965#issuecomment-1184515330

More flexibility in PyTorch train input and outputs

I think we should accept a list of ndarrays as input (e.g., for XShards)? @yushan111 @sgwhat

[Orca] Tutorials for Submit and Run Programs on Yarn

Create a simple but meaningful Python project (e.g., one with multiple Python source files), and use that project as a running example throughout the tutorial.

[Orca] Tutorials for Submit and Run Programs on Yarn

@hkvision please take a look

[ppml] Add PPML tutorial

At the beginning of the tutorial (before the table of contents), we need a very short (one or two sentences) description of what BigDL PPML is from the...

Fix duplicate repartition in tf2 estimator spark backend

Why did we repartition previously? @jenniew

Fix duplicate repartition in tf2 estimator spark backend

> Why must the number of partitions equal the number of workers? Repartitioning is expensive; if the number of partitions is already larger than the number...
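The trade-off in the quoted question can be sketched as a small decision helper. This is a hypothetical policy for illustration only, not the BigDL implementation: the function name `partitions_to_request` and the `require_equal` flag are assumptions, and the thread does not say which rule the estimator actually follows.

```python
from typing import Optional


def partitions_to_request(num_partitions: int, num_workers: int,
                          require_equal: bool = True) -> Optional[int]:
    """Return the partition count to repartition to, or None to skip.

    Hypothetical policy, not taken from BigDL:
    - require_equal=True: repartition unless the counts already match.
    - require_equal=False: repartition only when there are fewer
      partitions than workers, avoiding a shuffle otherwise.
    """
    if num_partitions < 1 or num_workers < 1:
        raise ValueError("partition and worker counts must be positive")
    if num_partitions == num_workers:
        # Already aligned: a second repartition would be redundant.
        return None
    if not require_equal and num_partitions > num_workers:
        # Enough partitions to keep every worker busy; skip the shuffle.
        return None
    return num_workers
```

With a PySpark RDD, the inputs would come from `rdd.getNumPartitions()`, and a non-None result would be passed to `rdd.repartition(n)`.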