Luis Cabezon Manchado
Hi @AllamSudhakara: I am currently using Metorikku and I am able to write Parquet files to S3. I am using this output configuration:

```yaml
- dataFrameName: df_name
  outputType: File
  format: ...
```
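For reference, a minimal sketch of what the rest of that output block usually looks like in a Metorikku metric file; the bucket path and save mode below are assumptions, not part of the truncated comment:

```yaml
# Hypothetical completion of the output block above; the S3 path
# and saveMode are placeholders, not from the original comment.
output:
  - dataFrameName: df_name
    outputType: File
    outputOptions:
      saveMode: Overwrite            # assumption: overwrite on each run
      path: s3://my-bucket/output/   # hypothetical S3 destination
      format: parquet
```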
Right now you could force this with environment variables used as booleans to create empty dataframes. For example, you could query your dataframe as:

```sql
SELECT * FROM WHERE ${create_dataframe} = ...
```
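A fuller sketch of the same trick, assuming the variable is substituted into the query text before execution; the table name `my_table` and the `'true'` literal are hypothetical, not from the original comment:

```sql
-- Sketch only: my_table and the 'true' literal are placeholders.
-- When create_dataframe=false is passed in, the predicate is false
-- for every row, so the step yields an empty dataframe that still
-- keeps the source schema.
SELECT *
FROM my_table
WHERE '${create_dataframe}' = 'true'
```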
Hi, this issue still exists in DeltaStreamer :(
Hi @alexeykudinkin, I'm using Hudi 0.12.1 and Spark 3.1.2. I'm trying to execute this command:

```
spark-submit \
  --conf spark.sql.legacy.parquet.datetimeRebaseModeInRead=CORRECTED \
  --conf spark.sql.legacy.parquet.datetimeRebaseModeInWrite=CORRECTED \
  --conf spark.sql.legacy.parquet.int96RebaseModeInRead=CORRECTED \
  --conf spark.sql.legacy.parquet.int96RebaseModeInWrite=CORRECTED \
  ...
```
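For context, a sketch of how those rebase confs are typically combined into a full Hudi DeltaStreamer invocation; the bundle jar, target path, and table name are assumptions, not taken from the truncated command:

```
# Sketch only: the jar location, S3 path, and table name are hypothetical.
spark-submit \
  --conf spark.sql.legacy.parquet.datetimeRebaseModeInRead=CORRECTED \
  --conf spark.sql.legacy.parquet.datetimeRebaseModeInWrite=CORRECTED \
  --conf spark.sql.legacy.parquet.int96RebaseModeInRead=CORRECTED \
  --conf spark.sql.legacy.parquet.int96RebaseModeInWrite=CORRECTED \
  --conf spark.serializer=org.apache.spark.serializer.KryoSerializer \
  --class org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer \
  hudi-utilities-bundle_2.12-0.12.1.jar \
  --table-type COPY_ON_WRITE \
  --target-base-path s3://my-bucket/hudi/my_table \
  --target-table my_table
```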
Not in my case, I'm still having this issue.
Hi @Virmaline, it is quite strange. I have downloaded a full table on AWS that gives me four Parquet files (let's call them A, B, C, D). I have tested your...
Hi @Virmaline, I have checked other tables and it looks like it cannot read more than four Parquet files. When I add four or more files, it shows me this error. Is...
Hi @calixtofelipe, which configuration are you using to run it on AWS Glue? I mean not only the Spark conf `spark.databricks.delta.fixSchema.GlueCatalog`, but also additional arguments such as `--extra-py-files` and `--extra-jars`.
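For illustration, a sketch of how those arguments can be wired up as Glue job default arguments via the AWS CLI; the role, script location, jar, and zip names are hypothetical placeholders, not something @calixtofelipe confirmed:

```
# Sketch only: all S3 paths, the role, and file names are placeholders.
aws glue create-job \
  --name my-delta-job \
  --role MyGlueServiceRole \
  --command Name=glueetl,ScriptLocation=s3://my-bucket/scripts/job.py \
  --default-arguments '{
    "--extra-jars": "s3://my-bucket/jars/delta-core_2.12-1.0.0.jar",
    "--extra-py-files": "s3://my-bucket/libs/delta.zip",
    "--conf": "spark.databricks.delta.fixSchema.GlueCatalog=true"
  }'
```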