Martin Mauch

403 comments by Martin Mauch

Ok, so it fails during schema inference. Are you able to specify a schema manually?
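Something like this, as an untested sketch (column names and the path are placeholders, not from your issue):

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.types._

val spark = SparkSession.builder().appName("excel-schema").getOrCreate()

// Placeholder schema -- use your actual column names and types
val schema = StructType(Seq(
  StructField("id", IntegerType, nullable = true),
  StructField("name", StringType, nullable = true),
  StructField("amount", DoubleType, nullable = true)
))

val df = spark.read
  .format("com.crealytics.spark.excel")
  .option("header", "true")
  .schema(schema)                 // explicit schema, so the inference pass is skipped entirely
  .load("/path/to/file.xlsx")     // placeholder path
```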

Did you try the combination of specifying a schema and using `maxRowsInMemory`?
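Roughly like this (untested sketch, assuming a `spark` session is in scope; schema and path are placeholders):

```scala
import org.apache.spark.sql.types._

// Placeholder schema, just to illustrate combining both options
val schema = StructType(Seq(
  StructField("id", LongType),
  StructField("value", StringType)
))

val df = spark.read
  .format("com.crealytics.spark.excel")
  .option("header", "true")
  .option("maxRowsInMemory", "20") // use the streaming POI reader, keeping only a window of rows in memory
  .schema(schema)                  // explicit schema avoids the inference pass over the file
  .load("/path/to/file.xlsx")
```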

Can you try the newest version (`0.20.3`) and also the version using `.format("excel")`?
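For reference, the two ways of addressing the data source (if I remember correctly, the short `excel` name is only registered in the newer, DataSource-V2-based releases; paths are placeholders):

```scala
// Old style, available in all releases
val dfV1 = spark.read
  .format("com.crealytics.spark.excel")
  .option("header", "true")
  .load("/path/to/file.xlsx")

// New style via the short name, available in newer releases
val dfV2 = spark.read
  .format("excel")
  .option("header", "true")
  .load("/path/to/file.xlsx")
```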

Does that combination of Spark and spark-excel even work?? Please always try the newest version of spark-excel when posting issues. I don't think this solves the problem in this case,...
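For context: newer spark-excel releases encode the Spark version they were built against in the version string, so the dependency should match your cluster's Spark version. A rough sketch of what that looks like in `build.sbt` (the exact coordinates below are illustrative, not a recommendation):

```scala
// build.sbt -- version string is illustrative; pick a release whose prefix matches your Spark version,
// i.e. something of the form "<spark.version>_<spark-excel.version>"
libraryDependencies += "com.crealytics" %% "spark-excel" % "3.2.1_0.20.3"
```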

@williamdphillips in [this line](https://github.com/crealytics/spark-excel/blob/35a5b9b8d67d133345040ab642a897cba1f6b519/src/main/scala/com/crealytics/spark/excel/DefaultSource.scala#L38) we're passing the `HadoopConfiguration` to the `WorkbookReader` and then use it [here](https://github.com/crealytics/spark-excel/blob/35a5b9b8d67d133345040ab642a897cba1f6b519/src/main/scala/com/crealytics/spark/excel/WorkbookReader.scala#L65) to actually read from the filesystem. There might be another way to read from...
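The gist of that code path, as a simplified sketch (names are mine, not the actual `WorkbookReader` implementation, which also handles workbook type detection, passwords, etc.):

```scala
import java.io.InputStream
import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.{FileSystem, Path}

// "Pass the HadoopConfiguration in, open the file through it":
def openThroughHadoop(location: String, hadoopConf: Configuration): InputStream = {
  val path = new Path(location)
  // The FileSystem is resolved from the URI scheme (s3a://, hdfs://, file://, ...)
  // using exactly the configuration Spark hands us.
  val fs: FileSystem = path.getFileSystem(hadoopConf)
  fs.open(path) // FSDataInputStream, which POI consumes as a plain InputStream
}
```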

There was a [similar issue in Hudi](https://github.com/apache/hudi/issues/817). @johnboyer have you tried building spark-excel with 2.12.15? Does it fix the problem?
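In case it helps, pinning the Scala patch version for a local build is just this (sketch):

```scala
// build.sbt -- pin the Scala patch version before rebuilding,
// or switch it for a single run with `sbt ++2.12.15 package`
scalaVersion := "2.12.15"
```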

Hi @EnverOsmanov, thanks for the PR! I'm slightly worried that `.takeWhile` changes the semantics to stop after the first non-matching index. At the moment this doesn't matter...
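To illustrate the concern with a standalone example (not the actual PR code):

```scala
val values = Seq(1, 2, -1, 3)  // pretend -1 is a non-matching element in the middle

values.filter(_ >= 0)     // Seq(1, 2, 3) -- still sees matches after the non-match
values.takeWhile(_ >= 0)  // Seq(1, 2)    -- stops at the first non-match
```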

Hmm, maybe it is to be able to do the following:

```scala
r.getCell(_, MissingCellPolicy.CREATE_NULL_AS_BLANK)
```

@quanghgx could you chime in here?
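For anyone following along, a small standalone POI example of what that policy changes (assuming a recent POI version, as bundled by spark-excel):

```scala
import org.apache.poi.ss.usermodel.Row
import org.apache.poi.xssf.usermodel.XSSFWorkbook

val wb  = new XSSFWorkbook()
val row = wb.createSheet("sheet1").createRow(0)
row.createCell(0).setCellValue("only column 0 exists")

// Default policy: a cell that was never created comes back as null
val missing = row.getCell(5)
// CREATE_NULL_AS_BLANK: a blank cell object is created on the fly instead of returning null
val blank   = row.getCell(5, Row.MissingCellPolicy.CREATE_NULL_AS_BLANK)

println(missing == null)   // true
println(blank.getCellType) // BLANK
wb.close()
```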

The documentation reads like this is only supported for a few specific file formats: https://docs.databricks.com/ingestion/auto-loader/options.html#file-format-options Not sure if they are hard-coded somewhere, or one would need to implement a special...