Martin Mauch

Results 403 comments of Martin Mauch

Not sure if this is a typo, but afaik you need to use `===` instead of `==` when comparing columns. Also the value might need to be wrapped in `lit`.

Ok, interesting! Might be a change in the API that we'd need to account for. @gaya3dk2490 if you don't mind, you could skim the Spark changelogs if there's sth. in...

@zzeekk it would be great if you could rebase the PR and update the documentation. Please also try if the new instructions work, both in the command-line via spark-shell, as...

Hi @FlavioShneider, can you post the exact code, and in the best case also the Excel file to reproduce the issue?

I think I know what's happening: The `productId` is specified as a `STRING` in the schema, but stored as a number in the Excel file, so we run into [this...

Scala-steward usually takes care of updating dependencies. We're on [POI 5.2.3](https://github.com/crealytics/spark-excel/blob/main/build.sc#L61). Should the bug be fixed there?

Ok. Once there's a release with the fix Scala-steward should create a PR soon ish.

Good question. @pjfanning I saw you made some contributions to POI. Are you aware when a new release might be made?

@cometta @mtovmassian would it be an option to convert the file to `.xlsx` format? As @pjfanning mentioned, `.xls` is really ancient and should not be produced by current tools anymore.

Why not try ```scala .option("maxRowsInMemory", 20) // Optional, default None. If set, uses a streaming reader which can help with big files (will fail if used with xls format files)...