Marcin Jakubowski
Unfortunately, file names are not exposed by PostWriteHandler. You have access only to the partition path. Mind that when a batch is written, more than one file can be modified...
No, there's no effort planned. Feel free to propose any change, PRs are welcome. FYI, besides that, the "signature" metadata contains the schema name, which is either the class name or "parquet4s_schema" if...
Great! Looking forward to your contribution!
Hi @nkconnor. I do not have plans myself atm to introduce `InputFile` as a parameter. Feel free to propose a PR. Hint: probably you can add an alternative build (`read`) function...
Hi Sergey, no, it is not possible at the moment to have non-standard partition names. Parquet4s follows the most common standard, which allows restoring both the column name and...
@huajiang-tubi here is an issue that is related to your PR
Support for `InputFile` and `OutputFile` was released in https://github.com/mjakubowski84/parquet4s/releases/tag/v2.12.0.
Hi Chris! Parquet4s doesn't expose the file schema in its own API (it is a thing that could be added). However, you can easily access it by calling the original Java...
Regarding > a complete list of how to map scala types properly to fields check the content of [TypedSchemaDef](https://github.com/mjakubowski84/parquet4s/blob/master/core/src/main/scala/com/github/mjakubowski84/parquet4s/Schema.scala#L196) I mean... use this type class implicitly or explicitly to obtain...
> It seems like postWriteHandler gets called after each chunk of data is written? That's true. You can use postWriteHandler to implement flushing based on your own business logic. >...