parquet-go icon indicating copy to clipboard operation
parquet-go copied to clipboard

Go package to read and write parquet files. parquet is a file format to store nested data structures in a flat columnar data format. It can be used in the Hadoop ecosystem and with tools such as Prest...

Results 22 parquet-go issues
Sort by recently updated
recently updated
newest added

By design, the byte array store, is limited to accept only `[]byte` and not string, so if the `map[string]interface{}` contains the string, then the library returns an error indicated that...

enhancement

It would be neat to have richer support for struct tags for auto-generated schema definitions. I added this feature to a branch off my forked repo and am happy to...

**Describe the bug** I uncovered a small bug in schema.go, the `recursiveFix` func overwrites the passed in `colPath` by using it in the `append` function to add the column name...

**Describe the bug** I am trying to load multiple concurrent parquet files into memory and try to read them row by row. I am facing OOM issue while I read...

Hi, How can I read the entire data into a slice call Foo? Is there a better option than for loop with fr.Next() as you do it here https://github.com/fraugster/parquet-go/blob/master/examples/read-low-level/main.go#L44? ```...

**Describe the bug** Using `floor` struct writer/reader: an empty slice field (e.g. []string{}) written as a list cannot be read back, resulting in error `sub-group list or bag not found`....

**Describe the bug** This library provides no protection against untrusted inputs. It is trivial to write inputs that will cause a server to OOM or crash from another issue. This...

bug
enhancement
help wanted

As a user i would like a command to aid in debugging parquet files. For instance I would like to obtain the following file stats in a single command: -...

good first issue
tooling

This PR introduces Go 1.18 generics to generalize the separate int32/int64/float32/float64 implementation into a single number type with small specializations. This deduplicates a lot of similar code across these types....