parquet-files topic

List parquet-files repositories

ChoETL

746
Stars
134
Forks
Watchers

ETL framework for .NET (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)

parquet4s

277
Stars
68
Forks
Watchers

Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.

petastorm

1.8k
Stars
281
Forks
Watchers

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, a...

sergeant

125
Stars
14
Forks
Watchers

:guardsman: Tools to Transform and Query Data with 'Apache' 'Drill'

spark-select

96
Stars
18
Forks
Watchers

A library for Spark DataFrame using MinIO Select API

osm-parquetizer

89
Stars
33
Forks
Watchers

A converter for the OSM PBFs to Parquet files

miniparquet

43
Stars
7
Forks
Watchers

Library to read a subset of Parquet files

prql-query

123
Stars
7
Forks
Watchers

Query and transform data with PRQL