parquet-files topic
parquet-files repositories
ChoETL
746
Stars
134
Forks
ETL framework for .NET (parser/writer for CSV, flat, XML, JSON, key-value, Parquet, YAML, and Avro formatted files)
parquet4s
277
Stars
68
Forks
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
petastorm
1.8k
Stars
281
Forks
The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, a...
sergeant
125
Stars
14
Forks
Tools to Transform and Query Data with 'Apache' 'Drill'
WebCrawlerForOnlineInflation
173
Stars
52
Forks
Price Crawler - Tracking Price Inflation
spark-select
96
Stars
18
Forks
A library for Spark DataFrame using MinIO Select API
osm-parquetizer
89
Stars
33
Forks
A converter from OSM PBF files to Parquet files
miniparquet
43
Stars
7
Forks
Library to read a subset of Parquet files
prql-query
123
Stars
7
Forks
Query and transform data with PRQL
Threat-Detection-and-Visualization
34
Stars
8
Forks
Threat Detection and Visualization