pyspark-notebook topic

List pyspark-notebook repositories

PySpark

92
Stars
74
Forks
Watchers

PySpark functions and utilities with examples. Assists ETL process of data modeling

databricks-demos

25
Stars
52
Forks
Watchers

Repository of notebooks and related collateral used in the Databricks Demo Hub, showing how to use Databricks, Delta Lake, MLflow, and more.

notebooks

22
Stars
4
Forks
Watchers

Various examples of notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archives Unleashed Toolkit.

bigdata-workshop-es

21
Stars
59
Forks
Watchers

Workshop Big Data en Español

intro-to-colab-pyspark-emr

17
Stars
8
Forks
Watchers

A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics like EMR sizing, Google Colaboratory, fine-tuning PySpark jobs,...

pyspark-devcontainer

26
Stars
19
Forks
Watchers

A simple VS Code devcontainer setup for local PySpark development

Fabric-RTA-FlightStream

17
Stars
2
Forks
Watchers

Microsoft Fabric Real-time Analytics flight streaming