pyspark-notebook topic
PySpark
PySpark functions and utilities with examples. Assists ETL process of data modeling
databricks-demos
Repository of notebooks and related collateral used in the Databricks Demo Hub, showing how to use Databricks, Delta Lake, MLflow, and more.
notebooks
Various examples of notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archives Unleashed Toolkit.
Detecting-Malicious-URL-Machine-Learning
bigdata-workshop-es
Workshop Big Data en Español
intro-to-colab-pyspark-emr
A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics like EMR sizing, Google Colaboratory, fine-tuning PySpark jobs,...
pyspark-devcontainer
A simple VS Code devcontainer setup for local PySpark development
Crime-Classification-using-PySpark
classify crime into different categories using PySpark
Fabric-RTA-FlightStream
Microsoft Fabric Real-time Analytics flight streaming