Databricks Labs

Results 30 repositories owned by Databricks Labs

cicd-templates

200
Stars
101
Forks
Watchers

Manage your Databricks deployments and CI with code.

dbx

435
Stars
119
Forks
Watchers

🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.

tempo

295
Stars
50
Forks
Watchers

API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation

automl-toolkit

189
Stars
42
Forks
Watchers

Toolkit for Apache Spark ML for Feature clean-up, feature Importance calculation suite, Information Gain selection, Distributed SMOTE, Model selection and training, Hyper parameter optimization and se...

dataframe-rules-engine

132
Stars
29
Forks
Watchers

Extensible Rules Engine for custom Dataframe / Dataset validation

overwatch

217
Stars
59
Forks
Watchers

Capture deep metrics on one or all assets within a Databricks workspace

jupyterlab-integration

71
Stars
12
Forks
Watchers

DEPRECATED: Integrating Jupyter with Databricks via SSH

geoscan

91
Stars
19
Forks
Watchers

Geospatial clustering at massive scale

dbldatagen

272
Stars
53
Forks
Watchers

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...

databricks-sync

45
Stars
12
Forks
Watchers

An experimental tool to synchronize source Databricks deployment with a target Databricks deployment.