Repositories list
38 repositories
- Accelerates migrations to Databricks by automating key migration activities
- Metadata driven Spark Declarative Pipelines framework for bronze/silver pipelines
- Databricks framework to validate the data quality of PySpark DataFrames
- Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for tests, POCs, and other uses in Databricks environments, including in Delta Live Tables pipelines.
- ucx (Public): Automated migrations to Unity Catalog
- Experimental labs projects
- lsql (Public): Lightweight SQL execution wrapper only on top of Databricks SDK
- API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation
- Python Testing for Databricks
- blueprint (Public): Baseline for Databricks Labs projects written in Python
- brickster (Public)
- A Swiss-Army-knife for your Data Intelligence platform administration.
- 🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.
- splunk-integration (Public): Databricks Add-on for Splunk
- overwatch (Public): Capture deep metrics on one or all assets within a Databricks workspace
- pylint-plugin (Public): Databricks Plugin for PyLint
- Automated provisioning of an industry Lakehouse with enterprise data model
- dataframe-rules-engine (Public)
- Databricks SDK for R (Experimental)
- databricks-sync (Public)