This project serves as a demonstration of the capabilities of Databricks in ingesting data from external sources using both SQL Queries and Notebooks. In particular, it showcases how Databricks can be used to interact with a SQL Database with the 'adventureworks' model.
To run this project, you'll need the following dependencies:
- A SQL Database with the 'adventureworks' model present in it.
Follow these steps to get started with this project:
-
Set up your Databricks environment
-
Clone this repository to your Databricks workspace:
git clone <repository_url>-
Configure your connection to the SQL Database with the 'adventureworks' model, you can use the Databricks Catalog Explorer to do so
-
Explore the following components of this project:
-
SQL Queries: Navigate to the etl_w_sql_queries directory for SQL-based data ingestion examples.
-
Notebooks: Explore the etl_w_notebooks directory for notebooks demonstrating data ingestion using Databricks notebooks.
-
-
Follow the documentation and examples in each subdirectory to understand and execute the data ingestion processes.
This project is licensed under the Apache 2.0 License. See the LICENSE file for details.
If you have any questions or need further assistance, please contact your Databricks representative