You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+3-1Lines changed: 3 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -24,14 +24,16 @@ Currently, the following adapters are supported:
24
24
- AWS Athena (tested manually)
25
25
- Greenplum (tested manually)
26
26
- ClickHouse (tested manually)
27
+
- Microsoft Fabric Data Warehouse (tested manually)
28
+
- Microsoft Fabric Spark (tested manually)
27
29
28
30
## Using This Package
29
31
30
32
### Cloning via dbt Package Hub
31
33
32
34
Check [dbt Hub](https://hub.getdbt.com/dbt-labs/dbt_project_evaluator/latest/) for the latest installation instructions, or [read the docs](https://docs.getdbt.com/docs/package-management) for more information on installing packages.
33
35
34
-
### Additional setup for Databricks/Spark/DuckDB/Redshift/ClickHouse
36
+
### Additional setup for Databricks/Spark/DuckDB/Redshift/ClickHouse/Fabric
35
37
36
38
In your `dbt_project.yml`, add the following config:
Copy file name to clipboardExpand all lines: docs/customization/overriding-variables.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -103,14 +103,14 @@ vars:
103
103
104
104
| variable | description | default |
105
105
| ----------- | ----------- | ----------- |
106
-
| `max_depth_dag` | limits the maximum distance between nodes calculated in `int_all_dag_relationships` | 9 for bigqueryand spark, -1 for other adatpters |
106
+
| `max_depth_dag` | limits the maximum distance between nodes calculated in `int_all_dag_relationships` | 9 for bigquery, spark, and fabric, -1 for other adapters |
107
107
| `insert_batch_size` | number of records inserted per batch when unpacking the graph into models | 10000 |
108
108
109
109
**Note on max_depth_dag**
110
110
111
111
The default behavior for limiting the relationships calculated in the `int_all_dag_relationships` model differs depending on your adapter.
112
112
113
-
- For Bigquery & Spark/Databricks the maximum distance between two nodes in your DAG, calculated in `int_all_dag_relationships`, is set by the `max_depth_dag` variable, which is defaulted to 9. So by default, `int_all_dag_relationships` contains a row for every path less than or equal to 9 nodes in length between two nodes in your DAG. This is because these adapters do not currently support recursive SQL, and queries often fail on more than 9 recursive joins.
113
+
- For BigQuery, Spark/Databricks, and Microsoft Fabric Data Warehouse the maximum distance between two nodes in your DAG, calculated in `int_all_dag_relationships`, is set by the `max_depth_dag` variable, which is defaulted to 9. So by default, `int_all_dag_relationships` contains a row for every path less than or equal to 9 nodes in length between two nodes in your DAG. This is because these adapters do not currently support recursive SQL, and queries often fail on more than 9 recursive joins.
114
114
- For all other adapters `int_all_dag_relationships` by default contains a row for every single path between two nodes in your DAG. If you experience long runtimes for the `int_all_dag_relationships` model, you may consider limiting the length of your generated DAG paths. To do this, set `max_depth_dag: {{ whatever limit you want to enforce }}`. The value of `max_depth_dag` must be greater than 2 for all DAG tests to work, and greater than `chained_views_threshold` to ensure your performance tests to work. By default, the value of this variable for these adapters is -1, which the package interprets as "no limit".
Copy file name to clipboardExpand all lines: docs/index.md
+5-3Lines changed: 5 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -25,14 +25,16 @@ Currently, the following adapters are supported:
25
25
- AWS Athena (tested manually)
26
26
- Greenplum (tested manually)
27
27
- ClickHouse (tested manually)
28
+
- Microsoft Fabric Data Warehouse (tested manually)
29
+
- Microsoft Fabric Spark (tested manually)
28
30
29
31
## Using This Package
30
32
31
33
### Cloning via dbt Package Hub
32
34
33
35
Check [dbt Hub](https://hub.getdbt.com/dbt-labs/dbt_project_evaluator/latest/) for the latest installation instructions, or [read the docs](https://docs.getdbt.com/docs/package-management) for more information on installing packages.
34
36
35
-
### Additional setup for Databricks/Spark/DuckDB/Redshift
37
+
### Additional setup for Databricks/Spark/DuckDB/Redshift/Fabric
36
38
37
39
In your `dbt_project.yml`, add the following config:
38
40
@@ -64,8 +66,8 @@ Each test warning indicates the presence of a type of misalignment. To troublesh
64
66
65
67
## Limitations
66
68
67
-
### BigQueryand Databricks
69
+
### BigQuery, Databricks, and Microsoft Fabric Data Warehouse
68
70
69
-
BigQuery current support for recursive CTEs is limited and Databricks SQL doesn't support recursive CTEs.
71
+
BigQuery has limited support for recursive CTEs, while Databricks SQL and Microsoft Fabric Data Warehouse do not support them.
70
72
71
73
For those Data Warehouses, the model `int_all_dag_relationships` needs to be created by looping CTEs instead. The number of loops is configured with `max_depth_dag` and defaulted to 9. This means that dependencies between models of more than 9 levels of separation won't show in the model `int_all_dag_relationships` but tests on the DAG will still be correct. With a number of loops higher than 9 BigQuery sometimes raises an error saying the query is too complex.
0 commit comments