-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[dagster-dbt] Support dbt selections for dbt Cloud #28906
[dagster-dbt] Support dbt selections for dbt Cloud #28906
Conversation
select=select, | ||
exclude=exclude or "", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There's an inconsistency in the handling of the exclude
parameter. While load_specs
declares it as Optional[str] = None
, when passing to DbtCloudWorkspaceDefsLoader
, it uses exclude or ""
.
This approach has a potential issue: if exclude
is a valid selection string that evaluates to False
in Python (like "0"
or "False"
), it would incorrectly be replaced with an empty string.
Consider replacing:
exclude or ""
with:
exclude if exclude is not None else ""
This ensures that only None
values are converted to empty strings, while preserving all other valid selection strings regardless of their truthiness.
select=select, | |
exclude=exclude or "", | |
select=select, | |
exclude=exclude if exclude is not None else "", |
Spotted by Diamond
Is this helpful? React 👍 or 👎 to let us know.
90d3eaa
to
7eb9e75
Compare
e52220d
to
d0582ad
Compare
7eb9e75
to
f407202
Compare
d0582ad
to
5a2600c
Compare
f407202
to
cda3d09
Compare
5a2600c
to
c3fd4b1
Compare
workspace=self, translator=DagsterDbtTranslator(), select="fqn:*", exclude="" | ||
).get_or_fetch_state() | ||
|
||
# Cache spec retrieval for a specific translator class. | ||
@lru_cache(maxsize=1) | ||
def load_specs( | ||
self, dagster_dbt_translator: Optional[DagsterDbtTranslator] = None | ||
self, | ||
dagster_dbt_translator: Optional[DagsterDbtTranslator] = None, | ||
select: str = "fqn:*", | ||
exclude: Optional[str] = None, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why are these set to two different values in different places? Use of empty string vs none feels inconsistent here. We should have a "default" constant that we reference everywhere; and it should probably only be set at the top level
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah I agree. I updated the code to use default constants. Since all these APIs are user-facing, I used the defaults in all of them, but not in the state back defs, where select and exclude must be passed by load_specs
.
DBT_CLOUD_DEFAULT_SELECT = "fqn:*"
DBT_CLOUD_DEFAULT_EXCLUDE = ""
select: str = "fqn:*", | ||
exclude: Optional[str] = None, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yea really don't like how this is optional at every layer down the chain - feels error prone
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I want to see more consistency with how we're handling defaults here. I really don't like that they're optional all the way down the chain. We should have defaults at the top level and that's it.
cda3d09
to
bed08d7
Compare
8da9515
to
45cf1b0
Compare
bed08d7
to
004e3cc
Compare
004e3cc
to
4da6e0e
Compare
op_tags = { | ||
**({DAGSTER_DBT_SELECT_METADATA_KEY: select} if select else {}), | ||
**({DAGSTER_DBT_EXCLUDE_METADATA_KEY: exclude} if exclude else {}), | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There's a subtle issue with the conditional tag creation. When exclude
is an empty string (the default value), the condition if exclude else {}
evaluates to false, so the DAGSTER_DBT_EXCLUDE_METADATA_KEY
tag won't be added to op_tags
.
Later in the code, get_subset_selection_for_context()
uses context.op.tags.get(DAGSTER_DBT_EXCLUDE_METADATA_KEY)
, which will return None
instead of an empty string. This could cause inconsistent behavior if None
and empty string are handled differently in the selection logic.
The tests expect the exclude tag to be set correctly (as shown in the assertions). Consider always including both tags:
op_tags = {
DAGSTER_DBT_SELECT_METADATA_KEY: select,
DAGSTER_DBT_EXCLUDE_METADATA_KEY: exclude,
}
This ensures consistent behavior regardless of whether the values are empty strings or not.
op_tags = { | |
**({DAGSTER_DBT_SELECT_METADATA_KEY: select} if select else {}), | |
**({DAGSTER_DBT_EXCLUDE_METADATA_KEY: exclude} if exclude else {}), | |
} | |
op_tags = { | |
DAGSTER_DBT_SELECT_METADATA_KEY: select, | |
DAGSTER_DBT_EXCLUDE_METADATA_KEY: exclude, | |
} |
Spotted by Diamond
Is this helpful? React 👍 or 👎 to let us know.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is real. Pls heed
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done in e40d586
self, dagster_dbt_translator: Optional[DagsterDbtTranslator] = None | ||
self, | ||
dagster_dbt_translator: Optional[DagsterDbtTranslator] = None, | ||
select: str = DBT_CLOUD_DEFAULT_SELECT, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Specifically I think these arguments should not be optional in internal APIs
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That's fair - updated in b3ada91
4da6e0e
to
aebe7d5
Compare
Summary & Motivation
This PR updates the code to allow users to pass
select
andexclude
strings to thedbt_cloud_assets
decorator, and the spec loader functions for dbt Cloud.How I Tested These Changes
Additional tests with BK