Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[dagster-dbt] Support dbt selections for dbt Cloud #28906

Merged
merged 4 commits into from
Apr 3, 2025

Conversation

maximearmstrong
Copy link
Contributor

@maximearmstrong maximearmstrong commented Apr 1, 2025

Summary & Motivation

This PR updates the code to allow users to pass select and exclude strings to the dbt_cloud_assets decorator, and the spec loader functions for dbt Cloud.

How I Tested These Changes

Additional tests with BK

@maximearmstrong maximearmstrong changed the title [dagster-dbt] Support dbt selections in asset decorator and spec loader for dbt Cloud [dagster-dbt] Support dbt selections for dbt Cloud Apr 1, 2025
@maximearmstrong maximearmstrong marked this pull request as ready for review April 1, 2025 02:24
@maximearmstrong maximearmstrong self-assigned this Apr 1, 2025
@maximearmstrong maximearmstrong requested a review from dpeng817 April 1, 2025 02:24
Comment on lines 239 to 240
select=select,
exclude=exclude or "",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There's an inconsistency in the handling of the exclude parameter. While load_specs declares it as Optional[str] = None, when passing to DbtCloudWorkspaceDefsLoader, it uses exclude or "".

This approach has a potential issue: if exclude is a valid selection string that evaluates to False in Python (like "0" or "False"), it would incorrectly be replaced with an empty string.

Consider replacing:

exclude or ""

with:

exclude if exclude is not None else ""

This ensures that only None values are converted to empty strings, while preserving all other valid selection strings regardless of their truthiness.

Suggested change
select=select,
exclude=exclude or "",
select=select,
exclude=exclude if exclude is not None else "",

Spotted by Diamond

Is this helpful? React 👍 or 👎 to let us know.

@maximearmstrong maximearmstrong force-pushed the maxime/ad-849/implement-asset-decorator5 branch from 90d3eaa to 7eb9e75 Compare April 1, 2025 13:07
@maximearmstrong maximearmstrong force-pushed the maxime/ad-849/implement-asset-decorator4 branch from e52220d to d0582ad Compare April 1, 2025 13:07
@maximearmstrong maximearmstrong force-pushed the maxime/ad-849/implement-asset-decorator5 branch from 7eb9e75 to f407202 Compare April 1, 2025 14:05
@maximearmstrong maximearmstrong force-pushed the maxime/ad-849/implement-asset-decorator4 branch from d0582ad to 5a2600c Compare April 1, 2025 14:05
@maximearmstrong maximearmstrong force-pushed the maxime/ad-849/implement-asset-decorator5 branch from f407202 to cda3d09 Compare April 1, 2025 14:30
@maximearmstrong maximearmstrong force-pushed the maxime/ad-849/implement-asset-decorator4 branch from 5a2600c to c3fd4b1 Compare April 1, 2025 14:30
Comment on lines 222 to 231
workspace=self, translator=DagsterDbtTranslator(), select="fqn:*", exclude=""
).get_or_fetch_state()

# Cache spec retrieval for a specific translator class.
@lru_cache(maxsize=1)
def load_specs(
self, dagster_dbt_translator: Optional[DagsterDbtTranslator] = None
self,
dagster_dbt_translator: Optional[DagsterDbtTranslator] = None,
select: str = "fqn:*",
exclude: Optional[str] = None,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why are these set to two different values in different places? Use of empty string vs none feels inconsistent here. We should have a "default" constant that we reference everywhere; and it should probably only be set at the top level

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah I agree. I updated the code to use default constants. Since all these APIs are user-facing, I used the defaults in all of them, but not in the state back defs, where select and exclude must be passed by load_specs.

DBT_CLOUD_DEFAULT_SELECT = "fqn:*"
DBT_CLOUD_DEFAULT_EXCLUDE = ""

Comment on lines 259 to 260
select: str = "fqn:*",
exclude: Optional[str] = None,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yea really don't like how this is optional at every layer down the chain - feels error prone

Copy link
Contributor

@dpeng817 dpeng817 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I want to see more consistency with how we're handling defaults here. I really don't like that they're optional all the way down the chain. We should have defaults at the top level and that's it.

@maximearmstrong maximearmstrong force-pushed the maxime/ad-849/implement-asset-decorator5 branch from cda3d09 to bed08d7 Compare April 1, 2025 16:33
@maximearmstrong maximearmstrong force-pushed the maxime/ad-849/implement-asset-decorator4 branch 2 times, most recently from 8da9515 to 45cf1b0 Compare April 1, 2025 20:40
@maximearmstrong maximearmstrong force-pushed the maxime/ad-849/implement-asset-decorator5 branch from bed08d7 to 004e3cc Compare April 1, 2025 20:40
Base automatically changed from maxime/ad-849/implement-asset-decorator4 to master April 1, 2025 21:30
@maximearmstrong maximearmstrong force-pushed the maxime/ad-849/implement-asset-decorator5 branch from 004e3cc to 4da6e0e Compare April 1, 2025 21:30
Comment on lines 45 to 48
op_tags = {
**({DAGSTER_DBT_SELECT_METADATA_KEY: select} if select else {}),
**({DAGSTER_DBT_EXCLUDE_METADATA_KEY: exclude} if exclude else {}),
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There's a subtle issue with the conditional tag creation. When exclude is an empty string (the default value), the condition if exclude else {} evaluates to false, so the DAGSTER_DBT_EXCLUDE_METADATA_KEY tag won't be added to op_tags.

Later in the code, get_subset_selection_for_context() uses context.op.tags.get(DAGSTER_DBT_EXCLUDE_METADATA_KEY), which will return None instead of an empty string. This could cause inconsistent behavior if None and empty string are handled differently in the selection logic.

The tests expect the exclude tag to be set correctly (as shown in the assertions). Consider always including both tags:

op_tags = {
    DAGSTER_DBT_SELECT_METADATA_KEY: select,
    DAGSTER_DBT_EXCLUDE_METADATA_KEY: exclude,
}

This ensures consistent behavior regardless of whether the values are empty strings or not.

Suggested change
op_tags = {
**({DAGSTER_DBT_SELECT_METADATA_KEY: select} if select else {}),
**({DAGSTER_DBT_EXCLUDE_METADATA_KEY: exclude} if exclude else {}),
}
op_tags = {
DAGSTER_DBT_SELECT_METADATA_KEY: select,
DAGSTER_DBT_EXCLUDE_METADATA_KEY: exclude,
}

Spotted by Diamond

Is this helpful? React 👍 or 👎 to let us know.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is real. Pls heed

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done in e40d586

@maximearmstrong maximearmstrong requested a review from dpeng817 April 1, 2025 21:33
self, dagster_dbt_translator: Optional[DagsterDbtTranslator] = None
self,
dagster_dbt_translator: Optional[DagsterDbtTranslator] = None,
select: str = DBT_CLOUD_DEFAULT_SELECT,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Specifically I think these arguments should not be optional in internal APIs

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That's fair - updated in b3ada91

@maximearmstrong maximearmstrong force-pushed the maxime/ad-849/implement-asset-decorator5 branch from 4da6e0e to aebe7d5 Compare April 3, 2025 14:59
@maximearmstrong maximearmstrong requested a review from dpeng817 April 3, 2025 15:00
@maximearmstrong maximearmstrong merged commit cb56a99 into master Apr 3, 2025
6 checks passed
@maximearmstrong maximearmstrong deleted the maxime/ad-849/implement-asset-decorator5 branch April 3, 2025 19:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants