Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

change UI to narrow asset events query #28903

Open
wants to merge 3 commits into
base: prha/graphql_partition_keys
Choose a base branch
from

Conversation

prha
Copy link
Member

@prha prha commented Mar 31, 2025

Summary & Motivation

This PR simplifies the data fetching for metadata plots in a couple ways.

Before:
For partitioned assets, we were using the partitionInLast parameter to filter materializations. This fetches all partition keys (expensive), and then grabs the last N materializations that show up in the last N partitions* (incorrect). We were also fetching the materialization in the most recent partition to represent the most recent materialization overall.

After:
We separate out the most recent materialization query to be just that, fetch the most recent materialization regardless of partition (this may or may not be the right decision, based on product, but it's easy to change). For partitioned assets, we now do a two-phased fetch. We first fetch the last N partition keys, using the new paginated partition key graphql fields. We then use those partition keys to pass into the latestMaterializationByPartition field, which gets the most recent materialization for the provided set of partition keys.

Breaking apart the partition history data fetching to be closer to the graphs component also means that the main components become interactive much earlier.

This should result in better correctness and massively reduce overfetching. This should also result in much better performance in Cloud.

How I Tested These Changes

Screen.Recording.2025-03-31.at.2.48.50.PM.mov

Changelog

Copy link
Member Author

prha commented Mar 31, 2025

Warning

This pull request is not mergeable via GitHub because a downstack PR is open. Once all requirements are satisfied, merge this PR as a stack on Graphite.
Learn more

This stack of pull requests is managed by Graphite. Learn more about stacking.

Copy link

github-actions bot commented Mar 31, 2025

Deploy preview for dagit-core-storybook ready!

✅ Preview
https://dagit-core-storybook-anvt13wgm-elementl.vercel.app
https://prha-ui-partition-keys.core-storybook.dagster-docs.io

Built with commit 9d25a61.
This pull request is being automatically deployed with vercel-action

@prha prha force-pushed the prha/ui_partition_keys branch from 301de08 to b40b177 Compare April 1, 2025 22:30
@prha prha force-pushed the prha/graphql_partition_keys branch from 9b77cc2 to 1d6f53d Compare April 1, 2025 22:30
@prha prha force-pushed the prha/ui_partition_keys branch from b40b177 to 745a442 Compare April 1, 2025 22:34
@prha prha force-pushed the prha/graphql_partition_keys branch from 9842748 to 63408e2 Compare April 1, 2025 23:15
@prha prha force-pushed the prha/ui_partition_keys branch 2 times, most recently from 895733a to c4a0e83 Compare April 2, 2025 00:41
@prha prha force-pushed the prha/graphql_partition_keys branch from 63408e2 to 66476ee Compare April 2, 2025 00:41
@prha prha marked this pull request as ready for review April 2, 2025 06:50
@prha prha force-pushed the prha/ui_partition_keys branch from c4a0e83 to 4759b17 Compare April 2, 2025 06:51
Comment on lines 265 to 272
plotView === 'partition' ? (
<AssetTimeMetadataPlots assetKey={assetKey} limit={100} />
) : (
<AssetPartitionMetadataPlots assetKey={assetKey} limit={120} />
)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It appears the rendering logic is reversed from what the variable names suggest. When plotView === 'partition', the code renders <AssetTimeMetadataPlots>, and when plotView is something else (presumably 'time'), it renders <AssetPartitionMetadataPlots>.

For clarity and consistency with the variable names, this should be swapped so that:

  • When plotView === 'partition', render <AssetPartitionMetadataPlots>
  • When plotView === 'time', render <AssetTimeMetadataPlots>

This would align the component rendering with the user's selection in the ButtonGroup above.

Suggested change
plotView === 'partition' ? (
<AssetTimeMetadataPlots assetKey={assetKey} limit={100} />
) : (
<AssetPartitionMetadataPlots assetKey={assetKey} limit={120} />
)
plotView === 'partition' ? (
<AssetPartitionMetadataPlots assetKey={assetKey} limit={120} />
) : (
<AssetTimeMetadataPlots assetKey={assetKey} limit={100} />
)

Spotted by Diamond

Is this helpful? React 👍 or 👎 to let us know.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh I guess Graphite agrees! Wow interesting

Copy link
Collaborator

@bengotow bengotow left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks awesome - left a few inline comments! I didn't realize that fetching the partition keys and then passing them in explicitly would be so much better, that's great :-o

Comment on lines 265 to 272
plotView === 'partition' ? (
<AssetTimeMetadataPlots assetKey={assetKey} limit={100} />
) : (
<AssetPartitionMetadataPlots assetKey={assetKey} limit={120} />
)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh I guess Graphite agrees! Wow interesting

const value = useMemo(() => {
const assetNode =
data?.assetNodeOrError.__typename === 'AssetNode' ? data?.assetNodeOrError : null;
const materializations = (assetNode?.latestMaterializationByPartition || []).filter((_) => !!_);
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Don't think it's worth changing but if you want, you can also express this as filter(Boolean) (running each through the Boolean typecast essentially)

Comment on lines +141 to +152
const {partitionKeys} = useLatestAssetPartitions(assetKey, limit);
return useAssetPartitionMaterializations(assetKey, [...partitionKeys].reverse());
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There appears to be an issue with partition key ordering in this code. The partition keys go through multiple reversals which could lead to incorrect ordering in the UI:

  1. LATEST_ASSET_PARTITIONS_QUERY fetches keys in descending order with ascending: false
  2. useLatestAssetPartitionMaterializations reverses them with [...partitionKeys].reverse()
  3. The groupByPartition function (not shown here) likely reverses them again with const orderedPartitionKeys = [...definedPartitionKeys].reverse()

This triple transformation is confusing and could cause unexpected behavior. Consider either:

  • Removing the .reverse() call in useLatestAssetPartitionMaterializations
  • Updating the groupByPartition function to account for the already-reversed order

A clearer approach would be to establish a consistent ordering convention throughout the codebase and minimize transformations.

Suggested change
const {partitionKeys} = useLatestAssetPartitions(assetKey, limit);
return useAssetPartitionMaterializations(assetKey, [...partitionKeys].reverse());
const {partitionKeys} = useLatestAssetPartitions(assetKey, limit);
return useAssetPartitionMaterializations(assetKey, partitionKeys);

Spotted by Diamond

Is this helpful? React 👍 or 👎 to let us know.

@prha
Copy link
Member Author

prha commented Apr 2, 2025

This looks awesome - left a few inline comments! I didn't realize that fetching the partition keys and then passing them in explicitly would be so much better, that's great :-o

Yeah, it's not the fact that prefetching is so much better. It's the fact that we were using resolvers on the GrapheneAsset before, and we're now using resolvers on the GrapheneAssetNode (the difference between assetOrError vs assetNodeOrError). The latter has access to the definition, which makes it much easier to grab the last N partitions and query the materializations cheaply by them. The former does not and thus has to scan through history.

We were also using the same query to generate graphs for the case where there is an asset definition and the case where there is not. This means that we were doing the inefficient fetch for both. This PR (and #28941) changes the query for the definition-based query to use the more efficient fetch and removes the UI altogether for the non-definition case.

@prha prha force-pushed the prha/ui_partition_keys branch from 4227fbc to 9d25a61 Compare April 3, 2025 23:51
@prha prha force-pushed the prha/graphql_partition_keys branch from 66476ee to 075048f Compare April 3, 2025 23:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants