-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
change UI to narrow asset events query #28903
base: prha/graphql_partition_keys
Are you sure you want to change the base?
Conversation
Warning This pull request is not mergeable via GitHub because a downstack PR is open. Once all requirements are satisfied, merge this PR as a stack on Graphite.
This stack of pull requests is managed by Graphite. Learn more about stacking. |
Deploy preview for dagit-core-storybook ready! ✅ Preview Built with commit 9d25a61. |
301de08
to
b40b177
Compare
9b77cc2
to
1d6f53d
Compare
b40b177
to
745a442
Compare
9842748
to
63408e2
Compare
895733a
to
c4a0e83
Compare
63408e2
to
66476ee
Compare
c4a0e83
to
4759b17
Compare
plotView === 'partition' ? ( | ||
<AssetTimeMetadataPlots assetKey={assetKey} limit={100} /> | ||
) : ( | ||
<AssetPartitionMetadataPlots assetKey={assetKey} limit={120} /> | ||
) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It appears the rendering logic is reversed from what the variable names suggest. When plotView === 'partition'
, the code renders <AssetTimeMetadataPlots>
, and when plotView
is something else (presumably 'time'), it renders <AssetPartitionMetadataPlots>
.
For clarity and consistency with the variable names, this should be swapped so that:
- When
plotView === 'partition'
, render<AssetPartitionMetadataPlots>
- When
plotView === 'time'
, render<AssetTimeMetadataPlots>
This would align the component rendering with the user's selection in the ButtonGroup above.
plotView === 'partition' ? ( | |
<AssetTimeMetadataPlots assetKey={assetKey} limit={100} /> | |
) : ( | |
<AssetPartitionMetadataPlots assetKey={assetKey} limit={120} /> | |
) | |
plotView === 'partition' ? ( | |
<AssetPartitionMetadataPlots assetKey={assetKey} limit={120} /> | |
) : ( | |
<AssetTimeMetadataPlots assetKey={assetKey} limit={100} /> | |
) |
Spotted by Diamond
Is this helpful? React 👍 or 👎 to let us know.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh I guess Graphite agrees! Wow interesting
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks awesome - left a few inline comments! I didn't realize that fetching the partition keys and then passing them in explicitly would be so much better, that's great :-o
js_modules/dagster-ui/packages/ui-core/src/assets/AssetEventMetadataEntriesTable.tsx
Outdated
Show resolved
Hide resolved
plotView === 'partition' ? ( | ||
<AssetTimeMetadataPlots assetKey={assetKey} limit={100} /> | ||
) : ( | ||
<AssetPartitionMetadataPlots assetKey={assetKey} limit={120} /> | ||
) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh I guess Graphite agrees! Wow interesting
const value = useMemo(() => { | ||
const assetNode = | ||
data?.assetNodeOrError.__typename === 'AssetNode' ? data?.assetNodeOrError : null; | ||
const materializations = (assetNode?.latestMaterializationByPartition || []).filter((_) => !!_); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Don't think it's worth changing but if you want, you can also express this as filter(Boolean)
(running each through the Boolean typecast essentially)
const {partitionKeys} = useLatestAssetPartitions(assetKey, limit); | ||
return useAssetPartitionMaterializations(assetKey, [...partitionKeys].reverse()); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There appears to be an issue with partition key ordering in this code. The partition keys go through multiple reversals which could lead to incorrect ordering in the UI:
LATEST_ASSET_PARTITIONS_QUERY
fetches keys in descending order withascending: false
useLatestAssetPartitionMaterializations
reverses them with[...partitionKeys].reverse()
- The
groupByPartition
function (not shown here) likely reverses them again withconst orderedPartitionKeys = [...definedPartitionKeys].reverse()
This triple transformation is confusing and could cause unexpected behavior. Consider either:
- Removing the
.reverse()
call inuseLatestAssetPartitionMaterializations
- Updating the
groupByPartition
function to account for the already-reversed order
A clearer approach would be to establish a consistent ordering convention throughout the codebase and minimize transformations.
const {partitionKeys} = useLatestAssetPartitions(assetKey, limit); | |
return useAssetPartitionMaterializations(assetKey, [...partitionKeys].reverse()); | |
const {partitionKeys} = useLatestAssetPartitions(assetKey, limit); | |
return useAssetPartitionMaterializations(assetKey, partitionKeys); |
Spotted by Diamond
Is this helpful? React 👍 or 👎 to let us know.
Yeah, it's not the fact that prefetching is so much better. It's the fact that we were using resolvers on the We were also using the same query to generate graphs for the case where there is an asset definition and the case where there is not. This means that we were doing the inefficient fetch for both. This PR (and #28941) changes the query for the definition-based query to use the more efficient fetch and removes the UI altogether for the non-definition case. |
4227fbc
to
9d25a61
Compare
66476ee
to
075048f
Compare
Summary & Motivation
This PR simplifies the data fetching for metadata plots in a couple ways.
Before:
For partitioned assets, we were using the
partitionInLast
parameter to filter materializations. This fetches all partition keys (expensive), and then grabs the last N materializations that show up in the last N partitions* (incorrect). We were also fetching the materialization in the most recent partition to represent the most recent materialization overall.After:
We separate out the most recent materialization query to be just that, fetch the most recent materialization regardless of partition (this may or may not be the right decision, based on product, but it's easy to change). For partitioned assets, we now do a two-phased fetch. We first fetch the last N partition keys, using the new paginated partition key graphql fields. We then use those partition keys to pass into the
latestMaterializationByPartition
field, which gets the most recent materialization for the provided set of partition keys.Breaking apart the partition history data fetching to be closer to the graphs component also means that the main components become interactive much earlier.
This should result in better correctness and massively reduce overfetching. This should also result in much better performance in Cloud.
How I Tested These Changes
Screen.Recording.2025-03-31.at.2.48.50.PM.mov
Changelog