Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Safeguard boundedtrie result #34363

Draft
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

rohitsinha54
Copy link
Contributor

@rohitsinha54 rohitsinha54 commented Mar 20, 2025

in Stringset metrics which were being emitted by default from SDK and then later once service started processing and returning them a bug on parsing it on SDK side caused the job to fail with cast exception.

Since BoundedTrie metrics for lineage are guarded by flag and not emitted by default the result query will not try to parse it but since it is publicly exposed metric type someone can try to use it regardless. This safeguards till we update QM and do e2e testing.


Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:

  • Mention the appropriate issue in your description (for example: addresses #123), if applicable. This will automatically add a link to the pull request in the issue. If you would like the issue to automatically close on merging the pull request, comment fixes #<ISSUE NUMBER> instead.
  • Update CHANGES.md with noteworthy changes.
  • If this contribution is large, please file an Apache Individual Contributor License Agreement.

See the Contributor Guide for more tips on how to make review process smoother.

To check the build health, please visit https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md

GitHub Actions Tests Status (on master branch)

Build python source distribution and wheels
Python tests
Java tests
Go tests

See CI.md for more information about GitHub Actions CI or the workflows README to see a list of phrases to trigger workflows.

@rohitsinha54
Copy link
Contributor Author

R: @robertwb

Copy link
Contributor

Stopping reviewer notifications for this pull request: review requested by someone other than the bot, ceding control. If you'd like to restart, comment assign set of reviewers

} catch (ClassCastException cce) {
LOG.warn(
"Failed to retrieve BoundedTrie metrics from result. Returning empty result.", cce);
return BoundedTrieResult.empty();
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not super comfortable with returning an incorrect result here due to a class cast exception...

@robertwb
Copy link
Contributor

Will this code only be triggered if the user requests these values, or would it be triggered by other metrics request (like the last bug)? If the latter, could we skip these metrics rather than inject empty values on class cast failure?

@chamikaramj
Copy link
Contributor

@rohitsinha54 any updates ?

@rohitsinha54
Copy link
Contributor Author

Sorry for the late reply. This is not an urgent change (i.e. not essential for 2.64 release hence not actively following this on my time off).

This will only be triggered if the metric result contains bounded trie. So for 2.64 this would not happen unless a customer specifically opt-in to track lineage with boundedtrie or uses boundedtrie metrics (undocumented) in any other way. For the StringSet it happened in that way because lineage emission is enabled by default and was being tracked as StringSet and hence querying /metric for any metric results in StringSet being in result.

Yes, we can change it to do something else other than empty values. I will work on that. We will need this as a safeguard before we switch to bounded trie as default metric type for lineage tracking.

@rohitsinha54 rohitsinha54 marked this pull request as draft March 29, 2025 19:08
@robertwb
Copy link
Contributor

Thanks for following up.

We had a ClassCastException because of a bug + lack of any testing when StringSet was released. I don't know that we need to over-index on this bug here (both because we know we should test for it, and also because unlike the case with StringSet there's not a variety of Container types that make it hard to "guess" which one we should cast to).

But, as you say, not urgent.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants