Skip to content

Conversation

@austenLacy
Copy link

What

Reduces overly spammy logs that are exacerbated in larger clusters. In a cluster with 100s of tservers some of these logs were logged millions of times per hour.

@CLAassistant
Copy link

CLAassistant commented Nov 13, 2025

CLA assistant check
All committers have signed the CLA.

@netlify
Copy link

netlify bot commented Nov 13, 2025

Deploy Preview for infallible-bardeen-164bc9 ready!

Built without sensitive environment variables

Name Link
🔨 Latest commit f7097f8
🔍 Latest deploy log https://app.netlify.com/projects/infallible-bardeen-164bc9/deploys/691654bc59262000088d99ff
😎 Deploy Preview https://deploy-preview-29350--infallible-bardeen-164bc9.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@austenLacy austenLacy force-pushed the al-reduce-overly-spammy-logs branch from f7097f8 to 940de07 Compare November 17, 2025 14:05
@austenLacy austenLacy marked this pull request as ready for review November 17, 2025 15:11

auto get_stream_metadata = GetStream(stream_id, RefreshStreamMapOption::kIfInitiatedState);
if (!get_stream_metadata.ok()) {
LOG(WARNING) << "Read invalid stream id: " << stream_id << " for tablet " << tablet_id << ": "
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should NOT be seeing too many of this one

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In the past hour, we've seen this log 1.7M times. This is a factor of the number of tablets of which we have around 21k currently. Not all of those tablets are being CDC'd but many thousands are.

Do you have recommendations for what we should look into since we are seeing this log as frequently as we do?

return;
}
YB_LOG_EVERY_N_SECS(WARNING, 1) << "No local transaction status tablet found";
YB_LOG_EVERY_N_SECS(WARNING, 10) << "No local transaction status tablet found";
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Setting to 10s is fine. But if you see this then it means you dont have enough transaction status tablets and should take action to create more soon .

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

All of our transaction status tables are between 200-450 tablets. What's the scaling factor we should consider for these tablets?

@hari90 hari90 changed the title Reduce spammy logs in cdc service, tx manager, and rocksdb operations DocDB: Reduce spammy logs in cdc service, tx manager, and rocksdb operations Nov 18, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants