@panta-123
Contributor

  • Add figures to illustrate the monitoring setup.
  • Add configuration examples for common monitoring tools.
  • Give each monitoring tool its own section.
  • Add a note that the dashboard might be outdated.
  • Remove some development material, as these are operator docs, not development docs.
  • Listing metrics is hard to keep track of and risks becoming outdated, so a code search link is added.

@panta-123 requested review from bari12 and voetberg on November 11, 2025, 15:03
@voetberg
Contributor

These are overall really good changes! I think it is worth it to talk about how these can be actually set up (e.g., what sort of pods should be running, what sort of infra people need to run these monitoring things, the exact executables to run a hermes daemon, etc). I am unsure if this is outside the scope of this PR though (mostly just want to call this out as something that's missing)

@panta-123
Contributor Author

> These are overall really good changes! I think it is worth it to talk about how these can be actually set up (e.g., what sort of pods should be running, what sort of infra people need to run these monitoring things, the exact executables to run a hermes daemon, etc). I am unsure if this is outside the scope of this PR though (mostly just want to call this out as something that's missing)

I think we can add that the Hermes daemon is required and point to the daemon deployment doc.
But any other infrastructure deployment strategies should not be mentioned, as they are outside the scope of the Rucio docs.
The available integrations with infrastructure are already listed in the docs, and the related config choices are mentioned in this PR.

- Some spacing edits.
- Added a list of the different event types.
- Condensed some Traces descriptions.
Remove the link to
https://github.com/rucio/rucio/tree/master/tools/monitoring
as it is being removed
(see rucio/rucio#7375).
Remove each event's JSON and explain how to inspect
the events from the DB. Added the Hermes delivery format for
each option.
voetberg previously approved these changes Dec 9, 2025
Contributor

@rdimaio left a comment

Please fix merge conflict

@panta-123
Contributor Author

> Please fix merge conflict

Done.

voetberg previously approved these changes Dec 10, 2025
Capitalize some words throughout. Clarify some
sentences. Add backticks to some references to code,
commands, or types.