Open
Description
To measure and debug and collect metrics on Gate, however, the deployment looks like, we should support tracing and metrics using https://opentelemetry.io/ SDK such that other tooling like Grafana Tempo, Prometheus, Jaeger, etc. can receive data from a running Gate process. This will help users to gain more visibility into their Gate deployment, measure and help fix latency issues, track player sessions over time, ...
Users can also opt in to anonymous data collection to help us understand usage and performance bottlenecks.
- instrument important functions
- existing Gate events should also be otel events
- add documentation with sample grafana dashboard and loki backend (or managed alternatives like honeycomb/newrelic)