Description
Hi,
We have been trying to use JEG in our Production systems and with time it is becoming increasingly necessary for us to collect and monitor the metric around the kernels being spawned, users using them, and the kind of requests being made to the JEG servers.
This being said, can we receive any guidance on how we should proceed with the aggregation of these metrics from the JEG servers for it be logged for monitoring purposes. To start with, we are thinking of incorporating the collection and monitoring of these metrics through "STATSD" library. https://pypi.org/project/pystatsd/
I am not sure if this really qualifies to be an issue, but this surely can be a feature add with this being the starting point.
We are looking to collect following generic information around the setup.
- Average number of active kernels per user.
- Total number of active kernels.
- Number of active users.
- Number of active kernels per OS type (Client OS).
RPS on JEG.
- Kernel launch requests
- Refresh/reconnect requests
- Get kernel/kernelspec requests
- Shutdown/restart kernel requests etc.