Skip to content

monitor.berlin.freifunk.net Performance optimization  #102

@nicolasberens

Description

@nicolasberens

The current state, especially with collectd is a bit suboptimal.

This is a draft to plan what can be done

There are a few things i am not happy with:

  • routers send data every 30 seconds
  • collectd works internally with a 10 second interval.
  • therefore all RRD files are created with a 10 second step size
  • RRDfiles only support one interval, so we need to recreate them or write a script that does it
  • prometheus scrapes it every 15 seconds

To test:

  • can we reduce the step size of files ?
  • does it save data/writes?
  • do we maybe want to drop some datapoints?
  • does a higher interval for RRD files make sense? (from 30 to 60 seconds?)

Planned Goal:

  • adjust settings where possible without big migrations
    • => prometheus
  • rewrite the collectd config to support chains: https://github.com/collectd/collectd/wiki/Chains
    • test if its possible to change the intervall per metric
    • migrate metrics 1:1 (e.g. cange interval of cpu load, rewrite rrd files, ten change the next metric)

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions