SignalFx's Terraform SLx Module

This module provides convenient tooling for creating detectors, dashboards and other assets using best practices on the SignalFx platform.

Note: This module is considered "alpha" content and should not yet be relied on for production usage. See the TODO section below.

Features

By using this module you get the following great features:

externally managed template for generating per-service dashboards and detectors (using this Terraform module)
- versions so you can opt in to new behavior at your own pace
industry best-practices layout with RED metrics at the top
numerous best-practices from extensive dashboard research (parts 1, 2, 3, and 4)
- units and bounds wherever applicable to ease comprehension
- pleasing wider-than-tall 3-wide grid
- per-user opt-in for color blind modes
- on-chart watermarks showing SLO targets
- easy to interpret, threshold-based coloring of "instant" values
team links for dashboards and detectors
alerting based on SLO violations
- configurable (defaults to 1m) notification of SLO violations
- detectors notify team to leverage notification policies
- alerts are linked to relevant charts in dashboards for signaling problems
error budget support
- uses the error ratio (97% success SLO gives a 3% error budget)
- visualization on main dashboard
- detector that issues info level alerts to team
support for adding your own important charts below the built in content
situational awareness
- deploy events
- feature flag events

How to Use It

You'll be using the SignalFx Terraform provider.

Next, your service(s) will need to isolate their SLI metrics and any defined SLO thresholds.

Note: When you specify the queries, remember to specify the appropriate rollup policy. Depending on metric type and meaning, you might want to use average, sum, min or max!

To create resources using this module, you can then include it in your existing Terraform like so:

# You can invoke this many times, once for each service!
module "service_fartsapi_slx" {
  source = "github.com/signalfx/terraform-signalfx-slx"
  version = "0.0.1"

  service_name                          = "FartAPI"
  responsible_team                      = "abc123"
  successful_operations_sli_count_query = "data('request_duration_millis_count', filter=filter('code', '200')).sum()"
  total_operations_sli_count_query      = "data('request_duration_millis_count').sum()"
  error_operations_sli_count_query      = "data('errors_encountered_total').sum()"
  operation_time_sli_query              = "data('request_duration_millis_quantile', filter=filter('quantile', '0.990000')).mean()"
  operation_time_sli_unit               = "Millisecond"
  operation_time_slo_target             = 500
  operation_success_ratio_slo_target    = 97.00
}

# You can also define your own charts to add to the end!
resource "signalfx_time_chart" "someother_chart" {
    name = "Custom Chart!"

    program_text = <<-EOF
        A = data("cpu.utilization").publish(label="CPU Utilization")
        EOF

    time_range = 900

    plot_type = "LineChart"
    show_data_markers = true
}

# Make a dashboard group to put it in
resource "signalfx_dashboard_group" "slx_example" {
    name = "SLx Example"
    description = "Cool dashboard group"
    teams = ["abc123"]
}

# Create the actual dashboard using the output of the module. (See `chart_ids`)
resource "signalfx_dashboard" "slx_prefixed_thing" {
    name = "SLx Test Prefix Dashboard"
    dashboard_group = "${signalfx_dashboard_group.slx_example.id}"
    time_range = "-15m"

    grid {
        chart_ids = concat(module.service_a_slx.charts,
                signalfx_time_chart.someother_chart.*.id)
        width = 4
        height = 1
    }
}

Events

To work with the deploy and feature flag events, use the following event names:

Deploy for deploys with tag service that matches the service name argument
Feature Flag for feature flags with tag service that matches the service name argument

TODO

Write some accompanying content
Template vars?
More IA (service dashboards, etc)
Runbooks
Customizable event signal definitions

PROBLEMS

Can't use secondary visualization of Linear because the labels overlap when super close.

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
charts		charts
dashboards		dashboards
detectors		detectors
images		images
README.md		README.md
inputs.tf		inputs.tf
main.tf		main.tf
outputs.tf		outputs.tf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SignalFx's Terraform SLx Module

Features

How to Use It

Events

TODO

PROBLEMS

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SignalFx's Terraform SLx Module

Features

How to Use It

Events

TODO

PROBLEMS

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages