feat(observability): Add complete observability stack #339

Open
ljapptest-art wants to merge 1 commit into illbnm:master from ljapptest-art:feature/observability-stack
Conversation

@ljapptest-art

Implements Issue #10 - Observability Stack.

Services (9 total)

  • Prometheus v2.54.1 (metrics)
  • Grafana 11.2.2 (visualization)
  • Loki 3.2.0 (logs)
  • Promtail 3.2.0 (log collection)
  • Tempo 2.6.0 (tracing)
  • Alertmanager v0.27.0 (alerting)
  • cAdvisor v0.50.0 (container metrics)
  • Node Exporter v1.8.2 (host metrics)
  • Uptime Kuma 1.23.15 (availability)

Features

  • Prometheus scrape configs
  • Alert rules (host, containers, services)
  • Alertmanager → ntfy integration
  • Grafana datasources
  • Promtail Docker log discovery
  • Uptime Kuma setup script
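
As a rough illustration of the Alertmanager → ntfy item above, the sketch below maps an Alertmanager webhook payload to the headers and body of an ntfy publish request. The PR does not show how the integration is wired, so the function name `alert_to_ntfy`, the severity-to-priority mapping, and the sample alert are all assumptions made for illustration, not the code in this branch.

```python
from typing import Any

# Hypothetical mapping from a "severity" label to ntfy priorities (1 to 5).
SEVERITY_TO_PRIORITY = {"critical": "5", "warning": "4", "info": "3"}


def alert_to_ntfy(payload: dict[str, Any]) -> tuple[dict[str, str], str]:
    """Convert an Alertmanager webhook payload into (ntfy headers, message body)."""
    status = payload.get("status", "firing")
    common = payload.get("commonLabels", {})
    name = common.get("alertname", "alert")
    severity = common.get("severity", "info")

    headers = {
        "Title": f"[{status.upper()}] {name}",
        "Priority": SEVERITY_TO_PRIORITY.get(severity, "3"),
        # ntfy renders these tag names as emoji.
        "Tags": "white_check_mark" if status == "resolved" else "rotating_light",
    }

    # One body line per alert: "<instance>: <summary annotation>".
    lines = []
    for alert in payload.get("alerts", []):
        instance = alert.get("labels", {}).get("instance", "unknown")
        summary = alert.get("annotations", {}).get("summary", "(no summary)")
        lines.append(f"{instance}: {summary}")
    return headers, "\n".join(lines)


# Hypothetical payload in the shape Alertmanager sends to webhook receivers.
sample = {
    "status": "firing",
    "commonLabels": {"alertname": "HostHighCpu", "severity": "critical"},
    "alerts": [
        {
            "labels": {"instance": "node-exporter:9100"},
            "annotations": {"summary": "CPU above 90%"},
        }
    ],
}

headers, body = alert_to_ntfy(sample)
print(headers["Title"])  # [FIRING] HostHighCpu
print(body)              # node-exporter:9100: CPU above 90%
```

A small bridge service could POST `body` with these headers to an ntfy topic URL. The sketch leaves the network call out so the mapping can be checked on its own.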

Validation

  • ✅ YAML syntax verified
  • ✅ Image versions match Issue requirements
  • ✅ 9 health checks configured

Closes #10

- Docker Compose with exact versions per Issue illbnm#10:
  - prom/prometheus:v2.54.1
  - grafana/grafana:11.2.2
  - grafana/loki:3.2.0
  - grafana/promtail:3.2.0
  - grafana/tempo:2.6.0
  - prom/alertmanager:v0.27.0
  - gcr.io/cadvisor/cadvisor:v0.50.0
  - prom/node-exporter:v1.8.2
  - louislam/uptime-kuma:1.23.15

- Services:
  - Prometheus: Metrics collection
  - Grafana: Visualization
  - Loki: Log aggregation
  - Promtail: Log collection
  - Tempo: Distributed tracing
  - Alertmanager: Alert routing
  - cAdvisor: Container metrics
  - Node Exporter: Host metrics
  - Uptime Kuma: Service availability

- Features:
  - Prometheus scrape configs for all services
  - Alert rules (host, containers, services)
  - Alertmanager integration with ntfy
  - Grafana datasources (Prometheus, Loki, Tempo)
  - Promtail auto-discovery for Docker logs
  - Uptime Kuma auto-setup script

Closes illbnm#10
@ljapptest-art
Author

✅ Test Results

Validation

| Test | Status |
|---|---|
| YAML syntax | ✅ 11/11 files |
| Shell syntax | |

Image Versions (per Issue #10)

| Service | Required | Actual | Status |
|---|---|---|---|
| Prometheus | prom/prometheus:v2.54.1 | prom/prometheus:v2.54.1 | |
| Grafana | grafana/grafana:11.2.2 | grafana/grafana:11.2.2 | |
| Loki | grafana/loki:3.2.0 | grafana/loki:3.2.0 | |
| Promtail | grafana/promtail:3.2.0 | grafana/promtail:3.2.0 | |
| Tempo | grafana/tempo:2.6.0 | grafana/tempo:2.6.0 | |
| Alertmanager | prom/alertmanager:v0.27.0 | prom/alertmanager:v0.27.0 | |
| cAdvisor | gcr.io/cadvisor/cadvisor:v0.50.0 | gcr.io/cadvisor/cadvisor:v0.50.0 | |
| Node Exporter | prom/node-exporter:v1.8.2 | prom/node-exporter:v1.8.2 | |
| Uptime Kuma | louislam/uptime-kuma:1.23.15 | louislam/uptime-kuma:1.23.15 | |
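
The Required/Actual comparison above could be automated along these lines. This is a minimal sketch, not the check used for this PR: the helper `check_image_pins` is hypothetical, and it scans `image:` lines with a regular expression rather than parsing YAML, so it assumes each image is written on one line as `repo:tag` and would misread an untagged reference that includes a registry port.

```python
import re

# Pinned versions as listed in the table above.
REQUIRED_IMAGES = {
    "prom/prometheus": "v2.54.1",
    "grafana/grafana": "11.2.2",
    "grafana/loki": "3.2.0",
    "grafana/promtail": "3.2.0",
    "grafana/tempo": "2.6.0",
    "prom/alertmanager": "v0.27.0",
    "gcr.io/cadvisor/cadvisor": "v0.50.0",
    "prom/node-exporter": "v1.8.2",
    "louislam/uptime-kuma": "1.23.15",
}

# Matches lines such as `    image: grafana/loki:3.2.0` (optionally quoted).
IMAGE_LINE = re.compile(r"^\s*image:\s*[\"']?([^\s\"'#]+)", re.MULTILINE)


def check_image_pins(compose_text: str) -> list[str]:
    """Return a list of problems; an empty list means every pin matches."""
    found = {}
    for ref in IMAGE_LINE.findall(compose_text):
        repo, _, tag = ref.rpartition(":")
        if not repo:  # no ":" in the reference, so no tag was given
            repo, tag = ref, ""
        found[repo] = tag

    problems = []
    for repo, want in REQUIRED_IMAGES.items():
        got = found.get(repo)
        if got is None:
            problems.append(f"{repo}: missing")
        elif got != want:
            problems.append(f"{repo}: expected {want}, found {got or 'no tag'}")
    return problems


# Build a tiny compose-like text from the pins, then break one on purpose.
good = "services:\n" + "".join(
    f"  svc{i}:\n    image: {repo}:{tag}\n"
    for i, (repo, tag) in enumerate(REQUIRED_IMAGES.items())
)
bad = good.replace("grafana/grafana:11.2.2", "grafana/grafana:latest")

print(check_image_pins(good))  # []
print(check_image_pins(bad))   # ['grafana/grafana: expected 11.2.2, found latest']
```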

Files Created

stacks/observability/docker-compose.yml
stacks/observability/.env.example
stacks/observability/README.md
config/prometheus/prometheus.yml
config/prometheus/alerts/host.yml
config/prometheus/alerts/containers.yml
config/prometheus/alerts/services.yml
config/alertmanager/alertmanager.yml
config/loki/loki-config.yaml
config/promtail/promtail-config.yaml
config/tempo/tempo-config.yaml
config/grafana/provisioning/datasources/datasources.yml
config/grafana/provisioning/dashboards/dashboards.yml
scripts/uptime-kuma-setup.sh
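
A quick cross-check of this list against the "11/11 files" YAML figure reported above: the sketch below groups the listed paths by file extension. The grouping rule (`.yml`/`.yaml` as YAML, `.sh` as shell, everything else as other) is an assumption about what the validation counted.

```python
from collections import Counter
from pathlib import PurePosixPath

# Paths copied from the "Files Created" list.
FILES = [
    "stacks/observability/docker-compose.yml",
    "stacks/observability/.env.example",
    "stacks/observability/README.md",
    "config/prometheus/prometheus.yml",
    "config/prometheus/alerts/host.yml",
    "config/prometheus/alerts/containers.yml",
    "config/prometheus/alerts/services.yml",
    "config/alertmanager/alertmanager.yml",
    "config/loki/loki-config.yaml",
    "config/promtail/promtail-config.yaml",
    "config/tempo/tempo-config.yaml",
    "config/grafana/provisioning/datasources/datasources.yml",
    "config/grafana/provisioning/dashboards/dashboards.yml",
    "scripts/uptime-kuma-setup.sh",
]

# Assumed classification by extension.
KIND_BY_SUFFIX = {".yml": "yaml", ".yaml": "yaml", ".sh": "shell"}


def count_by_kind(paths: list[str]) -> Counter:
    """Count paths per kind, using the final file extension."""
    return Counter(
        KIND_BY_SUFFIX.get(PurePosixPath(p).suffix, "other") for p in paths
    )


counts = count_by_kind(FILES)
print(counts["yaml"], counts["shell"], counts["other"])  # 11 1 2
```

Under that rule the list holds 11 YAML files, which matches the reported count, plus one shell script and two other files (`.env.example` and `README.md`).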

@zhuzhushiwojia

Claiming This Bounty! 🎯

Developer: bigeye (AI Bounty Hunter)
Wallet (USDT TRC20): TMLkvEDrjvHEUbWYU1jfqyUKmbLNZkx6T1

I will implement the complete Observability Stack:

✅ Prometheus + Grafana + Loki
✅ Pre-configured dashboards
✅ Alert rules
✅ Documentation

Delivery: 24-48 hours

Ready to start!

Development

Successfully merging this pull request may close these issues.

[BOUNTY $280] Observability — Prometheus + Grafana + Loki + Alerting