costmatt-amzn
Contributor

feat(kinesisdataanalytics): Fix Checkpoint Failure Rate calculation and add Full Restart Rate monitoring

Changes

Fixed Checkpoint Failure Rate

  • Replaced manual rate calculation with CloudWatch's native RATE() function
  • Changed from (3600 * checkpoints) / PERIOD(checkpoints) to RATE(numberOfFailedCheckpoints)
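The difference between the two expressions can be sketched in plain TypeScript. This is a rough illustration rather than the library's code: the `Datapoint` type, the function names, and the sample values are hypothetical, and it assumes `numberOfFailedCheckpoints` is reported as a running total. `RATE()` is modeled per the CloudWatch metric math definition (change from the previous datapoint divided by the elapsed seconds), and `PERIOD()` as the period in seconds.

```typescript
// Hypothetical datapoint shape: a timestamp in seconds and the reported value.
interface Datapoint {
  timestampSeconds: number;
  value: number;
}

// Old expression, (3600 * m) / PERIOD(m): scales every datapoint on its own
// to an hourly figure, without looking at the previous datapoint.
function scaledToHour(points: Datapoint[], periodSeconds: number): number[] {
  return points.map((p) => (3600 * p.value) / periodSeconds);
}

// RATE(m): difference between a datapoint and the previous one, divided by
// the time between them in seconds. The first datapoint has no predecessor,
// so the result has one fewer entry than the input.
function ratePerSecond(points: Datapoint[]): number[] {
  const rates: number[] = [];
  let previous: Datapoint | undefined;
  for (const current of points) {
    if (previous !== undefined) {
      const elapsed = current.timestampSeconds - previous.timestampSeconds;
      rates.push((current.value - previous.value) / elapsed);
    }
    previous = current;
  }
  return rates;
}

// Made-up samples: a running total that sits at 4, then climbs to 7.
const samples: Datapoint[] = [
  { timestampSeconds: 0, value: 4 },
  { timestampSeconds: 300, value: 4 },
  { timestampSeconds: 600, value: 7 },
];

console.log(scaledToHour(samples, 300));
console.log(ratePerSecond(samples));
```

With these samples the old expression stays non-zero even while no new failures occur, whereas the rate is zero until the total actually changes.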

Added Full Restart Rate Monitoring

  • New metricFullRestartRate() method using RATE(fullRestarts)
  • New addFullRestartRateAlarm() with custom implementation to avoid naming conflicts
  • Enhanced Full Restarts widget to show both count (left axis) and rate (right axis)
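For readers who want to see what a `RATE(fullRestarts)` query looks like outside the library, here is a dependency-free sketch that builds the query list in the shape used by CloudWatch `GetMetricData`. It is not the PR's implementation: the function name, the `AWS/KinesisAnalytics` namespace with an `Application` dimension, the `Maximum` statistic, and the 300-second period are all assumptions chosen for illustration.

```typescript
// Minimal local types mirroring the GetMetricData query shape.
interface Dimension {
  Name: string;
  Value: string;
}

interface MetricStat {
  Metric: { Namespace: string; MetricName: string; Dimensions: Dimension[] };
  Period: number; // seconds
  Stat: string;
}

interface MetricDataQuery {
  Id: string;
  MetricStat?: MetricStat;
  Expression?: string;
  Label?: string;
  ReturnData: boolean;
}

// Hypothetical helper: returns the raw fullRestarts metric plus a math
// expression that derives its per-second rate of change.
function buildFullRestartRateQueries(applicationName: string): MetricDataQuery[] {
  return [
    {
      Id: "fullRestarts",
      MetricStat: {
        Metric: {
          Namespace: "AWS/KinesisAnalytics", // assumed namespace
          MetricName: "fullRestarts",
          Dimensions: [{ Name: "Application", Value: applicationName }], // assumed dimension
        },
        Period: 300, // assumed period
        Stat: "Maximum", // assumed statistic
      },
      ReturnData: false, // only feeds the expression below
    },
    {
      Id: "fullRestartRate",
      Expression: "RATE(fullRestarts)", // refers to the Id of the query above
      Label: "Full Restart Rate",
      ReturnData: true,
    },
  ];
}

console.log(JSON.stringify(buildFullRestartRateQueries("my-flink-app"), null, 2));
```

Keeping the raw count and the derived rate as separate queries mirrors the widget described above, where the count and the rate are plotted on different axes.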

Technical

  • Resolved alarm naming conflicts between full restart and checkpoint failure rate alarms
  • Added test coverage for new functionality
  • Five alarms are now supported (previously four)

Together, these changes make the checkpoint failure rate more accurate and add visibility into how often a Kinesis Data Analytics application goes through a full restart.

@costmatt-amzn force-pushed the feat/kinesisdatanalytics-full-restarts-rate branch from fcee57e to ddf761c on September 16, 2025 at 19:19