fix: reduce hosts plugin refresh interval#8620
Conversation
There was a problem hiding this comment.
Pull request overview
This PR changes the systemd timer that periodically runs the AKS LocalDNS hosts setup job, reducing the refresh cadence so /etc/localdns/hosts is updated much more frequently.
Changes:
- Reduce
aks-localdns-hosts-setup.timerrefresh interval from 15 minutes to 10 seconds. - Tighten timer scheduling accuracy from 1 minute to 1 second.
- Update inline timer comments to match the new intended behavior.
|
why 10s? we went from 15min to 10s — that's ~90x more frequent. some concerns before this lands:
|
|
|
stop re-review |
Update the AKS LocalDNS hosts setup systemd timer to refresh every 10 seconds instead of every 15 minutes. Tighten timer accuracy to 1 second so the shorter cadence is honored. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Use OnUnitInactiveSec so the hosts setup service waits 10 seconds after each run completes before scheduling the next run. Add RandomizedDelaySec to de-synchronize nodes and reduce fleet-wide DNS bursts. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
30b6517 to
b3d5d92
Compare
| # Run immediately on boot | ||
| OnBootSec=0 | ||
| # Refresh every 15 minutes. AKS critical FQDN IPs can change due to load balancer | ||
| # rotation, Traffic Manager failover, or regional DNS updates. 15 minutes balances | ||
| # freshness against unnecessary DNS traffic — stale IPs would cause the hosts plugin | ||
| # to serve unreachable addresses until the next refresh. | ||
| OnUnitActiveSec=15min | ||
| # Refresh 10 seconds after each run completes. AKS critical FQDN IPs can change | ||
| # due to load balancer rotation, Traffic Manager failover, or regional DNS updates. | ||
| # Frequent refreshes keep stale IPs from causing the hosts plugin to serve |
Summary
Validation