-
Bug ReportDescriptionI have the following setup for a small Talos cluster:
I'm using KubeSpan to connect everything together; the cloud node has a firewall with 51820/UDP allowed which satisfies the requirement of one node being publically accessible. Until recently this setup worked flawlessly, however since a brief network outage one of the controllers cannot contact the worker & vice-versa; running I've tried the following:
I imagine this is some strange firewall issue but the fact that two out of three nodes are able to successfully connect is odd. LogsNothing of note in the logs except some warnings, which I assume happen during startup as the peers are established: Are there other logs which I should check? Environment
|
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
|
Converted to a discussion, as this is a question. I would start ensuring that discovery works, that all nodes see each other, e.g. with After that you can check |
Beta Was this translation helpful? Give feedback.
This ended up being an issue with my Multi-WAN setup, likely some NAT weirdness where packets would get routed to the wrong WAN interface and dropped if the connections were initialized before the router completely settled.
In case anyone runs into a similar problem, my solution was to add a hook that runs when the WAN falls over to reset open connections to the remote server (e.g.
conntrack -D --dst <server ip>), which triggers KubeSpan to reconnect via the new WAN interface.