Fix: remove stale flannel.1 before restart k3s#5672
Fix: remove stale flannel.1 before restart k3s#5672naiming-zededa wants to merge 1 commit intolf-edge:masterfrom
Conversation
- to fix a flannel v0.27.4 possiblly introduced a nil-pointer bug in watchVXLANDevice. During the k3s transition, had a SIGSEGV - this potential fix is to remove the stale flannel.1 before the k3s restart, so the flannel fresh restart won't hit this bug Signed-off-by: naiming-zededa <naiming@zededa.com>
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## master #5672 +/- ##
==========================================
+ Coverage 19.52% 29.49% +9.96%
==========================================
Files 19 18 -1
Lines 3021 2417 -604
==========================================
+ Hits 590 713 +123
+ Misses 2310 1552 -758
- Partials 121 152 +31 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
|
@naiming-zededa , isn't any fix already available for flannel-io? It's better to integrate the proper fix.... |
|
@naiming-zededa , here is what claude says: The search results mention a relevant PR — let me look more closely at the Found It: PR #2272, Fixed in v0.27.4Your crash in
The crash is a nil pointer dereference in You're on v0.27.4 (per your stacktrace module path), which means this fix is already included in your version. That tells us one of two things:
Recommended next steps
|
|
Hey @rene , the k3s version we have, K3s v1.34.2+k3s1, the flannel v0.27.4 is included. actually it is this flannel PR 2272 introduced this crash. but we are not going to back down the k3s version (flannel is part of k3s). |
|
Also, regardless they will fix this crash later or not, this PR is a protective measure, it is good to have anyway. |
sure, but I was looking for the proper fix..... |
Description
PR dependencies
How to test and validate this PR
this panic from flannel has been seen twice, so it may not be easily reproduced.
but the testing needs to exercise multiple cluster transitions (single-node to multiple nodes, and vice versa) to make sure there is no issues from this patch, and cluster still works fine
Changelog notes
Fix: remove stale flannel.1 before restart k3s
PR Backports
Checklist
For backport PRs (remove it if it's not a backport):
And the last but not least:
check them.