
Conversation

@richard-ramos (Member) commented Sep 4, 2025

closes: #1642

github-actions bot (Contributor) commented Sep 4, 2025

🏁 Performance Summary

Commit: d0df6e552f1095d00e99b79ea9834f5d4e4ca6fa

| Scenario | Nodes | Total messages sent | Total messages received | Latency min (ms) | Latency max (ms) | Latency avg (ms) |
|---|---|---|---|---|---|---|
| Base test | 10 | 100 | 900 | 0.296 | 1.811 | 0.867 |
| Low Bandwidth rate 256kbit burst 8kbit limit 5000 | 10 | 100 | 900 | 0.206 | 37.458 | 5.617 |
| Packet Reorder 15% 40% with 2ms delay | 10 | 100 | 900 | 0.253 | 5.549 | 2.880 |
| Queue Limit 5 | 10 | 100 | 900 | 0.259 | 2.170 | 0.849 |
| Latency 100ms 20ms | 10 | 100 | 900 | 39.796 | 238.521 | 115.322 |
| Burst Loss 8% 30% | 10 | 100 | 900 | 0.253 | 2.125 | 0.851 |
| Duplication 2% | 10 | 100 | 900 | 0.282 | 2.159 | 0.901 |
| Corruption 0.5% | 10 | 100 | 900 | 0.289 | 2.195 | 0.869 |
| Packet Loss 5% | 10 | 100 | 900 | 0.300 | 2.160 | 0.897 |
| Combined Network Conditions | 10 | 100 | 900 | 0.226 | 299.581 | 127.329 |

📊 View Latency History and full Container Resources in the Workflow Summary

@richard-ramos (Member, Author)

Interop's green again :)

@arnetheduck (Contributor) commented Sep 5, 2025

So .. I only have a very vague recollection of why the EOF waiting made sense, and maybe it has been solved in another way since. But the way things were back then: you would close a stream, but if it still had unread data associated with it - such as the "virtual" EOF marker or any in-flight, unconsumed buffered data - it would linger in memory and take up resources in the stream multiplexer, because "reading" from the stream is what drives "forward motion" in its operation. Without anything reading, the multiplexer (mplex in particular) would eventually get stuck: it runs out of buffer and starts blocking reads on other streams.

closeWithEOF provided exactly that: a read loop that consumes everything from the stream as part of the close, which helps it "drive" shutdown in an orderly way and have its resources released (see the sketch below).

This is similar to how, when you close a BSD socket, the kernel maintains a thread that consumes the rest of the socket and makes sure all the FINs and ACKs and so on happen - we don't have a "reading" thread in general, but the problem remains relevant. The other relevant thing here is to handle the case where the application has an ongoing readXxx and close gets called from another async task.

What's tricky about problems like this is that they manifest as "long-term" leaks: there's a slow buildup until suddenly things stop working. If you change these parts, make sure to run long-term tests on an active network (like nimbus-eth2) and monitor for resource leaks and blocked streams.
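
For illustration, a minimal, self-contained sketch of that drain-on-close idea, assuming chronos for async; `SketchStream`, `readOnce`, `closeImpl` and the buffering model are made-up stand-ins, not the actual LPStream API:

```nim
# Illustrative only: a toy stream that models EOF as "no more buffered data".
import chronos

type SketchStream = ref object
  pending: seq[byte]   # unread, buffered data still associated with the stream
  closed: bool

proc readOnce(s: SketchStream, n: int): Future[seq[byte]] {.async.} =
  ## Return up to `n` buffered bytes; an empty result means EOF.
  if s.pending.len == 0:
    return newSeq[byte](0)
  let take = min(n, s.pending.len)
  result = s.pending[0 ..< take]
  s.pending = s.pending[take .. ^1]

proc closeImpl(s: SketchStream) {.async.} =
  ## Release the stream's resources (placeholder).
  s.closed = true

proc closeWithEOF(s: SketchStream) {.async.} =
  ## Drain everything still buffered before closing, so the multiplexer
  ## can keep making progress and the stream's resources get released.
  while true:
    let chunk = await s.readOnce(4096)
    if chunk.len == 0:
      break
  await s.closeImpl()

when isMainModule:
  let s = SketchStream(pending: @[byte 1, 2, 3])
  waitFor s.closeWithEOF()
  doAssert s.closed and s.pending.len == 0
```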

@richard-ramos (Member, Author)

I see.
I think in this case the proper 'fix' is to reset the stream instead of doing a close or a closeWithEOF, since the remote side will not be sending any further data. I will convert this PR back to draft and work on this. Thank you
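
Under the same toy model as the sketch above (made-up names, not the actual nim-libp2p API), a reset would discard any unread data and release the stream immediately, with no drain loop needed since the remote won't send anything further:

```nim
# Illustrative only: reset tears the stream down right away.
import chronos

type SketchStream = ref object
  pending: seq[byte]   # unread, buffered data
  closed: bool

proc reset(s: SketchStream) {.async.} =
  ## Abruptly terminate the stream: discard unread data and release it
  ## immediately instead of reading it to EOF first.
  s.pending.setLen(0)
  s.closed = true

when isMainModule:
  let s = SketchStream(pending: @[byte 1, 2, 3])
  waitFor s.reset()
  doAssert s.closed and s.pending.len == 0
```

The trade-off is that a reset is abrupt: any data still in flight is dropped, which is acceptable here precisely because the remote is done sending.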

@richard-ramos richard-ramos marked this pull request as draft September 5, 2025 12:21

Development

Successfully merging this pull request may close these issues.

quic: transport interop test with zig is failing
