tests: fix flaky test caused by rapid IO reading and writing #2221

WilliamBergamin · 2025-04-02T20:53:22Z

Summary

All operating systems mask the fact that storage devices are slow by caching reads and writes. When you write to a file, it doesn't immediately write to the actual storage medium; it'll capture it in a cache, tell your program that the write has completed, and go and write the contents to the storage device in the background instead.

It also seems like Node.js can close a file and then try to interact with it before all data has been written to disk. In the context of the test (should close all files after writing them), the FileStateStore is asked to perform a burst of I/O disk writing followed by a read. This leads me to believe that a race condition is created where Node.js tries to interact with a file before the OS has completed data persistence to storage.

This PR updates the behavior of the flaky unit test by

reducing the number files written to disk
waiting between the write operation and read operation

Code cov test results

Requirements (place an `x` in each `[ ]`)

I've read and understood the Contributing Guidelines and have done my best effort to follow them.
I've read and agree to the Code of Conduct.

WilliamBergamin · 2025-04-02T21:29:13Z

Attempt 1 🟢
Attempt 2 🟢
Attempt 3 🟢
Attempt 4 🟢
Attempt 5 🟢
Attempt 6 🔴
Attempt 7 🟢

This did not solve the flakiness but did improve it

codecov · 2025-04-02T21:32:01Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 92.65%. Comparing base (31c60f8) to head (0d83a32).

✅ All tests successful. No failed tests found.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #2221   +/-   ##
=======================================
  Coverage   92.65%   92.65%           
=======================================
  Files          38       38           
  Lines       10527    10527           
  Branches      677      677           
=======================================
  Hits         9754     9754           
  Misses        761      761           
  Partials       12       12

Flag	Coverage Δ
cli-hooks	`95.23% <ø> (ø)`
cli-test	`94.76% <ø> (ø)`
oauth	`77.39% <ø> (ø)`
socket-mode	`61.82% <ø> (ø)`
web-api	`97.94% <ø> (ø)`
webhook	`96.65% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

🚀 New features to boost your workflow:

📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

zimeg

@WilliamBergamin This is a super interesting deep dive 🙏 ✨

I'm now curious about the fs operations happening since AFAICT these should by synchronous? Some behind the scenes magic might not be out of the question...

Approving this now since I think it's great continuations of the epic #2159 but I'm also wondering if you noticed certain failures during development? I left comments about logging that might be related!

Overall too I'm a fan of experimenting on these tests in main after a few tests like you've done. The current failure rate is 8.97% in the last 7 days and I'm hoping these changes decrease that 🏁

packages/oauth/src/state-stores/spec-utils.ts

hello-ashleyintech · 2025-04-03T13:32:28Z

packages/oauth/src/state-stores/spec-utils.ts

@@ -53,11 +53,15 @@ export class StateStoreChaiTestRunner {
        it('should detect multiple consumption', async () => {
          const { stateStore } = this;
          const installUrlOptions = { scopes: ['channels:read'] };
-          for (let i = 0; i < 200; i++) {
+          for (let i = 0; i < 100; i++) {


is reducing the limit by 100 an arbitrary choice for testing or is there a reason behind it?

Co-authored-by: Eden Zimbelman <[email protected]>

WilliamBergamin added 12 commits April 2, 2025 12:37

tests: attempt to fix flaky behavior in file state store

a299d6c

tests: fix flaky tests caused by rapid file reading and writing

5c88b25

fix linting issue

1142d77

trigger tests

eff35a1

trigger tests

cd34e8a

try something else

2961e5f

fix lint

3a90249

comment out unit test

88518c2

comment some suff

51ee243

Update file-state-store.spec.ts

02c4a73

Update spec-utils.ts

9174abb

tighten up the PR

8a1df44

WilliamBergamin added tests M-T: Testing work only pkg:oauth applies to `@slack/oauth` labels Apr 2, 2025

WilliamBergamin requested review from mwbrooks, hello-ashleyintech and zimeg April 2, 2025 20:53

WilliamBergamin self-assigned this Apr 2, 2025

Merge branch 'main' into fix-flaky-file-state-store-2

937529a

WilliamBergamin marked this pull request as ready for review April 2, 2025 21:31

zimeg approved these changes Apr 3, 2025

View reviewed changes

packages/oauth/src/state-stores/spec-utils.ts Outdated Show resolved Hide resolved

packages/oauth/src/state-stores/spec-utils.ts Show resolved Hide resolved

hello-ashleyintech reviewed Apr 3, 2025

View reviewed changes

WilliamBergamin and others added 2 commits April 7, 2025 11:00

Update packages/oauth/src/state-stores/spec-utils.ts

ffb250f

Co-authored-by: Eden Zimbelman <[email protected]>

Update packages/oauth/src/state-stores/spec-utils.ts

b562ce0

Co-authored-by: Eden Zimbelman <[email protected]>

zimeg mentioned this pull request Apr 17, 2025

chore(deps-dev): bump nock from 13.5.6 to 14.0.3 in /packages/web-api #2223

Merged

WilliamBergamin added 2 commits April 17, 2025 18:55

Merge branch 'main' into fix-flaky-file-state-store-2

d446a61

Merge branch 'main' into fix-flaky-file-state-store-2

0d83a32

hello-ashleyintech approved these changes May 6, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

tests: fix flaky test caused by rapid IO reading and writing #2221

tests: fix flaky test caused by rapid IO reading and writing #2221

Uh oh!

WilliamBergamin commented Apr 2, 2025 •

edited

Loading

Uh oh!

WilliamBergamin commented Apr 2, 2025

Uh oh!

codecov bot commented Apr 2, 2025 •

edited

Loading

Uh oh!

zimeg left a comment

Uh oh!

Uh oh!

Uh oh!

hello-ashleyintech Apr 3, 2025

Uh oh!

Uh oh!

tests: fix flaky test caused by rapid IO reading and writing #2221

Are you sure you want to change the base?

tests: fix flaky test caused by rapid IO reading and writing #2221

Uh oh!

Conversation

WilliamBergamin commented Apr 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Requirements (place an x in each [ ])

Uh oh!

WilliamBergamin commented Apr 2, 2025

Uh oh!

codecov bot commented Apr 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

zimeg left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

hello-ashleyintech Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

WilliamBergamin commented Apr 2, 2025 •

edited

Loading

Requirements (place an `x` in each `[ ]`)

codecov bot commented Apr 2, 2025 •

edited

Loading