Conversation

@agahkarakuzu
Contributor

@agahkarakuzu agahkarakuzu commented Nov 11, 2025

Following discussions on #2413 and #1831, this PR introduces --execute-concurrency <n> only.

@agoose77 I tested this locally and it seems to be working. I did not implement an `if (execute) {` conditional in `process::site`, which, as @fwkoch noted, is not really needed. The only subtlety is the difference between the "effective" and "apparent" number of pages to execute, but users are free to set it as low as 1 if desired, as noted in the docs.

I hope I got the changeset right 👀

@changeset-bot

changeset-bot bot commented Nov 11, 2025

🦋 Changeset detected

Latest commit: 60717fc

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 3 packages:

  Name          Type
  myst-cli      Patch
  mystmd        Patch
  myst-migrate  Patch


@agoose77 agoose77 changed the title [ENH] Limit the number of simultaneous executions 🌍↔️🐍 Limit the number of simultaneous executions Nov 12, 2025
@agoose77 agoose77 added the enhancement New feature or request label Nov 12, 2025
Contributor

@bsipocz bsipocz left a comment


Thank you so much for fixing this. I only have a few minor comments.

@agoose77 agoose77 self-requested a review November 12, 2025 13:50
Contributor

@agoose77 agoose77 left a comment


@agahkarakuzu I think I'd like to change the approach here to use a semaphore, rather than limiting transforms of the pages. I am thinking of limiting execution as a particular example of concurrency control that differs from the general concurrent processing problem. A reasonable amount of transform work is network requests, so I'd like for those to remain in-flight where possible — and we probably want another setting for throttling concurrent fetch in future.

I can do the work to rework this to use e.g. https://www.npmjs.com/package/async-mutex, or if you have the capacity, feel free. The idea would be to pass the semaphore in to the transformMdast function, and use the runExclusive method to decorate the kernelExecutionTransform — I don't think concurrency is something that transform needs to worry about.
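Concretely, the wiring could look something like the sketch below. The `runExclusive` API mirrors `async-mutex`, but the inline `Semaphore` implementation and the `transformMdast`/`kernelExecutionTransform` wiring are illustrative only, not the PR's actual code:

```typescript
// Minimal counting semaphore mirroring async-mutex's runExclusive API.
// Illustrative sketch only; the real change would depend on async-mutex.
class Semaphore {
  private waiters: Array<() => void> = [];
  constructor(private permits: number) {}

  async runExclusive<T>(task: () => Promise<T>): Promise<T> {
    if (this.permits > 0) {
      this.permits -= 1;
    } else {
      // Wait until a running task hands its permit over.
      await new Promise<void>((resolve) => this.waiters.push(resolve));
    }
    try {
      return await task();
    } finally {
      const next = this.waiters.shift();
      if (next) next(); // pass the permit directly to the next waiter
      else this.permits += 1;
    }
  }
}

// Hypothetical wiring: only kernel execution is throttled; other transforms
// (e.g. network fetches) stay fully concurrent.
async function transformMdast(page: string, executeSemaphore: Semaphore) {
  // ...unthrottled transforms for `page` would run here...
  await executeSemaphore.runExclusive(async () => {
    // kernelExecutionTransform(page) would run here
    await new Promise((resolve) => setTimeout(resolve, 10));
  });
}
```

The semaphore would be created once per build (sized from the CLI option) and passed down, so the transform itself never has to reason about concurrency.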

Longer term, we'll end up refactoring this to have some ExecutionOrchestrator handle this, but I think that relates to #2413.

@agahkarakuzu
Contributor Author

@agoose77 that's indeed a cleaner solution and better paves the way for an orchestration approach. I'll take a stab at it.

@agoose77
Contributor

What a star! ⭐

@agahkarakuzu
Contributor Author

agahkarakuzu commented Nov 15, 2025

@agoose77 I added semaphore.runExclusive at the kernelExecutionTransform level as you suggested, and it appears to work. In the example below, the first notebook that sleeps for 10 seconds is correctly blocking the rest of the executable content.

[screenshot of the build output showing serialized execution]

I am just not really sure if this is elegant or acceptable:

const opts = { ...program.opts(), ...this.opts() } as SessionOpts;

But I needed a way to pass --execute-parallel to this session.

Comment on lines 44 to 46
By default, up to {math}`N-1` executable files are run concurrently, where {math}`N` is the number of available CPUs.

You can change this by using the `--execute-parallel <n>` option in your build command, where `<n>` sets the maximum number of executable documents to run at once. For example, using `--execute-parallel 1` will run the documents one after another.
Collaborator


Can you add a short "this is useful when___" type of sentence so users know why they might want to do this?

export function makeExecuteParallelOption() {
  const defaultParallelism = Math.max(1, cpus().length - 1);
  return new Option('--execute-parallel <n>', `Maximum number of notebooks to execute in parallel`)
    // Note: a bare `.argParser(parseInt)` is a known commander gotcha — the
    // second argument (the previous/default value) would be used as the radix.
    .argParser((value) => parseInt(value, 10))
    .default(defaultParallelism);
}
Collaborator


will this ensure that --execute-parallel foo doesn't work? Should we test for that?

Collaborator


What happens if --execute-parallel 0?
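Both questions come down to input validation. A validating parser along these lines (a hypothetical sketch, not the PR's code) would reject both `foo` and `0`:

```typescript
// Hypothetical validator for --execute-parallel <n>. Commander would call
// this with the raw string value; names here are illustrative only.
function parseParallelism(value: string): number {
  const n = Number.parseInt(value, 10);
  if (Number.isNaN(n)) {
    throw new Error(`--execute-parallel expects a positive integer, got "${value}"`);
  }
  if (n < 1) {
    throw new Error(`--execute-parallel must be at least 1, got ${n}`);
  }
  return n;
}
```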

Collaborator

@choldgraf choldgraf left a comment


I think this looks useful! Thanks for the contribution. My main suggestion is that we add some kind of testing (both for edge cases like --execute-parallel foo and to ensure it's actually doing what we think it is).

@agahkarakuzu
Contributor Author

Thanks for the feedback @choldgraf, I’ve pushed updates to address your comments.

Regarding testing, we could add three notebooks that each wait for about 5 seconds, then assert that the total build time is at least 15 seconds with `--execute-parallel 1`, and around 5 seconds with `--execute-parallel 3`?

Since the transformations run in a non-deterministic order, it is not possible to create a test case where the output of one serves as the input to another.
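Scaled down to milliseconds, the shape of such a timing test might look like the sketch below. The `limitConcurrency` helper is illustrative, standing in for the build's actual throttling:

```typescript
// Run async tasks with at most `limit` in flight, then compare wall-clock
// times: limit 1 should be roughly serial, limit 3 roughly parallel.
async function limitConcurrency<T>(tasks: Array<() => Promise<T>>, limit: number): Promise<T[]> {
  const results: T[] = new Array(tasks.length);
  let nextIndex = 0;
  async function worker() {
    while (nextIndex < tasks.length) {
      const i = nextIndex++;
      results[i] = await tasks[i]();
    }
  }
  await Promise.all(Array.from({ length: Math.min(limit, tasks.length) }, worker));
  return results;
}

// Stand-in for a notebook that sleeps (50 ms instead of 5 s).
const sleepTask = () => new Promise<void>((resolve) => setTimeout(resolve, 50));

const serialStart = Date.now();
await limitConcurrency([sleepTask, sleepTask, sleepTask], 1);
const serialMs = Date.now() - serialStart;

const parallelStart = Date.now();
await limitConcurrency([sleepTask, sleepTask, sleepTask], 3);
const parallelMs = Date.now() - parallelStart;
```

With generous margins (serial should take at least ~150 ms, parallel roughly 50 ms) the assertions should tolerate scheduler jitter, though in the real suite kernel startup time would dominate.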

@choldgraf
Collaborator

hmmm, could we make that more like .5 seconds each? I don't want to just auto-add 15 seconds to the test suite 😅

If nobody has better ideas for how to test this, I'd also be fine just leaving it and seeing if users complain about it or not

@agoose77
Contributor

I think the fastest robust way to do this is to use concurrency primitives like a barrier and a mutex. The easiest approach is probably to create these in a Python process and share their pickled representations via the filesystem or environment variables. This would require a small addition to the test harness.

We can then ensure that
A) only a single document can hold a lock at once. If the lock is busy, we fail.
B) all files grab the lock before execution can proceed for any of them. If the barrier times out, we fail.
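Check (A) amounts to a fail-fast lock. This TypeScript sketch shows the shape of the invariant (the actual proposal above would use pickled Python primitives shared with the kernel processes; the names here are illustrative):

```typescript
// Fail-fast lock for check (A): the test fails the moment two documents
// hold the lock simultaneously, rather than blocking.
class FailFastLock {
  private held = false;

  acquire(): void {
    if (this.held) throw new Error('concurrency violation: lock already held');
    this.held = true;
  }

  release(): void {
    if (!this.held) throw new Error('release without acquire');
    this.held = false;
  }
}
```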

@agoose77
Contributor

agoose77 commented Nov 17, 2025

Testing this logic is a bit awkward because the implementation is at the end-to-end (E2E) level, rather than within the core execution package.

This makes sense from a design standpoint, because the concept of dependencies and ordering is a higher-level one than "execute this file". Unfortunately, testing this is then fiddly because we need to do so in a mostly stateless way.

For now, it's possible to implement some good-enough tests that use the filesystem and delays (although these are sensitive to kernel startup time).

Once we have dependency ordering, it should simplify our tests but we might wish to implement a helper such as a socket server that enforces resource counting (e.g. ensure this lock counter never exceeds 1, or ensure this lock counter reaches 2).

@agoose77
Contributor

I've run out of time to debug this further today, so I'd welcome anyone to take it over the line. I'm not immediately sure why the tests pass locally but fail on CI.

