fix(core): keep web up during graceful shutdown #11917

yuri1969 · 2025-10-09T17:14:06Z

What changes are being made and why?

The @PreDestroy hook is triggered very late in the lifecycle.

Started the worker's graceful cleaning process using the global ShutdownEvent which fires way earlier.

I was on the fence with removing the worker's @PreDestroy hook.

How the changes have been QAed?

watch -n 1 'curl http://localhost:8081/prometheus'

Running the standalone server, executing the following flow and then quitting the server:

id: crane_325035
namespace: company.team

tasks:
  - id: sleep
    type: io.kestra.plugin.scripts.shell.Script
    script: |
      trap 'echo "nope"' INT
      sleep 60

loicmathieu

Shouldn't the @PreDestroy annotation be removed from the Worker?
Anyway close() is safe to be used multiple time but as we move the closing of the Worker here it may be a good idea to make it clear (and also add a comment inside the Worker itself).

@fhussonnois WDYT?

fhussonnois

Hey @yuri1969, thank you for your contribution. I'm OK with the use of an ApplicationEventListener to manage this issue. However, the proposed solution is not ideal for me. We should not explicitly reference the worker, because it will only work for standalone. In addition, I think the HTTP server should be keep up for any services.

So maybe we could just delay the shutdown of the embedded netty server while all services are not closed, with something like this:

=> This needs to be tested:

@Singleton
@Slf4j
@Order(Ordered.LOWEST_PRECEDENCE)
@Requires(property = "kestra.server-type")
public class GracefulEmbeddedServiceShutdownListener implements ApplicationEventListener<ServerShutdownEvent> {
   
   @Inject
   ServiceRegistry serviceRegistry;
   
   @Override
   public void onApplicationEvent(ServerShutdownEvent event) {
       List<LocalServiceState> states = serviceRegistry.all();
       if (states.isEmpty()) {
           return;
       }
       
       List<CompletableFuture<Void>> futures = states.stream()
           .map(state -> CompletableFuture.runAsync(() -> closeService(state), ForkJoinPool.commonPool()))
           .toList();

       // Wait for all services to close, before shutting down the embbeded server
       CompletableFuture.allOf(futures.toArray(new CompletableFuture[0])).join();
   }
   
   private void closeService(LocalServiceState state) {
       final Service service = state.service();
       try {
           service.unwrap().close();
       } catch (Exception e) {
           log.error("[Service id={}, type={}] Unexpected error on close", service.getId(), service.getType(), e);
       }
   }
}

Then, it's OK to keep @PreDestroy on services because the close method must be safe if called twice.

The `@PreDestroy` hook is triggered very late in the lifecycle. Started the graceful clean using the global `ShutdownEvent`.

yuri1969 · 2025-10-13T18:18:28Z

@fhussonnois Thanks, I've tested the ServiceRegistry-based version of yours on both standalone and separated instances.

It does not work as intended when listening to the ServerShutdownEvent event (fired when the EmbeddedServer shuts down) but it does with the ShutdownEvent event (fired prior to starting shutdown sequence).

github-project-automation bot added this to Pull Requests Oct 9, 2025

github-project-automation bot moved this to To review in Pull Requests Oct 9, 2025

MilosPaunovic requested a review from loicmathieu October 10, 2025 06:09

loicmathieu reviewed Oct 10, 2025

View reviewed changes

loicmathieu requested a review from fhussonnois October 10, 2025 07:21

fhussonnois requested changes Oct 10, 2025

View reviewed changes

yuri1969 added 2 commits October 13, 2025 20:01

fix(core): keep web up during graceful shutdown

c9dfc83

The `@PreDestroy` hook is triggered very late in the lifecycle. Started the graceful clean using the global `ShutdownEvent`.

Switch to ServiceRegistry

85b1d83

yuri1969 force-pushed the graceful-shutdown branch from 5216c20 to 85b1d83 Compare October 13, 2025 18:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(core): keep web up during graceful shutdown #11917

fix(core): keep web up during graceful shutdown #11917

Uh oh!

yuri1969 commented Oct 9, 2025

Uh oh!

loicmathieu left a comment

Uh oh!

fhussonnois left a comment

Uh oh!

yuri1969 commented Oct 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix(core): keep web up during graceful shutdown #11917

Are you sure you want to change the base?

fix(core): keep web up during graceful shutdown #11917

Uh oh!

Conversation

yuri1969 commented Oct 9, 2025

What changes are being made and why?

How the changes have been QAed?

Uh oh!

loicmathieu left a comment

Choose a reason for hiding this comment

Uh oh!

fhussonnois left a comment

Choose a reason for hiding this comment

Uh oh!

yuri1969 commented Oct 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants