feat(server): deduplicate library and metadata jobs #15955

Open · wants to merge 1 commit into main
Conversation

@etnoy (Contributor) commented Feb 7, 2025

This adds job IDs to several jobs, which reduces duplicated work. For example, a large library refresh might run for so long that a cron job kicks in and starts refreshing the library again before the first refresh finishes. This PR prevents that from happening.

I've also added job IDs to metadata extraction and thumbnail generation, so that, for instance, we don't queue thumbnail generation for the same asset more than once.
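For context, the queues are backed by BullMQ, where adding a job with an explicit jobId is idempotent: a second add with the same id is ignored while the first job is still waiting, delayed, or active. A rough sketch of the idea (the queue and job names below are illustrative, not the exact ones in this PR):

```ts
import { Queue } from 'bullmq';

const queue = new Queue('library', { connection: { host: 'localhost', port: 6379 } });

// Deduplicate a library refresh: a second refresh request for the same library
// is a no-op while the first one is still waiting, delayed, or active.
export const queueLibraryRefresh = (libraryId: string) =>
  queue.add('library-refresh', { id: libraryId }, { jobId: `library-refresh-${libraryId}` });

// The same idea applied per asset, e.g. thumbnail generation, so the same
// asset is never queued twice, at the cost of one jobId per asset.
export const queueThumbnail = (assetId: string) =>
  queue.add('generate-thumbnail', { id: assetId }, { jobId: `thumbnail-${assetId}` });
```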

@danieldietzler (Member) left a comment

Code LGTM

@etnoy force-pushed the feat/job-ids branch 2 times, most recently from f8455c6 to 24e58be on February 7, 2025 at 22:41
@mertalev (Contributor) commented Feb 8, 2025

Can you set a job ID only for the queue job and not for the asset-level ones? It should be essentially the same result but much lower impact on queueing behavior.
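In other words, only the fan-out trigger would carry a jobId, while the per-asset jobs stay as they are today. A rough sketch of that variant (illustrative names, not the project's actual job repository API):

```ts
import { Queue } from 'bullmq';

const queue = new Queue('library', { connection: { host: 'localhost', port: 6379 } });

// Only the queue-level trigger is deduplicated via jobId...
export const queueThumbnailGenerationAll = () =>
  queue.add('queue-thumbnails', {}, { jobId: 'queue-thumbnails' });

// ...while the per-asset jobs are added in bulk without jobIds, as before.
export const handleQueueThumbnails = async (assetIds: string[]) => {
  await queue.addBulk(assetIds.map((id) => ({ name: 'generate-thumbnail', data: { id } })));
};
```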

@etnoy (Contributor, Author) commented Feb 9, 2025

> Can you set a job ID only for the queue job and not for the asset-level ones? It should be essentially the same result but much lower impact on queueing behavior.

The actual work-doing jobs, thumbnail generation and metadata extraction, are the ones that spend the vast majority of their time in the job queue for large libraries. The queueing jobs are negligible by comparison, and having them only check whether a queueing job already exists makes no difference compared to what we already have today.

After thinking about this some more, I'm inclined to think the performance penalty of individual job IDs is worth it. In my current round of testing I've imported 500k assets, and after two days I still have 350k thumbnails left to generate, and that's on a powerful Xeon server with 32 cores. Had I not disabled cron library scanning, it would have queued the thumbnail refresh several times over by now.

@mertalev (Contributor) commented Feb 10, 2025

I'm really not in favor of solving this by multiplying the number of requests by 1000x or even 10000x. Maybe you can check at the start of the queue job if there are any library jobs and abort? This doesn't necessarily need to be solved with job IDs.
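That alternative could look roughly like the sketch below, assuming direct access to the BullMQ queue (Immich wraps its queues behind its own job repository, so these names are illustrative):

```ts
import { Queue } from 'bullmq';

const libraryQueue = new Queue('library', { connection: { host: 'localhost', port: 6379 } });

// At the start of the cron-triggered queue job: if library work from a previous
// scan is still pending or running, abort instead of queueing duplicates.
export const handleQueueAllLibraryRefresh = async () => {
  const counts = await libraryQueue.getJobCounts('active', 'waiting', 'delayed');
  const outstanding = counts.active + counts.waiting + counts.delayed;

  // The queue job itself counts as one active job, so anything above 1 means
  // an earlier scan has not finished yet.
  if (outstanding > 1) {
    return;
  }

  // ...fan out the per-library refresh jobs as usual...
};
```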
