We have online hard keypoint mining (OHKM), which increases the loss weight on a per-node basis to encourage the optimization to focus on "hard" nodes even when the overall loss is low.
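For context, here is a minimal sketch of the kind of per-node reweighting OHKM does. This is illustrative only; the function name, arguments, and the top-k weighting scheme are assumptions, not SLEAP's actual implementation:

```python
import tensorflow as tf

def ohkm_weighted_loss(per_node_loss, hard_fraction=0.5, hard_weight=2.0):
    # per_node_loss: (batch, n_nodes) unweighted loss per node (keypoint).
    n_nodes = tf.shape(per_node_loss)[-1]
    k = tf.maximum(1, tf.cast(tf.cast(n_nodes, tf.float32) * hard_fraction, tf.int32))

    # Pick the k nodes with the highest loss in each example ("hard" nodes).
    _, hard_inds = tf.math.top_k(per_node_loss, k=k)

    # Upweight the hard nodes; every other node keeps weight 1.0.
    hard_mask = tf.reduce_sum(tf.one_hot(hard_inds, depth=n_nodes), axis=-2)
    weights = 1.0 + (hard_weight - 1.0) * hard_mask

    return tf.reduce_mean(weights * per_node_loss)
```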
Turning this on early in training often leads to instabilities, but turning it on in a second training run initialized with the weights from the first run tends to work well.
It would be great to have a "second phase" of training in which OHKM is enabled and the learning rate is reset after the first run converges.
It might be easier to set this up as a second training run that runs in sequence, at the cost of some orchestration complexity and re-initialization overhead. This is how multi-model training runs already work (e.g., centroid -> centered instance).
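A rough sketch of what that sequential option could look like (the config keys and the `run_training()` helper below are placeholders, not SLEAP's actual API):

```python
import copy

def run_two_phase_sequence(base_config, run_training):
    """Run two training jobs back to back: plain training, then OHKM fine-tuning.

    `run_training(config)` is a stand-in for however a single job is launched,
    and is assumed to return the path to the best checkpoint it produced.
    """
    # Phase 1: ordinary training with OHKM disabled.
    phase1 = copy.deepcopy(base_config)
    phase1["ohkm"] = False
    checkpoint = run_training(phase1)

    # Phase 2: a fresh job initialized from the phase 1 weights, with OHKM on.
    # Because it is a fresh job, the learning rate schedule restarts on its own.
    phase2 = copy.deepcopy(base_config)
    phase2["ohkm"] = True
    phase2["initial_weights"] = checkpoint
    return run_training(phase2)
```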
Alternatively, handling the restarting logic internally as part of the same training run would be cleaner on the frontend, but might require a soft layer of orchestration (above Trainer, but in the same process).
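And a sketch of the in-process alternative: a thin orchestration layer sitting above Trainer. The factory, attributes, and method names used here are assumptions for illustration, not the existing Trainer API:

```python
class TwoPhaseOrchestrator:
    """Soft orchestration layer: same process, two Trainer phases.

    Assumes a Trainer-like object that builds a Keras model at construction
    and exposes train(); the exact names are hypothetical.
    """

    def __init__(self, make_trainer, base_config, ohkm_config):
        self.make_trainer = make_trainer  # factory: config -> Trainer
        self.base_config = base_config    # phase 1: OHKM off
        self.ohkm_config = ohkm_config    # phase 2: same config, OHKM on

    def run(self):
        # Phase 1: converge without OHKM.
        trainer1 = self.make_trainer(self.base_config)
        trainer1.train()
        weights = trainer1.model.get_weights()

        # Phase 2: rebuild the trainer (which resets the optimizer and the
        # learning rate schedule), load phase 1 weights, continue with OHKM.
        trainer2 = self.make_trainer(self.ohkm_config)
        trainer2.model.set_weights(weights)
        trainer2.train()
        return trainer2
```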
Idea credit: @olinesn