feat: add FFT trainer worker class by ShubyM · Pull Request #113 · gke-labs/open-rl

ShubyM · 2026-06-05T17:29:09Z

This PR splits the existing trainer into the shape we need for full fine-tuning. The old trainer was really a LoRA worker because one process owns a base model and serves multiple jobs by creating and switching adapters. Full fine-tuning has a different ownership model where one worker process owns one trainable model for one job. To make that distinction explicit, this introduces a shared BaseTrainerWorker, keeps LoRA-specific adapter management in LoraTrainingWorker, and adds an FFTTrainingWorker for the single-model full fine-tuning path.
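
The ownership split described above can be illustrated with a minimal sketch. Only the three class names come from this PR; the method names (`trainable_target`, `train_step`), the constructor arguments, and the string stand-ins for models are hypothetical, and the real workers presumably hold actual model objects rather than labels.

```python
from abc import ABC, abstractmethod


class BaseTrainerWorker(ABC):
    """Shared orchestration; subclasses decide which weights are trainable."""

    @abstractmethod
    def trainable_target(self, job_id: str) -> str:
        """Return a label for the weights a step for job_id would update."""

    def train_step(self, job_id: str) -> str:
        # Stand-in for the shared forward/backward path (hypothetical).
        return f"step on {self.trainable_target(job_id)}"


class LoraTrainingWorker(BaseTrainerWorker):
    """One process, one base model, one adapter per job."""

    def __init__(self, base_model: str) -> None:
        self.base_model = base_model
        self.adapters: dict[str, str] = {}

    def trainable_target(self, job_id: str) -> str:
        # Create the job's adapter on first use, then switch to it.
        return self.adapters.setdefault(
            job_id, f"{self.base_model}/adapter:{job_id}"
        )


class FFTTrainingWorker(BaseTrainerWorker):
    """One process, one fully trainable model, exactly one job."""

    def __init__(self, model: str, job_id: str) -> None:
        self.model = model
        self.job_id = job_id

    def trainable_target(self, job_id: str) -> str:
        # A full fine-tuning worker refuses jobs it does not own.
        if job_id != self.job_id:
            raise ValueError(
                f"worker is bound to job {self.job_id!r}, got {job_id!r}"
            )
        return self.model
```

In this sketch a `LoraTrainingWorker` accumulates one adapter entry per job it serves, while an `FFTTrainingWorker` raises if asked to train for any job other than the one it was created for.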

Most of the actual training math is shared between the two modes, so this also moves common forward/backward, padding, logprob, batching, and generation code into the base worker. The loss functions are factored into pure tensor operations in losses.py, which makes them easier to test directly and keeps the worker classes focused on model lifecycle and orchestration.
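
To illustrate why pure loss functions are easy to test directly, here is a rough sketch of a masked negative log-likelihood written as a function of its inputs only. It is not the contents of losses.py: the function names are hypothetical, and plain Python lists stand in for tensors so the example runs without a tensor library.

```python
import math
from typing import Sequence


def log_softmax(logits: Sequence[float]) -> list[float]:
    """Numerically stable log-softmax over one position's vocabulary logits."""
    peak = max(logits)
    log_norm = peak + math.log(sum(math.exp(x - peak) for x in logits))
    return [x - log_norm for x in logits]


def masked_nll(
    logits: Sequence[Sequence[float]],
    targets: Sequence[int],
    mask: Sequence[int],
) -> float:
    """Mean negative log-likelihood over positions where mask is 1.

    Positions with mask 0 (for example padding) contribute nothing.
    Returns 0.0 when every position is masked out.
    """
    total = 0.0
    count = 0
    for position_logits, target, keep in zip(logits, targets, mask):
        if keep:
            total -= log_softmax(position_logits)[target]
            count += 1
    return total / count if count else 0.0
```

Because the function touches no model or worker state, a test can call it with a handful of hand-written values, for example checking that uniform logits over two tokens give a loss of `math.log(2)`.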

This PR does not wire full fine-tuning into the API server yet. It just adds the worker split and shared math needed for that follow-up.

droot

This is a great refactor! Thanks!

ShubyM added 3 commits June 5, 2026 00:26

refactor: move trainer and extract losses

2a10a22

refactor: split base and lora trainer workers

3c8a665

feat: add fft trainer worker

00c4f00

ShubyM requested a review from droot June 5, 2026 17:29

droot approved these changes Jun 5, 2026

ShubyM merged commit fed9fd7 into gke-labs:main Jun 5, 2026
11 checks passed

ShubyM mentioned this pull request Jun 8, 2026

feat: FFT snapshot integration #116

Merged

ShubyM merged 3 commits into gke-labs:main from ShubyM:feat/fft-worker
