✨ Add RL truncation #678

@flowerthrower

Description

Currently, the agent can get stuck in extremely long loops of intermediate states that do not improve the trajectory. Similarly, some heuristic passes do not always find a solution, or they may time out, which blocks training.

Add a customizable step limit, and truncate the episode when a pass fails or times out, to speed up training.
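A minimal sketch of what this could look like, not the project's actual environment. It assumes a Gymnasium-style `step` that returns `(state, reward, terminated, truncated, info)`. The names `TruncatingEpisode`, `max_steps`, and the pass convention (return `None` on failure, raise `TimeoutError` on timeout) are hypothetical, and the real reward and termination logic are left out.

```python
from dataclasses import dataclass
from typing import Any, Callable, Optional

# Hypothetical pass signature: takes a state and returns a new state,
# returns None if no solution was found, or raises TimeoutError.
Pass = Callable[[Any], Optional[Any]]


@dataclass
class TruncatingEpisode:
    """Illustrative episode loop with a step limit and failed-pass truncation."""

    passes: list[Pass]
    initial_state: Any
    max_steps: int = 100  # customizable step limit (assumed name)

    def __post_init__(self) -> None:
        self.reset()

    def reset(self) -> Any:
        self.state = self.initial_state
        self.steps = 0
        return self.state

    def step(self, action: int):
        """Apply one pass; return (state, reward, terminated, truncated, info)."""
        self.steps += 1
        info: dict[str, str] = {}
        try:
            new_state = self.passes[action](self.state)
        except TimeoutError:
            new_state = None
            info["reason"] = "pass_timeout"
        else:
            if new_state is None:
                info["reason"] = "pass_failed"

        if new_state is None:
            # Failed or timed-out pass: truncate instead of blocking training.
            return self.state, 0.0, False, True, info

        self.state = new_state
        if self.steps >= self.max_steps:
            # Step limit reached without the episode ending on its own.
            info["reason"] = "step_limit"
            return self.state, 0.0, False, True, info

        # Reward and real termination are omitted in this sketch.
        return self.state, 0.0, False, False, info
```

Reporting these cases as `truncated` rather than `terminated` keeps the distinction between an episode that was cut off and one that reached a genuine final state, which matters for algorithms that bootstrap from the value of the last state.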
