Skip to content

Conversation

@pkooij
Copy link
Member

@pkooij pkooij commented Dec 5, 2025

No description provided.

pkooij and others added 11 commits December 1, 2025 11:31
* Add next observation loading for RA-BC progress deltas

* Compute weights based on temporal progress deltas instead of static rewards

* Add hard-masking for negative progress deltas in weight computation
* Add dual dense sparse head and annotation

* Add docs

* add dual to procesor

* cleanup

* change sampling in visualize and cleanup

* remove validation

* remove compile

* Feat/test uniform (#2587)

* test uniform

* add different string for misaligned
@pkooij pkooij changed the title Add SARM Add SARM (Reward + RA-BC) Dec 5, 2025
@pkooij pkooij self-assigned this Dec 5, 2025
@pkooij pkooij added the policies Items related to robot policies label Dec 5, 2025
@pkooij pkooij assigned pkooij and unassigned pkooij Dec 5, 2025
@pkooij pkooij changed the title Add SARM (Reward + RA-BC) Add SARM (Annotate + Reward + RA-BC) Dec 5, 2025
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@xianglunkai
Copy link

@pkooij
Very great work!
similiar to Pi0.6* ideas for improving VLA performance

pkooij and others added 7 commits December 6, 2025 09:17
* update rabc implementation

* compute rabc beforehand

* fix import

* add only progress calulation

* use precomputed progress

* multi gpu processing

* import

* fix dataset meta data extraction

* add logging

* logging

* log

* progress per episode

* split differently

* move clip to gpu

* pre decode frames for an episode

* fix cuda initalization

* fix import

* multi processing

* rename

* fix import

* fix

* fix rabc

* use last known progress if oob

* use last known progress if oob
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

policies Items related to robot policies

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants