sync latest best policy training code #537

Merged
codekansas merged 17 commits into master from kbot-joystick-dev-bart
Oct 29, 2025

Conversation

b-vm (Contributor) commented Oct 15, 2025

No description provided.

b-vm changed the title from "sync latest best policy training code" to "[wip still adding stuff] sync latest best policy training code" on Oct 15, 2025

b-vm (Contributor, Author) commented Oct 15, 2025

@codekansas I think this adds everything. It does not run in its current state.

b-vm (Contributor, Author) commented Oct 16, 2025

The mirror losses are not currently used.

codekansas force-pushed the kbot-joystick-dev-bart branch from e56a771 to 792bbeb on October 27, 2025 23:59
codekansas force-pushed the kbot-joystick-dev-bart branch from 792bbeb to ea1ad0a on October 28, 2025 00:00

codekansas (Member) commented Oct 28, 2025

Fixed the kscale API... I think we can just use that instead.

Also, I think distrax MultivariateNormalDiag is basically the same as the xax normal distribution implementation.
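
For anyone checking the equivalence claim above: a multivariate normal with a diagonal covariance has the same log-density as a product of independent per-dimension normals, which is presumably why the two implementations would be interchangeable for a policy's action distribution. The sketch below uses only the Python standard library; it does not call distrax or xax, and the function names and example numbers are hypothetical, chosen just to illustrate the identity.

```python
import math


def normal_log_prob(x, loc, scale):
    """Log-density of a univariate normal N(loc, scale^2) at x."""
    z = (x - loc) / scale
    return -0.5 * z * z - math.log(scale) - 0.5 * math.log(2.0 * math.pi)


def independent_normal_log_prob(x, loc, scale):
    """Sum of per-dimension univariate normal log-densities."""
    return sum(normal_log_prob(xi, mi, si) for xi, mi, si in zip(x, loc, scale))


def diag_mvn_log_prob(x, loc, scale_diag):
    """Log-density of a multivariate normal with diagonal covariance,
    written in joint form: quadratic term plus log-determinant."""
    k = len(x)
    quad = sum(((xi - mi) / si) ** 2 for xi, mi, si in zip(x, loc, scale_diag))
    log_det = 2.0 * sum(math.log(si) for si in scale_diag)  # log|Sigma|
    return -0.5 * (quad + log_det + k * math.log(2.0 * math.pi))


# Hypothetical 3-dimensional action, mean, and standard deviation.
action = [0.3, -1.2, 0.7]
mean = [0.0, -1.0, 1.0]
std = [0.5, 0.2, 1.5]

joint = diag_mvn_log_prob(action, mean, std)
factored = independent_normal_log_prob(action, mean, std)
print(math.isclose(joint, factored, rel_tol=1e-9))
```

This only checks the log-density. Whether sampling, entropy, and any clipping or squashing behave identically in the two libraries would still need to be confirmed against their actual implementations.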

codekansas force-pushed the kbot-joystick-dev-bart branch from aa92a19 to f0bfb4a on October 28, 2025 07:47
codekansas merged commit 52265b5 into master on Oct 29, 2025
2 checks passed
codekansas deleted the kbot-joystick-dev-bart branch on October 29, 2025 07:48

2 participants