feat: add script for evaluating a trained agent on a different environment #99

fabioseel · 2025-12-03T12:38:39Z

This enables us to see the performance of eg an agent trained on cifar10 on the test split of the data!

Allows to run a trained agent in an evironment that can be specified to test how well it survives in that.
Usage: python -m runner.scripts.test_agent {path_to_experiment} {env_name} {num_repeats}
Stores the results as a list in data/analyses/survival_durations_{env_name}.csv

(this is equivalent to the description in the file :))

…nment

fabioseel added 2 commits December 2, 2025 17:08

feat: add script for evaluating a trained agent on a different enviro…

1aa8e63

…nment

add documentation

0817117

fabioseel requested a review from alex404 December 3, 2025 12:38

harini-sudha merged commit 89e0f6a into master Dec 15, 2025
5 checks passed

harini-sudha deleted the feat-test_eval branch December 15, 2025 16:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add script for evaluating a trained agent on a different environment #99

feat: add script for evaluating a trained agent on a different environment #99

Uh oh!

fabioseel commented Dec 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat: add script for evaluating a trained agent on a different environment #99

feat: add script for evaluating a trained agent on a different environment #99

Uh oh!

Conversation

fabioseel commented Dec 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fabioseel commented Dec 3, 2025 •

edited

Loading