Staged Experience Mechanism (SEM)

This repository contains the source code for the paper "SEM: Adaptive Staged Experience Access Mechanism for Reinforcement Learning", published at ICTAI 2020.

Getting started

"""
Usage:
    python run.py [options]

Options:
    -h,--help                   Help
    -i,--inference              Inference mode [default: False]
    -a,--algorithm=<name>       Specify training algorithm [default: ppo]
    -c,--config-file=<file>     Specify the hyper-parameter configuration file for the model [default: None]
    -e,--env=<file>             Specify the unity environment name [default: None]
    -p,--port=<n>               Specify port [default: 5005]
    -u,--unity                  Whether to use the unity client [default: False]
    -g,--graphic                Whether to display graphical interface [default: False]
    -n,--name=<name>            Specify the name of this training [default: None]
    -s,--save-frequency=<n>     Specify the frequency for saving model [default: None]
    -m,--models=<n>             How many models to train at the same time [default: 1]
    --store-dir=<file>          Specify the folder path to save the model, log, and data [default: None]
    --seed=<n>                  Specify the random seed of the model [default: 0]
    --max-step=<n>              Maximum time step per episode [default: None]
    --max-episode=<n>           Total training episodes [default: None]
    --sampler=<file>            Specify the file path for the random sampler for Unity [default: None]
    --load=<name>               Specify the name of the training to load the model [default: None]
    --fill-in                   Specify whether to pre-populate the experience pool to batch_size [default: False]
    --prefill-choose            Specify whether to choose actions while pre-populating the experience pool [default: False]
    --gym                       Whether to use a gym training environment [default: False]
    --gym-agents=<n>            Specify the number of gym agents to train in parallel [default: 1]
    --gym-env=<name>            Specify the name of the gym environment [default: CartPole-v0]
    --gym-env-seed=<n>          Specify random seed for gym environment [default: 0]
    --render-episode=<n>        Specify when the gym environment starts rendering [default: None]
    --info=<str>                Write a description of the training, wrapped in double quotation marks [default: None]
    --sem                       Whether to use SEM or not [default: False]
Example:
    python run.py --gym --gym-env Hopper-v2 -a td3 -n test --seed 0
"""

Usage

Train with SEM:

python run.py --gym --gym-env [env_id] -a [algo_name] -n [training_name] --sem

Run inference with a trained policy:

python run.py --gym --gym-env [env_id] -a [algo_name] -n [training_name] -i
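
When launching several runs from a script, the two commands above can be assembled programmatically. The following is a small sketch under stated assumptions: the helper `build_run_command` is hypothetical and not part of this repository, and it only uses flags that appear in the commands above.

```python
import shlex
from typing import List


def build_run_command(env_id: str, algo_name: str, training_name: str,
                      sem: bool = False, inference: bool = False) -> List[str]:
    """Hypothetical helper: build the argument list for run.py."""
    cmd = ["python", "run.py", "--gym", "--gym-env", env_id,
           "-a", algo_name, "-n", training_name]
    if sem:
        cmd.append("--sem")  # train with SEM enabled
    if inference:
        cmd.append("-i")     # inference mode
    return cmd


# Training with SEM, using the environment and algorithm from the example above.
train_cmd = build_run_command("Hopper-v2", "td3", "test", sem=True)
# Inference for the same training name.
infer_cmd = build_run_command("Hopper-v2", "td3", "test", inference=True)

print(shlex.join(train_cmd))
print(shlex.join(infer_cmd))
```

The resulting lists can be passed to `subprocess.run` from the repository root. Whether `--sem` is also needed at inference time is not stated here, so the helper leaves that choice to the caller.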
