🐛 Bug
Many algorithms built on OpenAI gym environments expect the environment to define the `action_space` and `observation_space` attributes; this is notably the case for ray's rllib's single-agent algorithms such as AlphaZero.
See for instance ray's rllib's AlphaZero implementation, which makes use of these attributes.
The current implementation of scikit-decide's rllib wrapper provides only a multi-agent environment wrapper, `AsRLlibMultiAgentEnv`, which does not define the `action_space` and `observation_space` attributes (which is fine for rllib's multi-agent environments). Scikit-decide's rllib wrapper should therefore additionally provide a single-agent environment wrapper defining those attributes, for use with algorithms like rllib's AlphaZero.
To Reproduce
Define a scikit-decide RL domain and pass it to ray's rllib's AlphaZero algorithm.
The following exception is thrown when solving the domain:
```
AttributeError: 'AsRLlibMultiAgentEnv' object has no attribute 'action_space'
```
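The failure mode can be sketched in a self-contained way with stub classes (the class below stands in for scikit-decide's `AsRLlibMultiAgentEnv`; its internals are illustrative, not the real implementation): rllib's single-agent algorithms read `env.action_space` directly at setup time, so a wrapper that only exposes per-agent spaces raises `AttributeError`.

```python
class AsRLlibMultiAgentEnvStub:
    """Illustrative stand-in for scikit-decide's AsRLlibMultiAgentEnv:
    multi-agent wrappers keep spaces keyed by agent id and never set
    top-level action_space / observation_space attributes."""

    def __init__(self):
        # Per-agent spaces only (placeholder values, not real gym spaces):
        self._action_spaces = {"agent_0": "Discrete(4)"}
        self._observation_spaces = {"agent_0": "Box(0, 1, (8,))"}


def single_agent_setup(env):
    """Mimics what a single-agent rllib algorithm (e.g. AlphaZero) does:
    it looks up env.action_space directly during setup."""
    return env.action_space  # AttributeError on a multi-agent-only wrapper


try:
    single_agent_setup(AsRLlibMultiAgentEnvStub())
except AttributeError as e:
    print(e)
```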
Expected behavior
No exception is thrown, because an environment wrapper such as `AsRLlibSingleAgentEnv` (to be defined) would define the `action_space` and `observation_space` attributes.
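A minimal sketch of what such a wrapper could look like, assuming a hypothetical `AsRLlibSingleAgentEnv` name and a stub domain in place of scikit-decide's actual domain API: the wrapper promotes the domain's spaces to the top-level attributes that rllib's single-agent algorithms expect.

```python
class DomainStub:
    """Illustrative stand-in for a single-agent scikit-decide RL domain."""

    def get_action_space(self):
        return "Discrete(4)"       # placeholder for a real gym space

    def get_observation_space(self):
        return "Box(0, 1, (8,))"   # placeholder for a real gym space


class AsRLlibSingleAgentEnv:
    """Hypothetical single-agent wrapper: unlike the multi-agent wrapper,
    it defines action_space / observation_space as top-level attributes."""

    def __init__(self, domain):
        self._domain = domain
        # The attributes rllib's single-agent algorithms look up:
        self.action_space = domain.get_action_space()
        self.observation_space = domain.get_observation_space()


env = AsRLlibSingleAgentEnv(DomainStub())
print(env.action_space)
print(env.observation_space)
```

With such a wrapper, the `env.action_space` lookup performed by single-agent algorithms like AlphaZero succeeds instead of raising `AttributeError`.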