This project assesses the understanding of fundamental reinforcement learning concepts: Markov decision processes, state-value functions, action-value functions, policies, Bellman equations, policy iteration, and value iteration.
The Hexbot world is made of hexagonal tiles, as shown in Fig. 1. It contains one robot (which we call the Hexbot) and one or more objects. Some tiles in the world are targets and some are hazards. The task is to train the Hexbot to pick up all the objects and place them on target positions while avoiding collisions with other objects and avoiding hazards.
Figure 1: Hexbot environment
The Hexbot is denoted by (R *) in a tile, where * indicates the direction the Hexbot is facing. Hazards are shown with 'x' and targets with 'tgt'. Objects are denoted by letters, with a capital letter marking the center of the respective object.
Warning
There is some stochasticity in the environment. Whenever the robot decides to perform an action, there is a chance that it drifts left or right (but not both) before performing the move. There is also a chance that the action is performed twice.
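To make this concrete, below is a minimal sketch of how such a stochastic outcome could be sampled. The probability values (`DRIFT_CW`, `DRIFT_CCW`, `DOUBLE_MOVE`) and the move names are placeholders for illustration; the actual environment parameters are not specified here.

```python
import random

# Hypothetical probabilities -- real values are environment parameters,
# not given in this description.
DRIFT_CW = 0.1      # chance of drifting clockwise before the move
DRIFT_CCW = 0.1     # chance of drifting counter-clockwise before the move
DOUBLE_MOVE = 0.05  # chance the chosen action is applied twice

def sample_outcome(action):
    """Return the sequence of primitive moves actually executed.

    At most one drift (in a single direction) may precede the intended
    action, and the action itself may additionally be applied twice.
    """
    moves = []
    r = random.random()
    if r < DRIFT_CW:
        moves.append("drift_cw")
    elif r < DRIFT_CW + DRIFT_CCW:
        moves.append("drift_ccw")
    moves.append(action)
    if random.random() < DOUBLE_MOVE:
        moves.append(action)  # action performed a second time
    return moves
```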
Possible robot orientations are shown below

Objects can be of three types.
| Object type | Orientations |
| --- | --- |
| 3-tile object | ![]() |
| 4-tile object | ![]() |
| 5-tile object | ![]() |
We model this problem as a Markov decision process (MDP) and use the value iteration and policy iteration algorithms to find an optimal policy.
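As a reference point, here is a minimal sketch of generic value iteration with greedy policy extraction. It assumes the MDP is exposed through `states`, `actions`, and a `transitions(s, a)` function returning `(prob, next_state, reward)` tuples; these names, along with `gamma` and `epsilon`, are assumptions for illustration, not part of the assignment API.

```python
def value_iteration(states, actions, transitions, gamma=0.99, epsilon=1e-6):
    """Iterate Bellman optimality backups until the value function converges,
    then extract a greedy policy from the converged values."""
    V = {s: 0.0 for s in states}
    while True:
        delta = 0.0
        for s in states:
            # Bellman backup: V(s) = max_a sum_{s'} P(s'|s,a) [r + gamma * V(s')]
            best = max(
                sum(p * (r + gamma * V[s2]) for p, s2, r in transitions(s, a))
                for a in actions
            )
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < epsilon:  # largest update below tolerance: converged
            break
    # Greedy policy: pick the action maximizing the one-step lookahead value
    policy = {
        s: max(
            actions,
            key=lambda a: sum(
                p * (r + gamma * V[s2]) for p, s2, r in transitions(s, a)
            ),
        )
        for s in states
    }
    return V, policy
```

Policy iteration uses the same backup but alternates two phases: evaluating the current policy's value function, then improving the policy greedily against it, repeating until the policy stops changing.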


