GitHub - BUAA-SIC-LAB/GART-MAPPO

This project implements a Multi-Agent Reinforcement Learning (MARL) solution for drone formation control and gate traversal tasks. It is built upon the Omnidrones framework, leveraging NVIDIA Isaac Sim for high-fidelity physics simulation.

The core contribution is the GRAT-MAPPO (Graph Recurrent Attention Network - Multi-Agent Proximal Policy Optimization) algorithm, designed to enable robust formation control and coordinated maneuver through complex environments.

Note: The project code is currently being organized and will be gradually improved. There may be runtime errors, please fix them yourself.

Overview

Framework: Omnidrones (PyTorch + Isaac Sim)
Algorithm: mappo_graph_attention (MAPPO with Graph Attention and Recurrent units)
Task: FormationGateTraversal (Drones navigating gates while maintaining formation)

Prerequisites

This project requires:

NVIDIA Isaac Sim: Compatible version as required by Omnidrones.
Python: 3.8+
Omnidrones: The base framework code is included in this repository.

Usage

To start training the drone formation policy, run the following command from the project root:

python -u train.py task=FormationGateTraversal algo=mappo_graph_attention

Common Arguments

headless=true: Run simulation without the GUI (useful for remote servers).
wandb.mode=disabled: Disable WandB logging if not needed.
sim.num_envs=...: Set the number of parallel environments.

Example:

python -u train.py task=FormationGateTraversal algo=mappo_graph_attention headless=true

Task Description: FormationGateTraversal

The FormationGateTraversal task challenges a team of drones to fly through a series of gates while maintaining a specific geometric formation.

Objective: Navigate through gates without collision while keeping formation.
Formations: Defined in omni_drones/envs/formation_gate_traversal.py, supports various shapes like Tight, Wide, V-formation, Wedge, etc.
Reward: Based on progress, formation maintenance, alignment, and avoiding collisions.

Algorithm: GRAT-MAPPO

GRAT-MAPPO extends the standard MAPPO algorithm by incorporating:

Graph Neural Networks (GNN): To model the interaction between agents (drones). Each drone is a node, and edges represent communication or proximity.
Attention Mechanism: To dynamically weight the importance of neighboring drones' information.
Recurrent Units (GRU): To handle partial observability and remember past states/trajectories.

Key configuration parameters for the algorithm can be found in cfg/algo/mappo_graph_attention.yaml, such as:

gnn_layers: Number of graph propagation layers.
gnn_heads: Number of attention heads.
seq_len: Sequence length for RNN training.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
cfg		cfg
omni_drones		omni_drones
.DS_Store		.DS_Store
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py
train.py		train.py
train.yaml		train.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Prerequisites

Usage

Common Arguments

Task Description: FormationGateTraversal

Algorithm: GRAT-MAPPO

License

About

Uh oh!

Releases

Packages

Languages

License

BUAA-SIC-LAB/GART-MAPPO

Folders and files

Latest commit

History

Repository files navigation

Overview

Prerequisites

Usage

Common Arguments

Task Description: FormationGateTraversal

Algorithm: GRAT-MAPPO

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages