DaJax provides robust tools for collecting and managing data from reinforcement learning environments.
Tools to help you, or your LLM of choice, write better and more helpful data collection scripts:
*Hopper rollouts for 1.1 episodes at various policy checkpoints, starting from a randomly initialized policy.*
To get policy rollouts using an Actor-Critic network:
- `collect_brax.py` for environments based on the Brax Physics Engine
- `collect_discrete.py` for a discrete (`Categorical`) action space on environments using Gymnax
- `collect_continuous.py` for a continuous (`MultivariateNormalDiag`) action space on environments using Gymnax
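
For orientation, the snippet below sketches the kind of actor-critic module with a discrete `Categorical` head that such a collector can evaluate. It uses Flax and Distrax; the class name and layer sizes are illustrative assumptions, not DaJax's actual network definition.

```python
import distrax
import flax.linen as nn
import jax.numpy as jnp


class ActorCritic(nn.Module):
    """Illustrative actor-critic with a Categorical policy head (not DaJax's exact model)."""

    action_dim: int

    @nn.compact
    def __call__(self, obs):
        # Shared trunk; layer sizes are arbitrary for this sketch.
        x = nn.tanh(nn.Dense(64)(obs))
        x = nn.tanh(nn.Dense(64)(x))
        # Actor head: logits parameterize a Categorical distribution over actions.
        logits = nn.Dense(self.action_dim)(x)
        pi = distrax.Categorical(logits=logits)
        # Critic head: scalar value estimate for the observation.
        value = nn.Dense(1)(x)
        return pi, jnp.squeeze(value, axis=-1)
```

For the continuous case, the `Categorical` head would be replaced by a `MultivariateNormalDiag` parameterized by a mean and a (log-)standard deviation per action dimension.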
The collectors leverage utility functions for:
- Efficient JAX-based data buffering and storage
- Vectorized environment stepping
- Batched policy evaluation
This modular design allows for flexible data collection while maintaining JAX's performance benefits and functional programming paradigm.
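
To make the vectorized stepping and batched evaluation concrete, here is a minimal sketch using the standard Gymnax API and `jax.vmap`. The environment name, batch size, and random action sampling are placeholders; this is not DaJax's internal code.

```python
import jax
import gymnax

# Any Gymnax environment works; CartPole-v1 is just an example here.
env, env_params = gymnax.make("CartPole-v1")

num_envs = 8
rng = jax.random.PRNGKey(0)
rng, key_reset, key_act, key_step = jax.random.split(rng, 4)

# Vectorize reset and step over a batch of PRNG keys.
batch_reset = jax.vmap(env.reset, in_axes=(0, None))
batch_step = jax.vmap(env.step, in_axes=(0, 0, 0, None))

obs, state = batch_reset(jax.random.split(key_reset, num_envs), env_params)

# Batched (here: random) action sampling stands in for batched policy evaluation.
sample_action = jax.vmap(env.action_space(env_params).sample)
actions = sample_action(jax.random.split(key_act, num_envs))

next_obs, state, reward, done, info = batch_step(
    jax.random.split(key_step, num_envs), state, actions, env_params
)
```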
Episodes associated with a particular checkpoint during policy training are stored in the same file, along with a corresponding image so you can check that the terminations, observations, and actions are aligned.
The scripts in the main directory save their results as:

- CSV files - each row is a tuple of:
  - $a_t$: the action at time $t$
  - $o_t$: the observation at time $t$
  - $o_{t+1}$: the next observation
  - $\text{Done}$: the terminal state flag
  - $r_{t+1}$: the reward received
- Verification Media - media to check that the collected data makes sense
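
Assuming the CSV columns follow the tuple above, a transition file can be loaded roughly as follows. The filename and column names here are hypothetical, so check the header of the file your collector actually writes.

```python
import numpy as np
import pandas as pd

# NOTE: column names below are hypothetical; inspect the header of the
# CSV produced by the collector script you ran.
df = pd.read_csv("rollouts.csv")

actions = df.filter(regex=r"^action").to_numpy()       # a_t
obs = df.filter(regex=r"^obs_").to_numpy()             # o_t (one column per dimension)
next_obs = df.filter(regex=r"^next_obs_").to_numpy()   # o_{t+1}
dones = df["done"].to_numpy(dtype=bool)                # terminal-state flags
rewards = df["reward"].to_numpy()                      # r_{t+1}

# Basic sanity checks, mirroring what the verification media is for.
assert len(obs) == len(actions) == len(next_obs) == len(rewards) == len(dones)
print(f"{len(df)} transitions, {int(dones.sum())} completed episodes")
```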
You can install DaJax and its dependencies using pip:

```bash
pip install -r setup/requirements-base.txt
# then add the requirements matching your hardware:
pip install -r setup/docker/requirements-cuda.txt   # CUDA GPU
pip install -r setup/docker/requirements-cpu.txt    # CPU only
```
- Build the Docker container with the provided script:

  ```bash
  cd setup/docker && ./build.sh
  ```

- Add your WandB key to the `setup/docker` folder:

  ```bash
  echo <wandb_key> > setup/docker/wandb_key
  ```

  👼 Just add a `wandb_key` file without any extension containing your WandB API key; the `.gitignore` is set up to ignore it and keep your key and your data private.

A conda environment spec is also provided:

```bash
conda env create -f setup/environment.yml
```
- Dataset Format Conversions (see the sketch after this list):
  - D4RL format conversion utilities
  - Minari format conversion support
  - One-step dynamics training data format
- Would be nice:
  - Documentation for format conversion workflows
  - Weights and Biases Integration
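
As a rough illustration of what a format conversion could look like (not DaJax's actual conversion utilities), the sketch below packs transition arrays into the flat dictionary-of-arrays layout used by D4RL-style offline RL datasets. The function name, dtypes, and dummy shapes are assumptions.

```python
import numpy as np


def to_d4rl_style(obs, actions, rewards, next_obs, dones):
    """Pack transition arrays into a D4RL-style dict of arrays (illustrative sketch)."""
    return {
        "observations": np.asarray(obs, dtype=np.float32),
        "actions": np.asarray(actions, dtype=np.float32),
        "rewards": np.asarray(rewards, dtype=np.float32),
        "next_observations": np.asarray(next_obs, dtype=np.float32),
        "terminals": np.asarray(dones, dtype=bool),
    }


# Example with dummy data shaped like the CSV tuples described above.
T, obs_dim, act_dim = 100, 11, 3
dataset = to_d4rl_style(
    obs=np.zeros((T, obs_dim)),
    actions=np.zeros((T, act_dim)),
    rewards=np.zeros(T),
    next_obs=np.zeros((T, obs_dim)),
    dones=np.zeros(T, dtype=bool),
)
np.savez("rollouts_d4rl_style.npz", **dataset)
```

A Minari conversion would follow the same idea, but write the transitions through Minari's own dataset-creation API instead of a raw dict.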
😬 I will reorganize soon once I receive more feedback on how people prefer to use these tools.
These tools are based on the following:
💪 Gymnax
If you use DaJax in your research, please cite:
```bibtex
@software{dajax2024,
  author      = {Uljad Berdica},
  title       = {DaJax: Data Collection in the JAX ecosystem},
  year        = {2024},
  publisher   = {GitHub},
  url         = {https://github.com/rodrigodelazcano/DaJax},
  description = {A JAX-based library for collecting and managing reinforcement learning datasets}
}
```