Skip to content

🪶iterated Shared Deep Q-Network [ICLR 26], a new algorithm improving the sample-efficiency of target-free algorithms (e.g. DQN IMPALA) to bridge the gap with target-based algorithms🪶

License

Notifications You must be signed in to change notification settings

theovincent/iS-DQN

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

131 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Implementation of iterated Shared Deep Q-Network (iS-DQN)

custom_badge custom_badge

User installation

We recommend using Python 3.11.5. In the folder where the code is, create a Python virtual environment, activate it, update pip and install the package and its dependencies in editable mode:

python3 -m venv env
source env/bin/activate
pip install --upgrade pip setuptools wheel
pip install -e .[dev,gpu]

To verify the installation, run the tests as:pytest

Running experiments

The script launch_job/atari/launch.sh trains an iS-DQN (K=9) agent with the CNN architecture and LayerNorm on a local machine, on the game Asterix.

About

🪶iterated Shared Deep Q-Network [ICLR 26], a new algorithm improving the sample-efficiency of target-free algorithms (e.g. DQN IMPALA) to bridge the gap with target-based algorithms🪶

Topics

Resources

License

Stars

Watchers

Forks

Contributors 3

  •  
  •  
  •