Skip to content

Final Project for Columbia University: EECSE6892 TOPICS in Information Processing- Reinforcement Learning, Spring 2025

Notifications You must be signed in to change notification settings

onurcalisir27/RL_Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Waypoint-Based Reinforcement Learning for Robot Manipulation

This repository implements a waypoint-based reinforcement learning approach, a Soft Actor-Critic (SAC) baseline, as described in the paper "Waypoint-Based Reinforcement Learning for Robot Manipulation Tasks" by Mehta et al. The implementation uses TensorFlow and is designed to work with the robosuite simulation environment.

Here's a demonstration of the robot performing the "pick and place" task:

Robot Task

Usage

Training

  1. Configure the task and method: Edit cfg/config.yaml to set the task, method, and parameters. For example, to train the waypoint-based method on the Lift task:

    env_name: Lift
    object: ''
    num_wp: 2
    run_name: 'test'
    n_inits: 5
    render: False
    train: True
    test: False
    method: 'waypoint'

    For SAC:

    method: 'sac'
    train: True
    test: False
  2. Run the training:

    python main.py

    This will train the specified method, saving models and training data to models/{env}/{run_name}/ or models/{env}/{object}/{run_name}/.

  3. Monitor training: Training progress is logged to TensorBoard. View the logs using:

    tensorboard --logdir runs/

    Open the provided URL (usually http://localhost:6006) in a browser.

Evaluation

  1. Configure evaluation: Update cfg/config.yaml to enable evaluation:

    train: False
    test: True
    method: 'waypoint'  # or 'sac'
  2. Run the evaluation:

    python main.py

    This will evaluate the trained model, saving results to models/{env}/{run_name}/eval_data.pkl (waypoint) or eval_reward.pkl (SAC).

About

Final Project for Columbia University: EECSE6892 TOPICS in Information Processing- Reinforcement Learning, Spring 2025

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  

Languages