-
Notifications
You must be signed in to change notification settings - Fork 3
Reinforcement Learning
Jhalak Patel edited this page Sep 20, 2017
·
4 revisions
Reinforcement learning - optimal control, where agent learns to develop an optimal policy of sequential actions to take by interacting with an environment. There are various branches in RL - temporal difference, Monte Carlo and dynamic programming.
Deep Learning is more of complex function approximation for image recognition (CNN), speech (RNN) as well dimension reduction and deep network pretraining (unsupervised).
How deep learning and reinforcement learning combines: eg. deep Q learning, Google DeepMind Atari. Deep neural network is used to approximate the Q function in Q-learning (part of temporal difference learning)