You are on page 1of 80

Locally Weighted Regression

Introduction to Reinforcement Learning


Scope of Reinforcement Learning
Q Learning

• Q-learning is a machine learning approach that enables a model to


iteratively learn and improve over time by taking the correct action.
Q-learning is a type of reinforcement learning. With reinforcement
learning, a machine learning model is trained to mimic the way
animals or children learn.
Q Learning: (Q-Learning Algorithm)
. Convergence
Will the Q Learning Algorithm converge toward a Q equal to the true Q
function?
Yes, under certain conditions.

i. Assume the system is a deterministic MDP(Markov decision process)

ii. Assume the immediate reward values are bounded; that is, there exists some

positive constant c such that for all states s and actions a, | r(s, a)| < c

iii. Assume the agent selects actions in such a fashion that it visits every
possible state-action pair infinitely often
Table 5.1. Values of ZN for two-sided N% confidence intervals
5.3.6. Two-Sided and One-Sided Bounds
Expectation maximization algorithm

You might also like