In this course, you will learn about several algorithms that can learn near optimal policies based on trial and error interaction with the environment---learning from the agent’s own experience. Learning from actual experience is striking because it requires no prior knowledge of the environment’s dynamics, yet can still attain optimal behavior. We will cover intuitively simple but powerful Monte Carlo methods, and temporal difference learning methods including Q-learning. We will wrap up this course investigating how we can get the best of both worlds: algorithms that can combine model-based planning (similar to dynamic programming) and temporal difference updates to radically accelerate learning.
This course is part of the Reinforcement Learning Specialization
Offered By
About this Course
Probabilities & Expectations, basic linear algebra, basic calculus, Python 3.0 (at least 1 year), implementing algorithms from pseudocode
Skills you will gain
- Artificial Intelligence (AI)
- Machine Learning
- Reinforcement Learning
- Function Approximation
- Intelligent Systems
Probabilities & Expectations, basic linear algebra, basic calculus, Python 3.0 (at least 1 year), implementing algorithms from pseudocode
Syllabus - What you will learn from this course
Welcome to the Course!
Monte Carlo Methods for Prediction & Control
Temporal Difference Learning Methods for Prediction
Temporal Difference Learning Methods for Control
Planning, Learning & Acting
Reviews
- 5 stars81.92%
- 4 stars13.62%
- 3 stars2.88%
- 2 stars0.61%
- 1 star0.96%
TOP REVIEWS FROM SAMPLE-BASED LEARNING METHODS
Great course - well paced, with the right material. And the professors deliver content in a structured way, which makes it easier to understand complex concepts.
The lectures and quiz tests are perfect. Jupyter. Programming exercises can be a little confusing sometimes but are also great. A great course, overall.
Good balance of theory and programming assignments. I really like the weekly bonus videos with professors and developers. Recommend to everyone.
Great course. Clear, concise, practical. Right amount of programming. Right amount of tests of conceptual knowledge. Almost perfect course.
About the Reinforcement Learning Specialization

Frequently Asked Questions
When will I have access to the lectures and assignments?
What will I get if I subscribe to this Specialization?
What is the refund policy?
Is financial aid available?
More questions? Visit the Learner Help Center.