🌱 Jongmin Lee

What is Reinforcement Learning?

Apr 16, 2024

DeepLearningAI
MachineLearningSpecialization

Reinforcement Learning

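state $s$ → action $a$ (the counterpart of the input x → output y mapping in supervised learning)
- reward function: tells the agent when it is doing well and when it is doing poorly
- positive reward for good behavior / negative reward for bad behavior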

Applications

Controlling robots
Factory optimization
Financial (stock) trading
Playing games (including video games)

Mars rover example

The return in reinforcement learning

Return = $R_{1} + γ R_{2} + γ^{2} R_{3} + ...$ (until terminal state)
Discount factor: $γ = 0.9$
- $0 + (0.9) 0 + (0.9)^{2} 0 + (0.9)^{3} 100 = 72.9$
The return depends on the actions you take
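
The return formula above can be sketched in a few lines of Python. This is a minimal illustration, not code from the course; the function name `discounted_return` and its parameters are assumptions made for this note.

```python
def discounted_return(rewards, discount=0.9):
    """Sum the rewards, weighting the k-th reward (0-based) by discount**k."""
    total = 0.0
    for k, reward in enumerate(rewards):
        total += (discount ** k) * reward
    return total


# Reward sequence from the example above: three steps with reward 0, then 100.
rewards = [0, 0, 0, 100]
print(discounted_return(rewards))  # approximately 72.9 (up to floating-point rounding)
```

A different sequence of actions produces a different reward sequence, which is why the return depends on the actions you take.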

Making decisions: Policies in reinforcement learning

Policy

A policy is a function that maps states to actions: it tells you which action to take in a given state.
- $π (s) = a$
- $π$ : policy
- $s$ : state
- $a$ : action
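
For a small, discrete problem, a policy can be written down as a lookup table. The sketch below assumes hypothetical states numbered 2 to 5 and two hypothetical actions, `"left"` and `"right"`; none of these names come from the original notes.

```python
# Hypothetical policy table: one action per non-terminal state (illustrative values).
POLICY_TABLE = {2: "left", 3: "left", 4: "left", 5: "right"}


def pi(state):
    """Return the action the policy prescribes for the given state."""
    return POLICY_TABLE[state]


print(pi(4))  # left
```

With many or continuous states a table no longer fits, and the policy would typically be a learned function instead.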

The goal of reinforcement learning

Find a policy $π$ that tells you what action $(a = π (s))$ to take in every state $s$ so as to maximize the return
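
To make the goal concrete, here is a brute-force sketch on a tiny, hypothetical environment: six states in a line, with states 1 and 6 terminal. The reward values, the discount of 0.5, and all names (`REWARDS`, `policy_return`, `best_policy`) are assumptions chosen for illustration. Ranking policies by the sum of their returns over all start states is a shortcut that happens to work for this small deterministic example; it is not a general reinforcement learning algorithm.

```python
from itertools import product

# Hypothetical environment: states 1..6 in a line, terminal at both ends.
REWARDS = {1: 100, 2: 0, 3: 0, 4: 0, 5: 0, 6: 40}
TERMINAL = {1, 6}
NON_TERMINAL = [2, 3, 4, 5]


def policy_return(policy, start, discount=0.5, max_steps=50):
    """Discounted return from `start` when following `policy`.

    `max_steps` stops policies that bounce between states forever.
    """
    state, total, weight = start, 0.0, 1.0
    for _ in range(max_steps):
        total += weight * REWARDS[state]
        if state in TERMINAL:
            break
        state += -1 if policy[state] == "left" else 1
        weight *= discount
    return total


# Enumerate every possible policy (2 actions in 4 states = 16 candidates).
candidates = [
    dict(zip(NON_TERMINAL, actions))
    for actions in product(["left", "right"], repeat=len(NON_TERMINAL))
]

# Pick the candidate with the largest total return across all start states.
best_policy = max(
    candidates,
    key=lambda p: sum(policy_return(p, s) for s in NON_TERMINAL),
)
print(best_policy)
```

Under these assumed rewards the best policy heads left from states 2 to 4 and right from state 5, because from state 5 the nearby smaller reward is discounted less than the distant larger one.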

Review of key concepts

Markov Decision Process (MDP)
