Stanford CS234: Reinforcement Learning | Winter 2019 | Lecture 1

What is Reinforcement Learning(RL)

How an intelligent agent learns to make good sequences of decisions according to repeated interactions with World

Key aspects of RL

Comparing RL with similar AI procedures

Sequential Decision Making

캡처.JPG

Goal : compose set of actions to maximize total expected future reward

→ may require strategic behavior to achieve max rewards(need to balance between immediate & long term rewards)

Agent & World Interaction(Discrete Time)

캡처2.JPG

Each time step $t$ :