Deadfish🐟 Studying Free

Skip to content

Deadfish🐟 Studying Free

Main Navigation Home Reinforcement Learning Cognitive Behavioral Computing ML Derivation

Appearance

Sidebar Navigation

Reinforcement Learning

Overview

Lecture 1: Intro to RL

Lecture 2

Lecture 3

Lecture 4

Lecture 5

Lecture 6

Lecture 7

Agents

On this page

Lecture7: Exploration and Exploitation

Introduction

Three broad families:

state-action exploration / parameter exploration

Multi-Armed Bandits

We don't know where we start.
What does the regret look like?

not only initialize the value but also the count

Pager

Previous pageLecture 6

Next pageAgents