Skip to content

Lecture7: Exploration and Exploitation

Introduction

  • Three broad families:

  • state-action exploration / parameter exploration

Multi-Armed Bandits

  • We don't know where we start.
  • What does the regret look like?

  • not only initialize the value but also the count