Skip to content

Lecture6: Value Function Approximation

Introduction

  • How can we scale up the model-free methods for prediction and control from the last two lectures?

  • What we'll do is not just reduce the memory but also allow us to generalize -- to fit our function to approximate our states that we've never seen.

  • Consider this action here, how could that be?

Incremental Methods

Batch Method