Skip to content

Formulation #6

@oyamad

Description

@oyamad

The current code solves dynamic programs with

  • reward function f(s, x) and
  • state transition function g(s, x, e)

where

  1. s is an N-dimensional continuous state variable
  2. x is a 1-dimensional continuous action variable and
  3. e is a discrete random variable, whose distribution is state independent.

Discussion:

  1. Do we want to allow discrete state variables in some dimensions?
  2. We want to allow discrete actions (Allow discrete actions #5) and multidimensional actions.
  3. Do we want to allow state dependent distributions?

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions