-
-
Notifications
You must be signed in to change notification settings - Fork 10
Open
Labels
Description
The current code solves dynamic programs with
- reward function
f(s, x)and - state transition function
g(s, x, e)
where
sis anN-dimensional continuous state variablexis a 1-dimensional continuous action variable andeis a discrete random variable, whose distribution is state independent.
Discussion:
- Do we want to allow discrete state variables in some dimensions?
- We want to allow discrete actions (Allow discrete actions #5) and multidimensional actions.
- Do we want to allow state dependent distributions?