机器学习 13 MDP cont.

Lesson 20.

1. POMDP : partially observable Markov decision processes 

2. pegasus policy search: A policy search method for large MDPs and POMDPs

http://vorlon.case.edu/~sray/mlrg/pegasus.pdf

你可能感兴趣的:(机器学习 13 MDP cont.)