马尔可夫链

Purpose:

  • Markov Process
  • Markov Reward Process
  • Markov Decision Process
  • Extensions to MDPs

Introduction to MDPs

Markov decision process 是为RL描述和框架一个环境。

results matching ""

    No results matching ""