Optimistic Value Iteration
Markov decision processes are widely used for planning and verification in settings that combine controllable or adversarial choices with probabilistic behaviour. The standard analysis algorithm, value iteration, only provides lower bounds on infinite-horizon probabilities and rewards. Two “sound” v...
| Published in: | Computer Aided Verification |
|---|---|
| Main Authors: | , |
| Format: | Article |
| Language: | English |
| Published: | 2020 |
| Subjects: | |
| Online Access: | https://ncbi.nlm.nih.gov/pmc/articles/PMC7363440/ http://dx.doi.org/10.1007/978-3-030-53291-8_26 |