Učitavanje...
Optimistic Value Iteration
Markov decision processes are widely used for planning and verification in settings that combine controllable or adversarial choices with probabilistic behaviour. The standard analysis algorithm, value iteration, only provides lower bounds on infinite-horizon probabilities and rewards. Two “sound” v...
Spremljeno u:
| Izdano u: | Computer Aided Verification |
|---|---|
| Glavni autori: | , |
| Format: | Artigo |
| Jezik: | Inglês |
| Izdano: |
2020
|
| Teme: | |
| Online pristup: | https://ncbi.nlm.nih.gov/pmc/articles/PMC7363440/ https://ncbi.nlm.nih.govhttp://dx.doi.org/10.1007/978-3-030-53291-8_26 |
| Oznake: |
Dodaj oznaku
Bez oznaka, Budi prvi tko označuje ovaj zapis!
|