Učitavanje...

Optimistic Value Iteration

Markov decision processes are widely used for planning and verification in settings that combine controllable or adversarial choices with probabilistic behaviour. The standard analysis algorithm, value iteration, only provides lower bounds on infinite-horizon probabilities and rewards. Two “sound” v...

Cijeli opis

Spremljeno u:
Bibliografski detalji
Izdano u:Computer Aided Verification
Glavni autori: Hartmanns, Arnd, Kaminski, Benjamin Lucien
Format: Artigo
Jezik:Inglês
Izdano: 2020
Teme:
Online pristup:https://ncbi.nlm.nih.gov/pmc/articles/PMC7363440/
https://ncbi.nlm.nih.govhttp://dx.doi.org/10.1007/978-3-030-53291-8_26
Oznake: Dodaj oznaku
Bez oznaka, Budi prvi tko označuje ovaj zapis!