Markov decision processes with uncertain parameters

Scheftelowitsch, Dimitri

dc.contributor.advisor	Buchholz, Peter
dc.contributor.author	Scheftelowitsch, Dimitri
dc.contributor.referee	Hermanns, Holger
dc.date.accepted	2018-05-03
dc.date.accessioned	2018-07-03T06:59:16Z
dc.date.available	2018-07-03T06:59:16Z
dc.date.issued	2018
dc.description.abstract	Markov decision processes model stochastic uncertainty in systems and allow one to construct strategies that optimize the behaviour of a system with respect to some reward function. However, the parameters for this uncertainty, that is, the probabilities inside a Markov decision model, are derived from empirical or expert knowledge and are themselves subject to uncertainties such as measurement errors or limited expertise. This work considers second-order uncertainty models for Markov decision processes and derives theoretical and practical results. Among other models, this work considers two main forms of uncertainty. One form is a set of discrete scenarios with a prior probability distribution, where the task is to maximize the expected reward under the given probability distribution. The other form is a continuous uncertainty set of scenarios, where the task is to compute a policy that optimizes the rewards in the optimistic and pessimistic cases. The work provides two kinds of results. First, we establish complexity-theoretic hardness results for the considered optimization problems. Second, we design heuristics for some of the problems and evaluate them empirically. In the first class of results, we show that additional model uncertainty makes the optimization problems harder to solve, because it introduces an additional party with its own optimization goals. In the second class of results, we show that even though the discussed problems are hard to solve in theory, efficient heuristics can solve them well enough for practical applications.	en
dc.identifier.uri	http://hdl.handle.net/2003/36946
dc.identifier.uri	http://dx.doi.org/10.17877/DE290R-18945
dc.language.iso	en	de
dc.subject	Optimierung	de
dc.subject	Stochastische Prozesse	de
dc.subject	Robuste Optimierung	de
dc.subject	Unsicherheit	de
dc.subject.ddc	004
dc.subject.rswk	Optimierung	de
dc.subject.rswk	Stochastischer Prozess	de
dc.subject.rswk	Robuste Optimierung	de
dc.subject.rswk	Unsicheres Schließen	de
dc.title	Markov decision processes with uncertain parameters	en
dc.type	Text	de
dc.type.publicationtype	doctoralThesis	de
dcterms.accessRights	open access
eldorado.dnb.deposit	true	de
eldorado.secondarypublication	false	de

Files

Original bundle

Name: Scheftelowitsch_Dissertation.pdf
Size: 1.08 MB
Format: Adobe Portable Document Format
Description: DNB

License bundle

Name: license.txt
Size: 4.85 KB
Format: Item-specific license agreed upon to submission

Collections

LS 04 Praktische Informatik