Please use this identifier to cite or link to this item:
|Title:||A partially-observable markov decision process for dealing with dynamically changing environments||Authors:||Chatzis, Sotirios P.
Kosmopoulos, Dimitrios I.
|Keywords:||POMDP problem;Bayesian non-parametrics;Dirichlet process||Category:||Electrical Engineering - Electronic Engineering - Information Engineering||Field:||Engineering and Technology||Issue Date:||2014||Source:||10th International Conference on Artificial Intelligence Applications and Innovations, 2014, Island of Rhodes, Greece, 19-21 September||Abstract:||This paper offers a solution to the non-stationary POMDP problem, by making use of methods and concepts from the field of Bayesian non-parametrics, specifically dynamic hierarchical Dirichlet process priors. We combine block Gibbs sampling and importance sampling to perform inference. We evaluate the method in several benchmark policy learning tasks||URI:||http://ktisis.cut.ac.cy/handle/10488/8206||Type:||Conference Papers|
|Appears in Collections:||Δημοσιεύσεις σε συνέδρια/Conference papers|
Show full item record
Page view(s) 5062
checked on Dec 11, 2018
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.