Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/8205
Title: A non-stationary infinite partially-observable markov decision process
Authors: Kosmopoulos, Dimitrios I. 
Chatzis, Sotirios P. 
Major Field of Science: Engineering and Technology
Field Category: Computer and Information Sciences
Keywords: Markov decision processes;Bayesian methods
Issue Date: 2014
Source: 24th International Conference on Artificial Neural Networks, Hamburg, Germany, September 15-19, 2014. Proceedings, pp. 355-362
Conference: International Conference on Artificial Neural Networks 
Abstract: Partially Observable Markov Decision Processes (POMDPs) have been met with great success in planning domains where agents must balance actions that provide knowledge and actions that provide reward. Recently, nonparametric Bayesian methods have been success- fully applied to POMDPs to obviate the need of a priori knowledge of the size of the state space, allowing to assume that the number of visited states may grow as the agent explores its environment. These approaches rely on the assumption that the agent's environment remains stationary; however, in real-world scenarios the environment may change over time. In this work, we aim to address this inadequacy by introducing a dynamic nonparametric Bayesian POMDP model that both allows for automatic inference of the (distributional) representations of POMDP states, and for capturing non-stationarity in the modeled environments. Formulation of our method is based on imposition of a suitable dynamic hierarchical Dirichlet process (dHDP) prior over state transitions. We derive e cientalgorithms for model inference and action planning and evaluate it on several benchmark tasks.
URI: https://hdl.handle.net/20.500.14279/8205
ISBN: 978-3-319-11179-7 (online)
DOI: 10.1007/978-3-319-11179-7_45
Type: Conference Papers
Affiliation : Cyprus University of Technology 
Hellenic Mediterranean University 
Appears in Collections:Δημοσιεύσεις σε συνέδρια /Conference papers or poster or presentation

Files in This Item:
File Description SizeFormat
Chatzis.pdf196.62 kBAdobe PDFView/Open
CORE Recommender
Show full item record

SCOPUSTM   
Citations

1
checked on Nov 9, 2023

Page view(s) 20

481
Last Week
0
Last month
1
checked on Dec 3, 2024

Download(s)

285
checked on Dec 3, 2024

Google ScholarTM

Check

Altmetric


Items in KTISIS are protected by copyright, with all rights reserved, unless otherwise indicated.