Adaptive planning for markov decision processes with uncertain transition models via incremental feature dependency discovery

N. Kemal Ure*, Alborz Geramifard, Girish Chowdhary, Jonathan P. How

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Citations (Scopus)

Abstract

Solving large scale sequential decision making problems without prior knowledge of the state transition model is a key problem in the planning literature. One approach to tackle this problem is to learn the state transition model online using limited observed measurements. We present an adaptive function approximator (incremental Feature Dependency Discovery (iFDD)) that grows the set of features online to approximately represent the transition model. The approach leverages existing feature-dependencies to build a sparse representation of the state transition model. Theoretical analysis and numerical simulations in domains with state space sizes varying from thousands to millions are used to illustrate the benefit of using iFDD for incrementally building transition models in a planning framework.

Original languageEnglish
Title of host publicationMachine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2012, Proceedings
Pages99-115
Number of pages17
EditionPART 2
DOIs
Publication statusPublished - 2012
Externally publishedYes
Event2012 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML-PKDD 2012 - Bristol, United Kingdom
Duration: 24 Sept 201228 Sept 2012

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 2
Volume7524 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference2012 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML-PKDD 2012
Country/TerritoryUnited Kingdom
CityBristol
Period24/09/1228/09/12

Fingerprint

Dive into the research topics of 'Adaptive planning for markov decision processes with uncertain transition models via incremental feature dependency discovery'. Together they form a unique fingerprint.

Cite this