Ana gezinime geç Aramaya geç Ana içeriğe geç

Adaptive planning for markov decision processes with uncertain transition models via incremental feature dependency discovery

  • N. Kemal Ure*
  • , Alborz Geramifard
  • , Girish Chowdhary
  • , Jonathan P. How
  • *Bu çalışma için yazışmadan sorumlu yazar
  • Massachusetts Institute of Technology

Araştırma sonucu: Kitap/Rapor/Konferans Bildirisinde BölümKonferans katkısıbilirkişi

10 Atıf (Scopus)

Özet

Solving large scale sequential decision making problems without prior knowledge of the state transition model is a key problem in the planning literature. One approach to tackle this problem is to learn the state transition model online using limited observed measurements. We present an adaptive function approximator (incremental Feature Dependency Discovery (iFDD)) that grows the set of features online to approximately represent the transition model. The approach leverages existing feature-dependencies to build a sparse representation of the state transition model. Theoretical analysis and numerical simulations in domains with state space sizes varying from thousands to millions are used to illustrate the benefit of using iFDD for incrementally building transition models in a planning framework.

Orijinal dilİngilizce
Ana bilgisayar yayını başlığıMachine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2012, Proceedings
YayınlayanSpringer Verlag
Sayfalar99-115
Sayfa sayısı17
BaskıPART 2
ISBN (Basılı)9783642334856
DOI'lar
Yayın durumuYayınlandı - 2012
Harici olarak yayınlandıEvet
Etkinlik12th Joint European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2012 - Bristol, United Kingdom
Süre: 24 Eyl 201228 Eyl 2012

Yayın serisi

AdıLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SayıPART 2
Hacim7524 LNAI
ISSN (Basılı)0302-9743
ISSN (Elektronik)1611-3349

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???12th Joint European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2012
Ülke/BölgeUnited Kingdom
ŞehirBristol
Periyot24/09/1228/09/12

Parmak izi

Adaptive planning for markov decision processes with uncertain transition models via incremental feature dependency discovery' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.

Alıntı Yap