Özet
Reinforcement learning is a learning scheme for an autonomous agent that allows the agent to find the optimal policy of taking actions which maximize a scalar reinforcement signal in unknown environments. If the agent has access to the whole state of the environment, a reactive policy which maps the sensory input to the action is sufficient. However, if the state of the environment is partially observable, special methods for creating a dynamic policy that utilizes the past observations are necessary. To overcome this problem, the authors have proposed a method using recurrent neural networks with Q-learning, as a learning agent. This paper compares several types of network architecture and learning algorithms for this method through computer simulation. Further, the internal representation in the trained networks is examined using a clustering technique. It shows that the representation of the environmental state is developed well in the networks.
| Orijinal dil | İngilizce |
|---|---|
| Sayfalar | 2010-2015 |
| Sayfa sayısı | 6 |
| Yayın durumu | Yayınlandı - 1998 |
| Harici olarak yayınlandı | Evet |
| Etkinlik | Proceedings of the 1998 IEEE International Joint Conference on Neural Networks. Part 1 (of 3) - Anchorage, AK, USA Süre: 4 May 1998 → 9 May 1998 |
???event.eventtypes.event.conference???
| ???event.eventtypes.event.conference??? | Proceedings of the 1998 IEEE International Joint Conference on Neural Networks. Part 1 (of 3) |
|---|---|
| Şehir | Anchorage, AK, USA |
| Periyot | 4/05/98 → 9/05/98 |
Parmak izi
Recurrent neural networks for reinforcement learning: Architecture, learning algorithms and internal representation' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.Alıntı Yap
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver