Ana gezinime geç Aramaya geç Ana içeriğe geç

A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward

  • Eatron Technologies Ltd

Araştırma sonucu: Konferansa katkıYazıbilirkişi

24 Atıf (Scopus)

Özet

Automated lane change is one of the most challenging task to be solved of highly automated vehicles due to its safety-critical, uncertain and multi-agent nature. This paper presents the novel deployment of the state of art \mathbf{Q} learning method, namely Rainbow DQN, that uses a new safety driven rewarding scheme to tackle the issues in an dynamic and uncertain simulation environment. We present various comparative results to show that our novel approach of having reward feedback from the safety layer dramatically increases both the agent's performance and sample efficiency. Furthermore, through the novel deployment of Rainbow DQN, it is shown that more intuition about the agent's actions is extracted by examining the distributions of generated \mathbf{Q} values of the agents. The proposed algorithm shows superior performance to the baseline algorithm in the challenging scenarios with only 200000 training steps (i.e. equivalent to 55 hours driving).

Orijinal dilİngilizce
Sayfalar1156-1161
Sayfa sayısı6
DOI'lar
Yayın durumuYayınlandı - 2020
Etkinlik31st IEEE Intelligent Vehicles Symposium, IV 2020 - Virtual, Las Vegas, United States
Süre: 19 Eki 202013 Kas 2020

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???31st IEEE Intelligent Vehicles Symposium, IV 2020
Ülke/BölgeUnited States
ŞehirVirtual, Las Vegas
Periyot19/10/2013/11/20

Bibliyografik not

Publisher Copyright:
© 2020 IEEE.

Finansman

Ugur Yavas is with Eatron Technologies, Istanbul, Turkey [email protected] Tufan Kumbasar is with Control and Automation Engineering Department, Istanbul Technical University, Turkey [email protected] Nazım Kemal Ure is with Artificial Intelligence and Data Science Research Center and Department of Aeronautical Engineering, Istanbul Technical University, Turkey [email protected] This work was supported by the Research Fund of the Scientific and Technological Research Council of Turkey under Project 118E807.

FinansörlerFinansör numarası
Türkiye Bilimsel ve Teknolojik Araştirma Kurumu118E807
Istanbul Teknik Üniversitesi

    Parmak izi

    A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.

    Alıntı Yap