基于ARMA车速预测的智能车交叉口强化学习决策研究
作者单位:

重庆大学

基金项目:

重庆市基础研究与前沿探索项目(CSTC2018JCYJAX0409);四川省科技计划资助项目(2019 YFG0528)


Research on reinforcement learning autonomous vehicle decision making at intersection based on ARMA speed forecasting model
Affiliation:

ChongQing University

Fund Project:

Supported by Chongqing Basic Research and Frontier Exploration Project(CSTC2018JCYJAX0409); SiChuan Science and Technology funding project(2019YFG0528).

  • 摘要
  • | |
  • 访问统计
  • |
  • 参考文献 [18]
  • |
  • 相似文献
  • |
  • 引证文献
  • | |
  • 文章评论
    摘要:

    为解决无信号交叉口的智能车决策控制问题,以双向单车道交叉口下两车合流工况为对象,采用强化学习方法开展研究,建立车辆状态空间到动作空间的映射关系。针对目前研究中环境车辆车速设置过于简单的问题,以实际场景下采集的数据作为环境车辆的轨迹信息以构建场景模型。基于自回归滑动平均模型对环境车辆的车速进行预测。结合智能车以及所预测的环境车辆车速时序信息建立先行让行决策模型以计算本车参考车速,引入参考车速构建强化学习的奖励函数加速训练收敛速度。结果表明:所提出的强化学习模型具有较快的收敛速度,训练得到的智能体在与不同驾驶风格的环境车辆博弈时能够安全通过交叉口,为无信号交叉口智能车安全通行决策控制问题提供了参考依据。

    Abstract:

    In order to solve the problem of autonomous vehicle decision and control at an unsignalized intersection, this paper studies the confluence condition of two vehicles at a two-way single-lane intersection. The method of reinforcement learning is used for the decision and control of autonomous vehicle to establish the mapping relationship between vehicle state space and action space. Aiming at the problem that the vehicle speed setting is too simple in the present study, the data collected in the actual scene was used as the trajectory information of the environmental vehicle to construct the scene model. Based on the autoregressive moving average model, the vehicle speed of environmental vehicle is predicted. Combining with the autonomous vehicle and the predicted environmental vehicle speed timing information, the forward decision-making model is established to calculate the reference speed, and the reference speed is introduced to construct the reinforcement learning reward function to accelerate the training convergence speed. The results show that the enhanced learning model has a fast convergence speed, and the trained agent can safely pass through the intersection when gambling with environmental vehicles with different driving styles, this paper provides a reference for autonomous vehicle safety decision-making and control at no-signal intersection.

    参考文献
    [1] SHIRAZI M S, MORRIS B T. Looking at Intersections: A Survey of Intersection Monitoring, Behavior and Safety Analysis of Recent Studies [J]. Ieee Transactions on Intelligent Transportation Systems, 2017, 18(1): 4-24
    [2] HULT R, ZANON M, GROS S, et al. Optimal Coordination of Automated Vehicles at Intersections: Theory and Experiments [J]. IEEE Transactions on Control Systems Technology, 2019, 27(6): 2510-25
    [3] MA L, XUE J R, KAWABATA K, et al. Efficient Sampling-Based Motion Planning for On-Road Autonomous Driving [J]. Ieee Transactions on Intelligent Transportation Systems, 2015, 16(4): 1961-76
    [4] NOH S. Decision-Making Framework for Autonomous Driving at Road Intersections: Safeguarding Against Collision, Overly Conservative Behavior, and Violation Vehicles [J]. Ieee Transactions on Industrial Electronics, 2019, 66(4): 3275-86
    [5] RAMYAR S, HOMAIFAR A, ANZAGIRA A, et al. Fuzzy modeling of drivers'' actions at intersections; proceedings of the 2016 World Automation Congress (WAC) [C]. F 31 July-4 Aug. 2016
    [6] BOUTON M, COSGUN A, KOCHENDERFER M J. Belief State Planning for Autonomously Navigating Urban Intersections [J]. Ieee Int Veh Sym, 2017, 825-30
    [7] HUBMANN C, QUETSCHLICH N, SCHULZ J, et al. A POMDP Maneuver Planner For Occlusions in Urban Scenarios [J]. 2019 30th Ieee Intelligent Vehicles Symposium (Iv19), 2019, 2172-9
    [8] SHU K, YU H, CHEN X, et al. Autonomous Driving at Intersections: A Critical-Turning-Point Approach for Left Turns [J/OL]. ArXiv, 2020, abs/2003.02409
    [9] DONG-KYOUNG K, SEONG-WOO K, SEUNG-WOO S. Decision Making for Automated Driving at Unsignalized Intersection [J]. 2015 15th International Conference on Control, Automation and Systems (ICCAS), 2015
    [10] HUBMANN C, SCHULZ J, BECKER M, et al. Automated Driving in Uncertain Environments: Planning With Interaction and Uncertain Maneuver Prediction [J]. IEEE Transactions on Intelligent Vehicles, 2018, 3(1): 517
    [11] MNIH V, KAVUKCUOGLU K, SILVER D, et al. Playing Atari with Deep Reinforcement Learning [J/OL]. ArXiv, 2013, abs/1312.5602
    [12] SILVER D, HUANG A, MADDISON C J, et al. Mastering the game of Go with deep neural networks and tree search [J]. Nature, 2016, 529(7587): 484-+
    [13] ZHOU M F, YU Y, QU X B. Development of an Efficient Driving Strategy for Connected and Automated Vehicles at Signalized Intersections: A Reinforcement Learning Approach [J]. Ieee Transactions on Intelligent Transportation Systems, 2020, 21(1): 433-43
    [14] CHEN W-L, LEE K, HSIUNG P-A. Intersection Crossing for Autonomous Vehicles based on Deep Reinforcement Learning [J]. 2019 IEEE International Conference on Consumer Electronics - Taiwan (ICCE-TW), 2019, 1-2
    [15] ISELE D, RAHIMI R, COSGUN A, et al. Navigating Occluded Intersections with Autonomous Vehicles using Deep Reinforcement Learning [J]. IEEE International Conference on Robotics and Automation, 2018, 09:2034 - 2039
    [16] ZHIQIAN Q, KATHARINA M, JOHN M. D. Automatically Generated Curriculum based Reinforcement Learning for Autonomous Vehicles in Urban Environment [J]. 2018 IEEE Intelligent Vehicles Symposium (IV), 2018
    [17] dataset available: http://www.openits.cn/openPaper/614.jhtml
    [18] TSAY R S, TIAO G C. Consistent Estimates of Autoregressive Parameters and Extended Sample Autocorrelation Function for Stationary and Nonstationary ARMA Models [J]. Journal of the American Statistical Association, 1984, 79(385): 84-96
    相似文献
    引证文献
引用本文
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2020-12-08
  • 最后修改日期:2021-03-09
  • 录用日期:2021-03-24
文章二维码