Paper Title

A Dynamic Holding Approach to Stabilizing a Bus Line Based on the Q-learning Algorithm with Multistage Look-ahead

Authors

He, Sheng-Xue; He, Jian-Jia; Liang, Shi-Dong; Dong, June Qiong; Yuan, Peng-Cheng

Abstract

The unreliable service and unstable operation of a high-frequency bus line manifest as bus bunching and an uneven distribution of headways along the line. Although many control strategies, such as static and dynamic holding strategies, have been proposed to solve these problems, many of them rely on oversimplified assumptions about real bus line operation, so it is hard for them to adapt continuously to the evolving complex system. In view of this dynamic setting, we present an adaptive holding method that combines classic approximate dynamic programming (ADP) with a multistage look-ahead mechanism. The holding time, which is the only control measure used in this study, is determined by estimating its impact on the operational stability of the bus line system over the remaining observation period. The multistage look-ahead mechanism introduced into the classic Q-learning algorithm of the ADP model helps the algorithm get through its early unstable phase more quickly and easily. During the implementation of the new holding approach, past experience of holding operations is accumulated effectively in an artificial neural network used to approximate the unavailable Q-factor. The use of a detailed simulation system makes it possible to take into account most of the possible causes of instability. Numerical experiments show that the new holding approach can stabilize the system by producing evenly distributed headways and thoroughly removing bus bunching. Compared with terminal station holding strategies, the new method yields a more reliable bus line with shorter waiting times for passengers.
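
The idea of a multistage look-ahead in Q-learning can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no formulas, so the function names (`lookahead_target`, `q_update`, `best_holding_time`), the discount factor `gamma`, the step size `alpha`, the use of stage costs (for example, headway deviation), and the plain dictionary in place of the paper's artificial neural network are all assumptions made for illustration.

```python
from typing import Dict, Sequence


def lookahead_target(costs: Sequence[float], tail_q: float, gamma: float = 0.9) -> float:
    """Multistage look-ahead target (assumed form).

    Sums the discounted costs observed over the next len(costs) stages,
    then adds the discounted Q-factor estimate for the state reached
    after those stages.
    """
    target = 0.0
    for k, cost in enumerate(costs):
        target += (gamma ** k) * cost
    return target + (gamma ** len(costs)) * tail_q


def q_update(q_old: float, target: float, alpha: float = 0.1) -> float:
    """Standard Q-learning style update: move the estimate toward the target."""
    return q_old + alpha * (target - q_old)


def best_holding_time(q_by_holding: Dict[float, float]) -> float:
    """Pick the holding time (seconds, hypothetical unit) with the lowest estimated cost."""
    return min(q_by_holding, key=q_by_holding.get)
```

With a single simulated stage this reduces to the usual one-step Q-learning target; using several stages relies more on simulated costs and less on the still-inaccurate Q-factor estimate, which is one plausible reading of why the look-ahead helps during the early unstable phase.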
