论文标题

Gencos的行为基于Q学习的模型,改善了二分法

GenCos' Behaviors Modeling Based on Q Learning Improved by Dichotomy

论文作者

Jia, Qiangang, Hu, Zhaoyu, Li, Yiyan, Yan, Zheng, Chen, Sijie

论文摘要

Q学习被广泛用于模拟电力市场​​中的发电公司(Gencos)的行为。但是,现有的Q学习方法通​​常需要许多迭代来融合,这在实践中耗时且效率低下。为了提高计算效率,本文提出了一种新颖的Q学习算法改善了二分法。此方法通过逐步将状态空间和动作空间进行二分化来修改Q表的更新过程。仿真导致一个重复的Cournot游戏显示了拟议算法的有效性。

Q learning is widely used to simulate the behaviors of generation companies (GenCos) in an electricity market. However, existing Q learning method usually requires numerous iterations to converge, which is time-consuming and inefficient in practice. To enhance the calculation efficiency, a novel Q learning algorithm improved by dichotomy is proposed in this paper. This method modifies the update process of the Q table by dichotomizing the state space and the action space step by step. Simulation results in a repeated Cournot game show the effectiveness of the proposed algorithm.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源