Situational awareness and path prediction combining game theory and reinforcement learning
CSTR:
Author:
Affiliation:

1.State Grid Chongqing Electric Power Company, Chongqing 400014, P. R. China;2.Electric Power Research Institute of State Grid Chongqing Electric Power Company, Chongqing 401123, P. R. China;3.State Grid Electric Power Research Institute Co., Ltd., Nanjing 211106, P. R. China;4.Nanjing NARI Information Communication Technology Co., Ltd., NARI Group Co., Ltd., Nanjing 211106, P. R. China;5.School of Software Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400000, P. R. China

Clc Number:

TP393

Fund Project:

Supported by Science and Technology Projects of State Grid Chongqing Electric Power Company (520626190067).

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    Cybersecurity situational awareness technology plays a critical role in assessing network security status, predicting potential attack paths, and assisting administrators in implementing effective defenses. Traditional methods for network situation assessment mostly rely on theoretical analysis, limiting their practicality in real-world networks. Additionally, the complexity of sensor-collected data often results in excessive storage demands. To address these challenges, this paper proposes a dynamic network attack-defense perception model that integrates reinforcement learning and game theory to enhance situational awareness and predict potential attack paths. The approach begins with the design of a hierarchical analytic process using a priority relation matrix to calculate system losses and assess security posture. Next, the Boltzmann probability distribution is employed to calculate the mixed-strategy Nash equilibrium, identifying optimal strategic responses. Finally, an improved Q-learning algorithm, in combination with game-theoretic principles, is used to dynamically model network state transitions, enabling accurate prediction of attack paths and supporting defenders in selecting optimal defense strategies. Simulation results validate the model’s effectiveness and practicality in complex network environments.

    Reference
    Related
    Cited by
Get Citation

杨云,梁花,魏兴慎,李洋,刘俊.结合博弈论与强化学习的态势感知与路径预测[J].重庆大学学报,2025,48(6):84~97

Copy
Related Videos

Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:October 12,2020
  • Revised:
  • Adopted:
  • Online: July 11,2025
  • Published:
Article QR Code