Finding the Optimal Security Policies for Autonomous Cyber Operations With Competitive Reinforcement Learning

Autor: Garrett Mcdonald, Li Li, Ranwa Al Mallah
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: IEEE Access, Vol 12, Pp 120292-120305 (2024)
Druh dokumentu: article
ISSN: 2169-3536
DOI: 10.1109/ACCESS.2024.3446310
Popis: Reinforcement Learning (RL) has been responsible for some of the most impressive advances in the field of Artificial Intelligence (AI). Research in competitive RL has shown that multiple agents competing in an adversarial environment can learn simultaneously in order to discover their optimal decision-making policies. Competitive RL algorithms have been used to train performant AI for a variety of games and optimization problems. Cybersecurity is a domain where the emerging research in competitive RL is being considered for its real-world application. In order to develop Automated Cyber Operations (ACO) tools using RL, various open-source environments are available to simulate network security incidents. However, the existing research in these environments is typically one-sided: a Red or Blue agent is trained to optimize their decision-making against a static opponent. Competitive RL has not been attempted in these emerging environments. In this work, we trained agents using competitive RL to approximate their game theory optimal policies in a simulated ACO environment. We showed that near-optimal behavior was reached gradually through fictitious play demonstrating that these strategies can be used to approximate the optimal policies for agents involved in sophisticated sequential decision-making during a cyber attack.
Databáze: Directory of Open Access Journals