Adversarial Multi-Armed Bandit with Variable Plays
This code is for paper:
Adversarial Online Learning with Variable Plays in the Evasion-and-Pursuit Game: Theoretical Foundations and Application in Connected and Automated Vehicle Cybersecurity
Yiyang Wang, Neda Masoud. DOI: 10.13140/RG.2.2.17253.73440/1
-
AdversarialGame.py
: Run pursuit-evasion game between the two players. -
compare.py
: Compare Exp3.M-VP against Exp3, Exp3.M, UCB, and epsilon-greedy. -
Exp3M_VP
: Run Exp3M-VP algorithm with synthetic reward vectors.
For more info please contact via my email.
Check out the abstract and full paper here.