Dota2 Bot is an dota2 creep blocking AI bot was implement by reinforcement learning alogorithums via. Deep Deterministic Policy Gradients (DDPG). The environment was developed base on CreepBlockAI. It's part of my graduated project. (The Exploration Effect of Reinforcement Learning in Multiplayer Online Battle Arena Game).
Demo Video
- Dota2
- Dota2 Workshop Tools DLC
- python3.6
- pip
$ pip install -r requirement.txt
Correct settting is nessesery for corect running task such as
- training by uniform noise
- traing by Ornstein-Uhlenbeck
- testing (evaluating)
>>> BUFFER_SIZE = 100000
>>> GAMMA = 0.99 # Discounted Factor
>>> BATCH_SIZE = 200
>>> TAU = 0.001 # Target Network HyperParameters
>>> LRA = 0.0001 # Learning rate for Actor
>>> LRC = 0.001 # Lerning rate for Critic
>>> ACTION_DIM = 2 # x_pos , y_pos
>>> STATE_DIM = 11 # of sensors input
>>> ACTOR_HIDDEN1_UNITS = 150
>>> ACTOR_HIDDEN2_UNITS = 200
>>> CRITIC_HIDDEN1_UNITS = 150
>>> CRITIC_HIDDEN2_UNITS = 200
>>> TRAIN = 0
>>> EXPLORE = 20
>>> OU = 0
>>> MU = -10
>>> SIGMA = 30
>>> REPLACE_FREQ = 1
>>> BOOTSTRAP_FREQ = 5
>>> SAVE_FREQ = 1000
$ python server/app.py
$ python server/reload.py --timestamp 1521285961 --ep 151000
$ python server/reload.py --timestamp 1523632535 --ep 118000