Repeatedly train agent #75

martinv · 2018-12-13T01:45:53Z

I would like to train the agent throughout multiple episodes and force it to go back to initial position at the beginning of each episode. I am not able to reset the environment, however. The following 'pseudocode'
`
num_episodes = 10
episode_len = 30

env = marlo.init(join_token)

for ep in range(num_episodes):

 print("Running episode {}".format(ep))

 observation = env.reset()
 done = False

 t_iter = 0
 while (not done) and (t_iter <= episode_len):
     print("  t iter = {}".format(t_iter))
     _action = env.action_space.sample()
     obs, reward, done, info = env.step(_action)

     if done:
         break

     t_iter += 1

`

seems to execute once and then hangs with the error message:

Running episode 1
WARNING:marlo.base_env_builder:Error on attempting to start mission : A mission is already running.
WARNING:marlo.base_env_builder:Will attempt again after 3 seconds.
repeated multiple times until the whole simulation is killed.

How do I force the agent to make a 'clean start' at the beginning of each training episode?

The text was updated successfully, but these errors were encountered:

AndKram · 2018-12-14T09:15:40Z

Which mission are you running? It's certainly safer to loop until done and not attempt a reset while the mission is in progress. There is a quit message sent on reset but possibly the mission is not quitting. Does it work with episode_len very large?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repeatedly train agent #75

Repeatedly train agent #75

martinv commented Dec 13, 2018 •

edited

Loading

AndKram commented Dec 14, 2018 •

edited

Loading

Repeatedly train agent #75

Repeatedly train agent #75

Comments

martinv commented Dec 13, 2018 • edited Loading

AndKram commented Dec 14, 2018 • edited Loading

martinv commented Dec 13, 2018 •

edited

Loading

AndKram commented Dec 14, 2018 •

edited

Loading