train question

Open anlanxuan opened this issue 4 years ago • 3 comments

Excuse me, can you tell me how to train the agent with the linear policy? Also, after I modified the training parameters, imitation learning completes, but the next call, explorer.run_k_episodes(env.case_size['val'], 'val', episode=episode), fails. In the loop that runs action = self.robot.act(ob) followed by ob, reward, done, info = self.env.step(action), the action turns out to be None (the error is raised at vx = human.vx - action.vx). Can you give me some advice?

anlanxuan avatar Dec 03 '20 02:12 anlanxuan

Are you referring to the linear policy that moves in a straight line, and are you training a policy to mimic that move-forward-only behavior?

It's hard to say what causes this problem, since you may have made some changes and I haven't looked at the code for a while. I would suggest troubleshooting by running the original code first and then adding your changes back one at a time, so that you can locate which change caused the bug.
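While adding changes back step by step, it can help to fail at the point where the None action is produced rather than later inside env.step. The following is a minimal sketch, not code from the repository: checked_act, broken_policy, and the ActionXY stand-in are hypothetical names, and the vx/vy fields are assumed only from the action.vx access quoted in the question.

```python
from collections import namedtuple

# Hypothetical stand-in for the action type; field names are assumed
# from the `action.vx` access quoted in the question.
ActionXY = namedtuple("ActionXY", ["vx", "vy"])


def checked_act(act_fn, ob):
    """Call act_fn(ob) and raise immediately if no action is returned."""
    action = act_fn(ob)
    if action is None:
        raise RuntimeError(
            "policy returned None for observation %r; check that every "
            "code path in the policy returns an action" % (ob,)
        )
    return action


def broken_policy(ob):
    """Illustrative policy with a branch that forgets to return an action."""
    if ob:
        return ActionXY(1.0, 0.0)
    # Falls through here and implicitly returns None.
```

In the loop from the question, the call would become something like action = checked_act(self.robot.act, ob), so the traceback points at the policy instead of at the velocity subtraction.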

ChanganVR avatar Dec 09 '20 03:12 ChanganVR

Thanks. If I train the agent by imitating a policy (ORCA) and then train the value network with a DRL policy (CADRL or SARL), will the agent be able to cope with humans that follow the linear policy? And is the max number 10?

anlanxuan avatar Dec 09 '20 06:12 anlanxuan

"Whether the agent have the capability to cope with human with linear policy": I think so. Humans with a linear policy should be a relatively straightforward case to deal with.
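For reference, here is a minimal sketch of what a linear policy amounts to, assuming it means heading straight toward the goal at a fixed preferred speed, as described earlier in this thread. The function name linear_action and its parameters are hypothetical and not taken from the repository.

```python
import math


def linear_action(px, py, gx, gy, v_pref):
    """Return (vx, vy) pointing from position (px, py) to goal (gx, gy).

    The speed is always v_pref; the agent never reacts to other agents,
    which is why this behavior is comparatively easy to predict.
    """
    theta = math.atan2(gy - py, gx - px)  # heading toward the goal
    return (math.cos(theta) * v_pref, math.sin(theta) * v_pref)
```

Because the velocity depends only on the agent's own position and goal, a human following this rule moves along a straight line that the robot can extrapolate.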

"And the max num is 10?": The max number of humans? As far as I remember, it can be set to any number you like.
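As a rough sketch of changing the number of humans, assuming the environment reads it from an INI-style config: the [sim] section and human_num key below are assumptions for illustration, so check the actual config file in your checkout for the real names. The helper with_human_num is hypothetical.

```python
import configparser

# Assumed layout of the relevant part of the environment config.
CONFIG_TEXT = """
[sim]
human_num = 5
"""


def with_human_num(config_text, human_num):
    """Parse config_text and return a parser with human_num overridden."""
    if human_num < 1:
        raise ValueError("human_num must be at least 1")
    parser = configparser.RawConfigParser()
    parser.read_string(config_text)
    parser.set("sim", "human_num", str(human_num))
    return parser


parser = with_human_num(CONFIG_TEXT, 10)
```

Reading the value back with parser.getint("sim", "human_num") confirms the override before the parser is handed to the environment.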

ChanganVR avatar Dec 09 '20 18:12 ChanganVR