WorldOnRails icon indicating copy to clipboard operation
WorldOnRails copied to clipboard

about data collecting in data_phase0

Open Watson52 opened this issue 2 years ago • 3 comments

Hi Chen. Thanks for a lot of open source works and I am trying to follow them. I jump from the data collecting about LAV as you said there are similarities between them. I found that the random collector return a random control including throttle and steer. I feel confused how could the agent arrived the goal under the random control? ~~Does WOR and LAV use the same agent?~~ How does it chose random routes?

Watson52 avatar Apr 02 '22 04:04 Watson52

Hi,

Thanks a lot for your interest in our projects. The distinction between LAV and Rails (WOR) in data collection is: LAV uses an expert; Rails first use a completely random data to train a vehicle kinematics mode; it then uses this model to build a crude agent to collect the main traces; the main traces replay and we use model-based RL offline to distill an agent.

Let me know if you have further questions.

dotchen avatar Apr 02 '22 05:04 dotchen

Thanks for your reply, it make me clear about the WOR. I think I should relize how the leaderboard and scenario runner work first. I will ask you if I meet problem. Thanks again.

Watson52 avatar Apr 02 '22 07:04 Watson52

Sure thing. Feel free to also email me at [email protected] for more questions.

dotchen avatar Apr 02 '22 07:04 dotchen