Edouard Leurent

Results: 248 comments by Edouard Leurent

You can configure the DQN agent with
```
dict(
    exploration=dict(temperature=1.0, final_temperature=0.1, tau=5000)
)
```
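To get a feel for what these three values might do together, here is a minimal sketch. It assumes the exploration temperature decays exponentially from `temperature` to `final_temperature` with time constant `tau` (in steps). That schedule is an assumption for illustration, not something the comment above confirms, and the helper name `exploration_temperature` is hypothetical.

```python
import math


def exploration_temperature(step, temperature=1.0, final_temperature=0.1, tau=5000):
    """Hypothetical helper: assumed exponential decay of the exploration temperature.

    Starts at `temperature` for step 0 and approaches `final_temperature`
    as `step` grows; `tau` sets how quickly the decay happens.
    """
    return final_temperature + (temperature - final_temperature) * math.exp(-step / tau)


if __name__ == "__main__":
    # Print the assumed schedule at a few steps to see the decay.
    for step in (0, 5000, 20000, 100000):
        print(step, round(exploration_temperature(step), 4))
```

Under this assumption, a larger `tau` keeps exploration high for longer, while `final_temperature` sets the residual exploration level late in training.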

Hi, sorry, I should really update the instructions. The analyze.py script is deprecated (I only used it to plot the rewards, episode length stats, etc.); you should use tensorboard...

> we have agents and trainer folders, what is each one for? The `agents` folder defines RL algorithms (an agent interacts with an environment and updates its internal model), while...

Hi, thanks for the feedback! 1. You're right, maybe it's a bit confusing. My intent was to implement a lightweight RL library with many (unrelated) agents, and add my own...

1. The input is an array containing a set of per-vehicle features. For instance, I used position x, y (absolute or relative to ego vehicle position), velocity vx, vy (absolute...
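As a rough illustration of such an input, the sketch below builds one row of `(x, y, vx, vy)` per vehicle, with the ego vehicle first. The function name `vehicle_features`, the row ordering, and the choice to keep the ego row absolute while expressing the other rows relative to it are all assumptions made for this example, not details confirmed by the comment.

```python
def vehicle_features(ego, others, relative=True):
    """Hypothetical helper: build a per-vehicle feature array as a list of rows.

    `ego` and each entry of `others` are (x, y, vx, vy) tuples.
    Row 0 is the ego vehicle (kept absolute, by assumption); the remaining
    rows are the other vehicles, optionally expressed relative to the ego.
    """
    rows = [[float(value) for value in ego]]
    for vehicle in others:
        if relative:
            # Subtract the ego features component-wise.
            rows.append([float(v) - float(e) for v, e in zip(vehicle, ego)])
        else:
            rows.append([float(value) for value in vehicle])
    return rows


if __name__ == "__main__":
    ego = (10.0, 0.0, 20.0, 0.0)
    others = [(30.0, 4.0, 15.0, 0.0), (5.0, -4.0, 25.0, 0.0)]
    for row in vehicle_features(ego, others):
        print(row)
```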

No worries at all, glad you found this helpful!

Hi, no, I have no idea why PyCharm would not stop at the breakpoint... I tried it and it worked fine. ![image](https://user-images.githubusercontent.com/1706935/226110004-8f9b0b0f-3398-4d18-b878-6280c9f5d63a.png) Are you running the program in debug mode...

> Do the columns of the attention matrix not correspond to the corresponding vehicles? Why do we need to assign attention values to vehicles based on their distances? They do!...

I think there can be two roles: 1. the decision may depend on the state of the vehicle, e.g. its current position, or its speed or heading. So the attention...

Please see e.g. [this colab](https://colab.research.google.com/github/eleurent/highway-env/blob/master/scripts/intersection_social_dqn.ipynb)