Shibi He comments

Results 14 comments of


                                            Shibi He

Implement optimality tightening

Hi guys, I have released the code at https://github.com/ShibiHe/Q-Optimality-Tightening. Please have a look. Best, Shibi

Speeding up Episodic control

You are asking really incisive questions. Thank you very much, Sudeep. The first question is about KNN algorithm. Due to the high dimension of the states and the dynamic LRU...

Speeding up Episodic control

Wow, thank you very much. I am having a vacation now and will be back school in two days. I was about to tell you that I got the reply...

Speeding up Episodic control

Hello. I checked your code and you were using KD tree. I have pushed my new commitment in which I implemented approximate KNN. Please have a look if possible. Thanks...

Question about quadratic penalties

Hi, please refer to Explain part in the readme for details.

Fewer steps per second as training progresses

It is probably because when the epsilon-greedy is less likely to be greedy, the inference takes more time. I have also noticed different game behaves really different. Some games' speeds...

Fewer steps per second as training progresses

I think I run a few random seeds and compute an average.

Fewer steps per second as training progresses

I remember I wrote another code specially for doing 30 no-op test. This code base is quite old, so I do not remember the details. My new implementation was using...

Question on upper bound

Good question. Theoretically, we should only use the upper bounds after the Q is sufficiently trained and we find upper bounds stabilize the training. In practice, we just use the...

Areas for improvement

How's your improvement? I would like to make grayscaling and frame skip too.