Joshua Choo Yun Keat comments

Results 10 comments of


                                            Joshua Choo Yun Keat

Incorrect documentation for `warm_start` behavior on BaseForest-derived classes

take

Incorrect documentation for `warm_start` behavior on BaseForest-derived classes

Hi @cmarmo, I've submitted a pull request (https://github.com/scikit-learn/scikit-learn/pull/24579) based on @NicolasHug's comments in https://github.com/scikit-learn/scikit-learn/issues/20435#issuecomment-872835169. Could you help me take a look at it? Thanks!

Incorrect documentation for `warm_start` behavior on BaseForest-derived classes

@cmarmo I would like to help with this issue. How should this PR be done given #24764?

Incorrect documentation for `warm_start` behavior on BaseForest-derived classes

Thank you for the comments, @cmarmo and @glemaitre, I will work on a PR incorporating what both of you have suggested.

Pong2Player environment usage

Hey, sorry for the late reply. We are still working on this project so it won't be complete for another month or so. I saw that you are interested in...

Pong2Player environment usage

Ok, let me know if you have more questions on getting the Pong2Player code running

Pong2Player environment usage

I believe this is useful if you want to perform clipping of rewards. You could also do `reward = reward`, it should work as well

Pong2Player environment usage

Yes, I believe it is within the range of -1 and 1. The values, decided by the rom used, should be as described in the paper "Multiagent Cooperation and Competition...

Pong2Player environment usage

This is actually an implementation of the Xitari2Player environment. You can see the full list of actions at https://github.com/choo8/Xitari2Player/blob/master/ale_interface.hpp. I only included the 4 relevant actions in the training script.

Pong2Player environment usage

According to the paper, a game of Pong ends when 21 points is scored by either agent. Epochs are determined by number of iterations, where 250000 iterations would equal to...