Stockfish CI for Reinforcement Learning

CI for Reinforcement Learning

Open nodchip opened this issue 5 years ago • 3 comments

Currently the CI for compilation and Supervised Learning were setup. We could also need the CI for Reinforcement Learning. i.e. CI for training data generation with an existing net file, and training with the training from an existing net file.

Sep 13 '20 05:09 nodchip

this should be easy to add in tests/instrumented_learn.sh, we just need to generate a variant of learn01.exp ? Is the only change needed to set 'SkipLoadingEval false' ?

Sep 13 '20 05:09 vondele

Is the only change needed to set 'SkipLoadingEval false' ?

In training data generation, we need to load an existing file, and use the raw NNUE eval values. We could need to set SkipLoadingEval to false, set EvalFile to a proper net file path, and set Use NNUE to pure.

In training, we also need to load an existing file, and use the raw NNUE eval values. We could need to set SkipLoadingEval to false, set EvalFile to the one used in the training data generation, and set Use NNUE to pure.

After we setup CI for Reinforcement Learning, CI tests may fail because of https://github.com/nodchip/Stockfish/issues/128. EDIT: I'm investigating https://github.com/nodchip/Stockfish/issues/128.

Sep 13 '20 05:09 nodchip

#128 has been fixed.

Sep 13 '20 10:09 nodchip

Stockfish Stockfish copied to clipboard

CI for Reinforcement Learning

Stockfish
Stockfish copied to clipboard