Adding my opinion to this (old) thread: I implemented the formula proposed above, `pi -= (1-valids)*1000`. I also implemented [this paper](https://arxiv.org/pdf/2006.14171.pdf) (see also the related GitHub repo) with `pi =...
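
For anyone landing here later, a minimal sketch of what the `pi -= (1-valids)*1000` formula does. It assumes `pi` holds raw logits and `valids` is a 0/1 mask of the same length. The `masked_softmax` helper and its `penalty` parameter are names made up for illustration, not code from the repo, and it is written in plain Python rather than with tensors:

```python
import math

def masked_softmax(logits, valids, penalty=1000.0):
    """Softmax over logits after pushing invalid actions down by `penalty`."""
    # Same idea as `pi -= (1 - valids) * 1000`: valid entries (mask 1) are
    # left unchanged, invalid entries (mask 0) get a large negative offset.
    masked = [x - (1 - v) * penalty for x, v in zip(logits, valids)]
    # Subtract the max before exponentiating for numerical stability.
    m = max(masked)
    exps = [math.exp(x - m) for x in masked]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical example: the second action is invalid.
probs = masked_softmax([2.0, 1.0, 0.5], [1, 0, 1])
```

With a penalty this large, the invalid action ends up with a probability that is effectively zero, while the valid actions keep their relative ordering.
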
Wouldn't it be better to set the target of dummy points to what the regressor would expect? I mean `register_dummy()` should use `optimizer._gp.predict(params)` as the target.
Please use the following file: https://github.com/cestpasphoto/alpha-zero-general/blob/master/splendor/example_onnx_file.onnx. Regarding the auto-label: **CUDA is NOT involved** here; the issue occurs in a CPU-only environment.