torchgfn Implement a uniform forward policy as a random baseline method for easy benchmarking.

Implement a uniform forward policy as a random baseline method for easy benchmarking.

Open josephdviviano opened this issue 1 year ago • 0 comments

Such a policy would not be useful for applications, but maybe good for either verifying that A) your trained GFN is learning (relative to this baseline) or B) reporting a random baseline, in a paper.

Aug 22 '23 22:08 josephdviviano

torchgfn torchgfn copied to clipboard

Implement a uniform forward policy as a random baseline method for easy benchmarking.

torchgfn
torchgfn copied to clipboard