tinker-cookbook Add code-rl recipe with DeepCoder

Open Xiuyu-Li opened this issue 1 week ago • 1 comments

This PR adds an example recipe for reinforcement learning (RL) to solve competitive programming problems using Tinker and the DeepCoder dataset. The recipe is located at tinker_cookbook/recipes/code_rl. The environment uses sandboxing via Sandbox Fusion for security, without introducing any additional dependencies when running in Docker.
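For readers new to this kind of setup, here is a minimal sketch of the check a code-RL environment typically performs: run a generated program against stdin/stdout test cases and turn the result into a scalar reward. This is not the recipe's code. The names `run_candidate` and `binary_reward` are hypothetical, the all-or-nothing reward is an assumption, and a plain local subprocess stands in for Sandbox Fusion, so it provides no isolation and should not be used on untrusted model output.

```python
import subprocess
import sys
import tempfile
from pathlib import Path
from typing import Optional, Sequence, Tuple


def run_candidate(code: str, stdin_text: str, timeout_s: float = 5.0) -> Optional[str]:
    """Run `code` as a Python script with `stdin_text` on stdin.

    Returns captured stdout, or None on timeout or non-zero exit.
    NOTE: a local subprocess is NOT a sandbox; it is only a stand-in
    for a real sandboxed execution service.
    """
    with tempfile.TemporaryDirectory() as tmp:
        script = Path(tmp) / "solution.py"
        script.write_text(code)
        try:
            proc = subprocess.run(
                [sys.executable, str(script)],
                input=stdin_text,
                capture_output=True,
                text=True,
                timeout=timeout_s,
            )
        except subprocess.TimeoutExpired:
            return None
    if proc.returncode != 0:
        return None
    return proc.stdout


def binary_reward(
    code: str,
    tests: Sequence[Tuple[str, str]],
    timeout_s: float = 5.0,
) -> float:
    """Return 1.0 if `code` passes every (stdin, expected_stdout) pair, else 0.0.

    The all-or-nothing shape is an assumption made for this sketch.
    """
    for stdin_text, expected in tests:
        out = run_candidate(code, stdin_text, timeout_s)
        # Compare ignoring leading/trailing whitespace, as many judges do.
        if out is None or out.strip() != expected.strip():
            return 0.0
    return 1.0
```

In an RL loop, a function like `binary_reward` would be called once per sampled completion, with the extracted program as `code` and the problem's test cases as `tests`.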

Nov 10 '25 21:11 Xiuyu-Li

this is pretty neat, i agree

Nov 10 '25 21:11 TieMoulton

Cool!

Nov 17 '25 07:11 joschu

Thanks for adding this! We'll review shortly!

Nov 17 '25 07:11 joschu

Great! I ran the experiment in the README and it seems to be working as expected. Merging now.

Nov 18 '25 18:11 Tiiiger
