
Add code-rl recipe with DeepCoder

Open Xiuyu-Li opened this issue 1 week ago • 1 comments

This PR adds an example recipe that uses reinforcement learning (RL) to solve competitive programming problems with Tinker and the DeepCoder dataset. The recipe is located at tinker_cookbook/recipes/code_rl. For security, the environment runs generated code in a sandbox via Sandbox Fusion, which introduces no additional dependencies when running in Docker.
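For readers unfamiliar with this kind of environment, here is a minimal sketch of how a code-RL reward can be computed: run the model's program on each test input and compare its output to the expected output. This is not the recipe's implementation. It swaps in a plain `subprocess` call where the recipe uses Sandbox Fusion, so it provides no real isolation, and the names `TestCase`, `run_candidate`, and `binary_reward`, as well as the all-tests-must-pass scoring rule, are assumptions made for illustration.

```python
import subprocess
import sys
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TestCase:
    """Hypothetical container for one stdin/stdout test pair."""
    stdin: str
    expected_stdout: str


def run_candidate(code: str, stdin: str, timeout_s: float = 5.0) -> Optional[str]:
    """Run `code` in a fresh Python subprocess.

    Returns the captured stdout, or None on timeout or non-zero exit.
    NOTE: a subprocess is NOT a security sandbox; it only stands in for one here.
    """
    try:
        proc = subprocess.run(
            [sys.executable, "-c", code],
            input=stdin,
            capture_output=True,
            text=True,
            timeout=timeout_s,
        )
    except subprocess.TimeoutExpired:
        return None
    if proc.returncode != 0:
        return None
    return proc.stdout


def binary_reward(code: str, tests: List[TestCase]) -> float:
    """Assumed scoring rule: 1.0 only if every test passes, otherwise 0.0."""
    if not tests:
        return 0.0
    for test in tests:
        out = run_candidate(code, test.stdin)
        # Compare whitespace-separated tokens so trailing newlines do not matter.
        if out is None or out.split() != test.expected_stdout.split():
            return 0.0
    return 1.0
```

For example, `binary_reward(program_text, [TestCase("1 2\n", "3\n")])` scores a program that should print the sum of two integers. In a real setup the `run_candidate` step would be replaced by a call into the sandbox service.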

Xiuyu-Li avatar Nov 10 '25 21:11 Xiuyu-Li

this is pretty neat, i agree

TieMoulton avatar Nov 10 '25 21:11 TieMoulton

Cool!

joschu avatar Nov 17 '25 07:11 joschu

Thanks for adding this! We'll review shortly!

joschu avatar Nov 17 '25 07:11 joschu

Great! I ran the experiment in the README and it seems to be working as expected. Merging now.

Tiiiger avatar Nov 18 '25 18:11 Tiiiger