Open-Assistant icon indicating copy to clipboard operation
Open-Assistant copied to clipboard

Using CodeReview StackExchange data with open-bugger

Open caridorc-tergiliti opened this issue 2 years ago • 3 comments

StackExchange CodeReview contains a lot of correct open source code snippets, simple and self-contained, especially if we filter using the beginner tag. We can add bugs to it with open bugger and use it to train. The license of all content there is creative commons.

caridorc-tergiliti avatar Feb 09 '23 16:02 caridorc-tergiliti

assigned to you for now. But we need someone to take this on to help as @caridorc-tergiliti doesn't have time to do this!

huu4ontocord avatar Feb 09 '23 17:02 huu4ontocord

Automating this looks hard, as we need a way to pull code from questions, that contain both code and text, also I suggest using beginner questions that contain less and simpler code: https://codereview.stackexchange.com/questions/tagged/beginner

The beginner questions are only around 7 thousands so this could also be done manually with crowdsourcing, also allowing the people to add the error that happens when trying the run the wrong code to the prompt.

RiccardoRiglietti avatar Feb 10 '23 13:02 RiccardoRiglietti

Hi @caridorc-tergiliti -can you give us a status? is this too hard to do per @RiccardoRiglietti. If so, we can change scope or close this issue. thank you!

huu4ontocord avatar Feb 24 '23 06:02 huu4ontocord