Qwen3-Coder icon indicating copy to clipboard operation
Qwen3-Coder copied to clipboard

How synthetic data were generated?

Open wasiahmad opened this issue 1 year ago • 1 comments

Hello,

In the technical report, it was mentioned that CodeQwen1.5 was used to generate synthetic data, but there is no further detail. Is it possible to elaborate? For example, what kind of technique is being used for synthetic data generation?

I found this tweet that shares some detail. Is it correct? If yes, what are those code snippets which are involved in instruction generation?

wasiahmad avatar Oct 31 '24 16:10 wasiahmad

Some details can be found in the technical report: https://arxiv.org/abs/2409.12186

huybery avatar Nov 18 '24 03:11 huybery