Qwen3-Coder
Qwen3-Coder copied to clipboard
How synthetic data were generated?
Hello,
In the technical report, it was mentioned that CodeQwen1.5 was used to generate synthetic data, but there is no further detail. Is it possible to elaborate? For example, what kind of technique is being used for synthetic data generation?
I found this tweet that shares some detail. Is it correct? If yes, what are those code snippets which are involved in instruction generation?
Some details can be found in the technical report: https://arxiv.org/abs/2409.12186