cosmopedia
cosmopedia copied to clipboard
Fantastic work! Is code data considered in Cosmopedia?
Wow, this is super cool work, and thanks for open sourcing everything!! I wonder if cosmopedia tries incorporating code data as seeds to rephrase them into high-quality data? We did some explorations in Magicoder for instruction tuning, but in our case, the "rephrasing" requires a very delicate prompt design, so I am quite excited about this development and would love to know any insights towards rephrasing code instructions.