Open-Assistant
Open-Assistant copied to clipboard
Add: Korean QA dataset
For: #1157
This repository contains the Python code used to generate the Korean QA
dataset. Korean QA is a dataset designed to evaluate the ability of models to
perform question answering in korean natural language.
The dataset contains 1.74k instruction and answers, all of which are from Naver Kin, the number one QNA website in korea.
structure - [Instruction, Response, Source, Metadata]