Open-Assistant icon indicating copy to clipboard operation
Open-Assistant copied to clipboard

Add: Korean QA dataset

Open CertifiedJoon opened this issue 2 years ago • 0 comments

For: #1157

link to dataset

This repository contains the Python code used to generate the Korean QA dataset. Korean QA is a dataset designed to evaluate the ability of models to perform question answering in korean natural language.

The dataset contains 1.74k instruction and answers, all of which are from Naver Kin, the number one QNA website in korea.

structure - [Instruction, Response, Source, Metadata]

CertifiedJoon avatar Jul 06 '23 18:07 CertifiedJoon