Samuel Cahyawijaya
Samuel Cahyawijaya
XCOPA would fit MCQA format with `id` as idx, `context` as premises, `question` as question, `choices` is a list [choice_1, choice_2], and `label` is the index label from the dataset
#self-assign
I think this one can be framed as `nusantara_pairs` schema with `text_1` denotes question and `text_2` denotes answer, and the label is the score.
@holylovenia : yes, that would be great! I agree that we need to add a new task for this.
#self-assign
Closed on https://github.com/IndoNLP/nusa-crowd/pull/240
Update the title and description to JV-ID TTS instead of JV-ID ASR
Closed on https://github.com/IndoNLP/nusa-crowd/pull/240
Hello, I am closing this one since we already have the other 3 issues for the multimodal dataset.
this one can be framed as a paraphrasing task using `nusantara_t2t` schema with `text_1` denotes the original sentence and `text_2` denotes the normalized sentence