Samuel Cahyawijaya comments

Results 31 comments of


                                            Samuel Cahyawijaya

Create dataset loader for XCOPA

XCOPA would fit MCQA format with `id` as idx, `context` as premises, `question` as question, `choices` is a list [choice_1, choice_2], and `label` is the index label from the dataset

Create dataset loader for Indo4B Plus

#self-assign

Create dataset loader for ID Short Answer Grading

I think this one can be framed as `nusantara_pairs` schema with `text_1` denotes question and `text_2` denotes answer, and the label is the score.

Create dataset loader for ID Short Answer Grading

@holylovenia : yes, that would be great! I agree that we need to add a new task for this.

Create dataset loader for Indo4B

#self-assign

Create dataset loader for JV-ID ASR

Closed on https://github.com/IndoNLP/nusa-crowd/pull/240

Create dataset loader for JV-ID ASR

Update the title and description to JV-ID TTS instead of JV-ID ASR

Create dataset loader for SU-ID ASR

Closed on https://github.com/IndoNLP/nusa-crowd/pull/240

Create dataset loader for Indo MultiModal Dataset

Hello, I am closing this one since we already have the other 3 issues for the multimodal dataset.

Create dataset loader for MultiLexNorm

this one can be framed as a paraphrasing task using `nusantara_t2t` schema with `text_1` denotes the original sentence and `text_2` denotes the normalized sentence