mobile_app_open
mobile_app_open copied to clipboard
LLM Dataset Implementation
This issue is a general container for matters relating to datasets in general. Discussions on TinyMMLU or IFEval specifically should go in the sub issues for this one.
Current Status
TinyMMLU
- [x] Dataset is converted from
.parquetto.tfrecordvia a utility script. - [x] Dataset loads
.tfrecordand stores data inside samples. - [x] Dataset provides samples by id to driver/backend in proper format.*
- [x] Dataset Processes output from driver/backend.*
- [x] Dataset calculates and provides accuracy using output data on device.
IFEval
- [x] Dataset is converted from
.jsonlto.tfrecordvia a utility script. - [x] Dataset loads
.tfrecordand stores data inside samples. - [x] Dataset provides samples by id to driver/backend in proper format.*
- [x] Dataset Processes output from driver/backend.*
- [x] Dataset calculates and provides accuracy using output data on device.
* This includes tokenization/detokenization using common SentencePiece utility code.