Tobias Pitters issues

Results 65 issues of


                                            Tobias Pitters

update entities

Add entities, which are refered in model/model_training/custom_datasets/formatting.py

Implement Dataset Entry for Datasets

We have a new `DatasetEntry` class which helps us to generalize over datasets and enforce a common formatting. We need to implement this class for a couple of more datasets:...

good first issue

Implement RM Mode for DatasetEntry

Currently the `DatasetEntry` class cannot be used for reward model training, and a `NotImplementedError` is thrown, see https://github.com/LAION-AI/Open-Assistant/blob/main/model/model_training/custom_datasets/formatting.py#L88. This should be fixed as soon as possible. The response structure can...

update dataset entry to support list of list type

#2827 In the [PR to introduce RM for the dataset entry class](https://github.com/LAION-AI/Open-Assistant/pull/2867) I forgot that if we have RM, we'll have multiple answers per question so `[Q1, (A1, A12)]` but...

Can't install model_training

Since the commit https://github.com/LAION-AI/Open-Assistant/pull/2206 I can't install model_training anymore. It looks like we try to install `ray` from a private aws artifactory via `trlx`. The error (while just trying to...

bug

return ds entry for instruction

Update instruction datasets to return the `DatasetEntry` class. Also checked with `python check_dataset_appearances.py -d humaneval_mbpp_codegen_qa --cache_dir .cache --mode sft` but did not find anything.

Improve Dataset Entry to add system tag for back-and-forth conversations

Add system tag for each answer in a back and forth conversation. So we have to convert `[Q1, A1, Q2, A2]` to `q1attrib1a1q2attrib2a2` This also includes changing the prompter and...

WIP: add system tag for each question

closes #2972

DatasetEntry Roadmap

This is the suggestion for a major refactoring of the data preprocessing in the model_training `trainer_sft`, `trainer_rm` and `trainer_rl` logic. The crucial point here is 4. but we need the...

Return correct type if taking list of scalar value in aggregation

closes #8683