distilabel
Distilabel is a framework for synthetic data generation and AI feedback, built for engineers who need fast, reliable, and scalable pipelines based on verified research papers.
**Is your feature request related to a problem? Please describe.** Both tasks seem to share a lot of logic, so there is some code duplication. **Describe the solution you'd like**...
**Is your feature request related to a problem? Please describe.** We might suffer from downloading unneeded large models. **Describe the solution you'd like** Something like this https://huggingface.co/distilabel-internal-testing/tiny-random-mistral was proposed by...
## Description Add a custom `Step` that runs `DSPy`, even if it's only an example of how to use it via `distilabel` v1.0.0. The step could optimize a prompt from...
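As a rough illustration of the shape such a step could take, here is a minimal sketch in plain Python. The class name `DSPyOptimizeStep`, the `inputs`/`outputs` attributes, and the `process` generator mirror distilabel's `Step` pattern, but the actual `distilabel` and `dspy` imports are omitted; everything here is a hypothetical stand-in, not the proposed implementation.

```python
class DSPyOptimizeStep:
    """Hypothetical step: consumes 'instruction' rows, emits 'optimized_prompt'.

    In a real distilabel Step, this class would subclass distilabel's Step and
    the body of process() would call into DSPy to compile/optimize the prompt.
    """

    inputs = ["instruction"]
    outputs = ["optimized_prompt"]

    def process(self, rows):
        for row in rows:
            # Placeholder for the DSPy optimization call (assumption).
            row["optimized_prompt"] = f"Improved: {row['instruction']}"
        # distilabel steps yield batches of rows rather than returning them.
        yield rows


step = DSPyOptimizeStep()
batch = next(step.process([{"instruction": "write a haiku"}]))
print(batch[0]["optimized_prompt"])
```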
**Is your feature request related to a problem? Please describe.** Async is cool but debugging can be a pain. **Describe the solution you'd like** I would love to have synchronous...
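One common way to expose a synchronous surface over async internals, which may be what this request has in mind, is a thin facade that drives the coroutine to completion. The `agenerate`/`generate` names below are hypothetical, not distilabel's API; this only sketches the wrapping pattern.

```python
import asyncio


async def agenerate(prompt: str) -> str:
    # Stand-in for an async LLM call (hypothetical).
    await asyncio.sleep(0)
    return f"response to: {prompt}"


def generate(prompt: str) -> str:
    # Synchronous facade: run the coroutine to completion so that
    # breakpoints and stack traces behave like ordinary blocking code.
    return asyncio.run(agenerate(prompt))


print(generate("hello"))  # prints "response to: hello"
```

The trade-off is that `asyncio.run` cannot be called from inside an already-running event loop, so such a facade suits scripts and debugging sessions rather than nested async contexts.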
Create a notebook showing an end2end workflow with distilabel to create a preference dataset based on a ~200-page economic document (IMF World Economic Outlook, April 2023). The preference dataset could...
## Which page or section is this issue related to? Currently the code snippet in the vLLM section of the guide (https://distilabel.argilla.io/latest/technical-reference/llms/#vllm) looks like:

```python
llm = vLLM(
    model=LLM(model="argilla/notus-7b-v1"),
    task=TextGenerationTask(),
    ...
```
## Description A high impact task for distilabel is one that generates follow-up turns or multi-turn dialogues (which can then be criticized/ranked). Given a conversation (or at least a...
**Is your feature request related to a problem? Please describe.** In [this PR](https://github.com/argilla-io/distilabel/pull/203), we introduced the `ChatTask` but we want to add as much information to the data we send...
The idea is to set up the Open In Colab and Open GitHub Source as a template overridden feature of the mkdocs template, that should be possible. We have some...
Our current preference pipelines work under the assumption of single-turn (instruction) datasets. To generate high-quality preference data we need to support multi-turn data.
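To make the gap concrete, the difference between the two dataset shapes can be sketched as below. The field names (`instruction`, `messages`, `generations`) and the `last_user_turn` helper are illustrative assumptions about how such rows might look, following the common role/content chat format, not a confirmed distilabel schema.

```python
# Single-turn row: the current assumption, one instruction per example.
single_turn = {
    "instruction": "Explain inflation.",
    "generations": ["candidate A", "candidate B"],  # responses to rank
}

# Multi-turn row: the conversation so far as role/content messages,
# with candidate continuations to criticize/rank.
multi_turn = {
    "messages": [
        {"role": "user", "content": "Explain inflation."},
        {"role": "assistant", "content": "Inflation is a general rise in prices."},
        {"role": "user", "content": "How does it affect savings?"},
    ],
    "generations": ["candidate A", "candidate B"],
}


def last_user_turn(row):
    # Hypothetical helper: the latest user message is what the next
    # generation (and its ranking) should be conditioned on.
    return [m["content"] for m in row["messages"] if m["role"] == "user"][-1]


print(last_user_turn(multi_turn))  # prints "How does it affect savings?"
```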