distilabel icon indicating copy to clipboard operation
distilabel copied to clipboard

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Results 168 distilabel issues
Sort by recently updated
recently updated
newest added

## Description As part of issue https://github.com/argilla-io/distilabel/issues/832, we started adding default structured outputs for tasks that could benefit from it. `InstructionBacktranslation` is a good candidate. PR https://github.com/argilla-io/distilabel/pull/868 can be used...

good first issue
improvement

## Description As part of issue https://github.com/argilla-io/distilabel/issues/832, we started adding default structured outputs for tasks that could benefit from it. `SelfInstruct` is a good candidate. PR https://github.com/argilla-io/distilabel/pull/868 can be used...

good first issue
improvement

**Describe the bug** To simplify the code, if no name is given to a pipeline a random one is generated. **To Reproduce** Code to reproduce ```python from distilabel.pipeline import Pipeline...

bug

**Is your feature request related to a problem? Please describe.** Some users would like to inspect the dataset as it's being created in the Hugging Face Hub. **Describe the solution...

enhancement

**Is your feature request related to a problem? Please describe.** I feel distinguishing between singular and plural names complicates workflows and code, e.g., “generation” and “generations”, and I’ve seen similar...

enhancement
good first issue
help wanted

**Is your feature request related to a problem? Please describe.** Currently, `LLM.generate` return type is constrained to either `str` (assistant message) or `None` (assistant failed generating or something else happened)....

enhancement

**Is your feature request related to a problem? Please describe.** `process` function created with `step` decorator doesn't have a way to access `Step` class attributes. **Describe the solution you'd like**...

enhancement

**Is your feature request related to a problem? Please describe.** I cannot access the input and outputs of a class without initializing it. **Describe the solution you'd like** Given that...

enhancement

I remember we've briefly discussed this in the past and there were some actions, but I think this can have a positive impact on usability. Go from: ```python with Pipeline(...

enhancement