Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Results: 168 distilabel issues

[Updated: Added suggestion 2. for Ultrafeedback] This is an issue to discuss the defaults for (some) Steps and Tasks. The main idea is to think about the most frequent uses...

enhancement

**Is your feature request related to a problem? Please describe.** 01.AI has recently released their API platform which allows access to models such as Yi-Large, currently in the top 10...

enhancement
good first issue

## Which page or section is this issue related to? As proposed by @davanstrien: in some places we load data from the hub, which might be shown through the embedded...

documentation
good first issue
help wanted

**Describe the bug** I followed the instructions as per the latest documentation: https://distilabel.argilla.io/latest/sections/getting_started/installation/ and ran the code at the quickstart section, but faced some encoding errors. My code and error...

**Is your feature request related to a problem? Please describe.** Within the docs, we do advertise that people can use `Steps` as Standalone components, which prove useful for quick demos,...

enhancement

## Description A section or guide describing the different patterns that can be built in a distilabel pipeline would be useful for users.

documentation

## Which page or section is this issue related to? https://distilabel.argilla.io/latest/components-gallery/steps https://distilabel.argilla.io/latest/components-gallery/tasks https://distilabel.argilla.io/latest/components-gallery/llms (perhaps not needed) ## What are you documenting, or what change are you making in the documentation?...

documentation

**Is your feature request related to a problem? Please describe.** Take a look at [CodecLM: Aligning Language Models with Tailored Synthetic Data](https://arxiv.org/abs/2404.05875) to see if it could be integrated. **Describe...

enhancement
integrations

**Is your feature request related to a problem? Please describe.** We need some integration tests for the LLMs running once a week or before a release to avoid errors with...

enhancement

**Is your feature request related to a problem? Please describe.** Within 1.3 we deprecated this naming as part of #755. **Describe the solution you'd like** N.A. **Describe alternatives you've considered**...

enhancement