distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
[Updated: Added suggestion 2. for Ultrafeedback] This is an issue to discuss the defaults for (some) Steps and Task. The main idea is to think about the most frequent uses...
**Is your feature request related to a problem? Please describe.** 01.AI has recently released their API platform which allows access to models such as Yi-Large, currently in the top 10...
## Which page or section is this issue related to? As proposed by @davanstrien. In some places we load data from the hub, which might be shown through the embedded...
**Describe the bug** I followed the instructions as per the latest documentation: https://distilabel.argilla.io/latest/sections/getting_started/installation/ and ran the code at the quickstart section, but faced some encoding errors. My code and error...
**Is your feature request related to a problem? Please describe.** Within the docs, we do advertise that people can use `Steps` as standalone components, which prove useful for quick demos,...
## Description A section or guide describing the different patterns that can be built in a distilabel pipeline would be useful for users.
## Which page or section is this issue related to? https://distilabel.argilla.io/latest/components-gallery/steps https://distilabel.argilla.io/latest/components-gallery/tasks https://distilabel.argilla.io/latest/components-gallery/llms (perhaps not needed) ## What are you documenting, or what change are you making in the documentation?...
**Is your feature request related to a problem? Please describe.** Take a look at [CodecLM: Aligning Language Models with Tailored Synthetic Data](https://arxiv.org/abs/2404.05875) to see if it could be integrated. **Describe...
**Is your feature request related to a problem? Please describe.** We need some integration tests for the LLMs running once a week or before a release to avoid errors with...
**Is your feature request related to a problem? Please describe.** Within 1.3 we deprecated this naming as part of #755. **Describe the solution you'd like** N.A. **Describe alternatives you've considered**...