distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
## Description In `distilabel` v1.0.0 we included the task `TextGeneration` and at the last moment we decided to include support for an already formatted chat-like object, i.e. a list...
**Is your feature request related to a problem? Please describe.** The implementation of the structured outputs in #601 only allows for a single structure (json schema, `BaseModel` in case of...
## Description Provide a `distilabel` Docker image in which a pipeline can be executed. This is useful for executing `distilabel` pipelines on cloud providers with serverless solutions. It...
Just wanted to know how to use the Mistral API; I'm a newbie.
[FEATURE] Allow passing path to YAML file containing pipeline runtime parameters in `distilabel run`
**Is your feature request related to a problem? Please describe.** Providing runtime parameters using option `--param` of `distilabel run` can be cumbersome. **Describe the solution you'd like** Add a `--runtime-parameters-path`...
Closes #890
**Describe the bug** I want to use the `UltraFeedback` task in a pipeline, but I already have the dataset, so the pipeline only includes loading the dataset and then passing it...
## Description This PR implements cache at step level. Previously, we computed a signature for a pipeline, and when this signature changed, we recomputed everything. Now the idea is to...
## Description ⚠️ Work in progress. This PR improves the `FormatTextGenerationSFT` task to allow preparing fine-tuning datasets with function calling.