distilabel
distilabel copied to clipboard
[FEATURE] Return generation statistics
Along with the generated dataset, it would be good to return a data structure containing statistics of the generation such as elapsed time, total tokens generated by the labeller, etc.
IMO we can add those within the DatasetCard
to be generated too, as well as in a separate file as statistics.json
or similar.