
Example notebook with Datalab on a text dataset with SOTA LLM & embeddings model

Open jwmueller opened this issue 1 year ago • 0 comments

Make a version of the Datalab text tutorial (https://docs.cleanlab.ai/stable/tutorials/datalab/text.html) that uses more modern ML models. pred_probs can be produced by a (pretrained) LLM, and features by a recently popular embeddings model.

Recommend using models from Hugging Face. Try to select a dataset where the detected issues are interesting, particularly one where the underperforming group issue is present.
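A rough sketch of what the notebook's core could look like, to make the request concrete. Everything specific here is an assumption rather than a decision: the two model names are placeholders (a zero-shot classification pipeline stands in for "a pretrained LLM"), the helper names `zero_shot_to_pred_probs` and `run_datalab` are hypothetical, and labels are assumed to be integers indexing into `class_names`. Since the zero-shot model is never trained on the dataset, its predictions are out-of-sample, which is what Datalab expects for pred_probs.

```python
def zero_shot_to_pred_probs(outputs, class_names):
    """Reorder zero-shot pipeline outputs into rows aligned with class_names.

    Each output is a dict with "labels" and "scores" sorted by score, so the
    column order differs per example and has to be made consistent.
    """
    rows = []
    for out in outputs:
        score_by_label = dict(zip(out["labels"], out["scores"]))
        rows.append([score_by_label[name] for name in class_names])
    return rows


def run_datalab(
    texts,
    labels,        # assumed: integers 0..K-1 indexing into class_names
    class_names,
    embed_model="sentence-transformers/all-MiniLM-L6-v2",  # placeholder choice
    zero_shot_model="facebook/bart-large-mnli",            # placeholder choice
):
    # Heavy dependencies are imported lazily so the helper above stays light.
    import numpy as np
    from cleanlab import Datalab
    from sentence_transformers import SentenceTransformer
    from transformers import pipeline

    texts = list(texts)

    # features: one embedding vector per text
    features = SentenceTransformer(embed_model).encode(texts)

    # pred_probs: class probabilities from a pretrained model, no fine-tuning
    classifier = pipeline("zero-shot-classification", model=zero_shot_model)
    outputs = classifier(texts, candidate_labels=list(class_names))
    pred_probs = np.asarray(zero_shot_to_pred_probs(outputs, class_names))

    lab = Datalab(data={"text": texts, "label": list(labels)}, label_name="label")
    lab.find_issues(pred_probs=pred_probs, features=features)
    return lab
```

Calling `run_datalab(...)` on a real dataset and then `lab.report()` would show which issue types were detected. Whether the underperforming group check runs by default may depend on the installed cleanlab version, so the notebook should confirm that in the report.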

jwmueller · Jan 02 '25 08:01