Jonas Mueller comments

Results: 180 comments by Jonas Mueller

Simpler or current model I should use to predict probabilities?

I'd recommend the current fine-tuned sentence transformer.

Upgrade CI from deprecated macOS-12

Related issue: https://github.com/cleanlab/cleanlab/issues/1111

Formatting Object Detection Labels - Multiple Predictions but Empty Labels

This should be addressed once this PR is merged: https://github.com/cleanlab/cleanlab/pull/1235

undo version caps for sklearn and huggingface hub in docs/requirements.txt

To unpin sklearn, we must first wait for the xgboost package to release a new version that fixes this issue: https://github.com/dmlc/xgboost/issues/11093

Active learning for segmentation

The `get_active_learning_scores()` method is currently designed only for classification tasks.
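For readers unfamiliar with what a per-example active learning score looks like in the classification setting, here is a minimal sketch. It is not cleanlab's implementation of `get_active_learning_scores()`. It assumes a simple least-confidence heuristic as a stand-in, and the names `least_confidence_scores` and `rank_for_labeling` are hypothetical.

```python
from typing import List, Sequence


def least_confidence_scores(pred_probs: Sequence[Sequence[float]]) -> List[float]:
    """Return one score per example: its highest predicted class probability.

    Lower scores mean the model is less confident about that example.
    This is a stand-in heuristic, not cleanlab's algorithm.
    """
    scores = []
    for row in pred_probs:
        if not row:
            raise ValueError("each row must contain at least one class probability")
        scores.append(max(row))
    return scores


def rank_for_labeling(pred_probs: Sequence[Sequence[float]]) -> List[int]:
    """Return example indices ordered from least to most confident."""
    scores = least_confidence_scores(pred_probs)
    return sorted(range(len(scores)), key=lambda i: scores[i])


if __name__ == "__main__":
    # Toy predicted probabilities for three examples and two classes.
    example_pred_probs = [[0.9, 0.1], [0.55, 0.45], [0.2, 0.8]]
    print(rank_for_labeling(example_pred_probs))
```

A score like this is defined per example with a single class distribution, which is part of why extending it to segmentation (many predictions per image) is not straightforward.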

Can I use CleanLab for a regression task dataset with numerous (>40) numerical and categorical variables?

Yes, you should be able to follow the regression tutorial: https://docs.cleanlab.ai/stable/tutorials/regression.html. See especially the final section: https://docs.cleanlab.ai/stable/tutorials/regression.html#5.-Other-ways-to-find-noisy-labels-in-regression-datasets
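As a rough illustration of the general idea of checking regression labels against model predictions, here is a minimal sketch. It does not use the cleanlab API from the tutorial. It assumes you already have out-of-sample predictions from any model trained on your numerical and categorical features, and the function name `flag_suspect_labels` and the `threshold` parameter are hypothetical.

```python
from statistics import median
from typing import List, Sequence


def flag_suspect_labels(
    labels: Sequence[float],
    predictions: Sequence[float],
    threshold: float = 3.0,
) -> List[int]:
    """Return indices whose absolute residual is unusually large.

    An example is flagged when |label - prediction| exceeds `threshold`
    times the median absolute residual. This is a simple heuristic for
    illustration, not cleanlab's scoring method.
    """
    if len(labels) != len(predictions):
        raise ValueError("labels and predictions must have the same length")
    residuals = [abs(y - p) for y, p in zip(labels, predictions)]
    if not residuals:
        return []
    typical = median(residuals)
    if typical == 0:
        # Most predictions are exact, so any nonzero residual stands out.
        return [i for i, r in enumerate(residuals) if r > 0]
    return [i for i, r in enumerate(residuals) if r > threshold * typical]


if __name__ == "__main__":
    # Toy data: the fourth label disagrees sharply with its prediction.
    y = [1.0, 2.0, 3.0, 40.0, 5.0]
    y_hat = [1.1, 1.9, 3.2, 4.0, 5.1]
    print(flag_suspect_labels(y, y_hat))
```

Flagged examples are only candidates for review: a large residual can also mean the model fits that region of the data poorly.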

Hallucination detection

Thank you for the suggestion. Note that we do offer the [Trustworthy Language Model](https://cleanlab.ai/blog/trustworthy-language-model/), which is designed specifically for hallucination detection.

Relevant tutorials:
- https://help.cleanlab.ai/tutorials/tlm/
- https://help.cleanlab.ai/tutorials/tlm_custom_model/
- https://help.cleanlab.ai/tutorials/tlm_rag/#alternate-low-latencystreaming-approach-use-tlm-to-assess-responses-from-an-existing-rag-system

Hallucination-detection benchmarks in RAG: https://towardsdatascience.com/benchmarking-hallucination-detection-methods-in-rag-6a03c555f063

replace Datalab load/save from pickle to json
