ArshaanNazir
ArshaanNazir
The markdown code will generate a badge that looks like this:  Clicking on the badge will take users directly to the Colab notebook. It makes it easy to access...
https://arxiv.org/abs/1903.10561
Classification: Banking77 Imdb At least one dataset for sentiment analysis Maybe one from Healthcare domain Retrieval DBPedia FiQA2018 HotpotQA FEVER STS BIOSSES STSBenchmark
PR on langchain for evaluating models using langtest.
https://github.com/BerriAI/litellm
Reference : https://textgeneration.substack.com/p/cognitive-biases-in-llms-as-evaluators?r=2abzqn&utm_campaign=post&utm_medium=web