Kalyan Chakravarthy Thadaka
Kalyan Chakravarthy Thadaka
Implement functionality to handle prompts specific to different datasets, enhancing versatility and adaptability of the system for prompt-based tasks across diverse datasets.
**Abstract:** Large language models (LLMs) are powerful tools for natural language processing (NLP) tasks. However, their performance often suffers in low-data scenarios due to limited training data. This project investigates...
**Summary:** This issue proposes the implementation of a leaderboard to compare the performance of different quantization settings (e.g., GGUF 4 bits, GGUF 6 bits, etc.) within LangTest. This leaderboard would...
The current testing framework lacks comprehensive coverage for one-liners, other classes, and the workflow testing process. This deficiency in the test suite has resulted in an increased risk of undetected...
### Description Currently, the Harness system imports test cases from external sources, focusing primarily on robustness and bias categories. This process involves filtering and selectively integrating these specific categories, potentially...
healthcare-related tests: [HPAI-BSC/better-safe-than-sorry · Datasets at Hugging Face](https://huggingface.co/datasets/HPAI-BSC/better-safe-than-sorry)
``` AttributeError: 'NERPrediction' object has no attribute 'label' ```