langtest issues

Results 78 langtest issues

Sort by recently updated

Exploring LLM2LLM for Data Augmentation

**Abstract:** Large language models (LLMs) are powerful tools for natural language processing (NLP) tasks. However, their performance often suffers in low-data scenarios due to limited training data. This project investigates...

chakravarthik27

Preparing Embeddings Benchmarks (LangTest)

ArshaanNazir

Support for GPQA: A Graduate-Level Google-Proof Q&A Benchmark Dataset

Newly introduced benchmark dataset GPQA is a multiple-choice, Q&A dataset of very hard questions written and validated by experts in biology, physics, and chemistry. When attempting questions out of their...

RakshitKhajuria

⭐ Feature

implementation of a leaderboard for different quantizations (gguf 4 vs 6 vs etc bits)

**Summary:** This issue proposes the implementation of a leaderboard to compare the performance of different quantization settings (e.g., GGUF 4 bits, GGUF 6 bits, etc.) within LangTest. This leaderboard would...

chakravarthik27

Enhancing Data Quality Testing for Langtest

As Langtest prioritizes model quality assessment, it is imperative to acknowledge the profound impact of data quality on model performance. Hence, integrating comprehensive data quality testing measures becomes crucial for...

RakshitKhajuria

⭐ Feature

⏭️ Next Release

Explore Cognitive biases in LLMs

Reference : https://textgeneration.substack.com/p/cognitive-biases-in-llms-as-evaluators?r=2abzqn&utm_campaign=post&utm_medium=web

ArshaanNazir

⏭️ Next Release

Explore Context Length Parameter

ArshaanNazir

⏭️ Next Release

langtest
langtest copied to clipboard

Metadata

Exploring LLM2LLM for Data Augmentation

Preparing Embeddings Benchmarks (LangTest)

Support for GPQA: A Graduate-Level Google-Proof Q&A Benchmark Dataset

implementation of a leaderboard for different quantizations (gguf 4 vs 6 vs etc bits)

Enhancing Data Quality Testing for Langtest

Explore litellm

Preparing LLM Benchmark Table ( LangTest)

Explore MS promptbench

Explore Cognitive biases in LLMs

Explore Context Length Parameter

← Metadata

Owner

Metadata

langtest langtest copied to clipboard

Metadata

← Metadata

Owner

Metadata

langtest
langtest copied to clipboard