Kalyan Chakravarthy Thadaka issues

Results 39 issues of


                                            Kalyan Chakravarthy Thadaka

Implement Accuracy Drop for Robustness and Bias Tests

This implementation involves comparing the ground truth vs. expected result and the ground truth vs. actual result, where the actual result is derived from a perturbed version of the original...

⭐ Feature

Image-Text-to-Text Support in Transformers Pipeline

### Feature request Implement the new feature to support a pipeline that can take both an image and text as inputs, and produce a text output. This would be particularly...

Feature request

Feat/implement mts dialog based clinical summary evaluation

This pull request introduces several changes to the `langtest` module, primarily focusing on enhancing functionality and improving code structure. The most important changes include the addition of dialogue-related columns, the...

update: enhancing by migrating pydantic v1 basemodel to v2

Migrate BaseModel` Usage from Pydantic v1 to Latest Version in Sample Classes

**Current State** The `langtest` repository currently uses `pydantic.v1.BaseModel` from Pydantic v1 across its sample classes for data modeling and validation. With the release of Pydantic v2, several API changes and...

💡Enhancements

Implement MTS-Dialog-Based Clinical Summary Evaluation

**Description:** This issue aims to integrate the **MTS-Dialog** dataset into the LangTest framework, enabling clinical summarization evaluation. The goal is to support structured, medically accurate summarization assessments using this domain-specific...

⭐ Feature

Explore the ML Commons Benchmarks

**Background**: [MLCommons ](https://ailuminate.mlcommons.org/benchmarks/) is a global AI engineering consortium that focuses on improving **accuracy**, **safety**, **speed**, and **efficiency** of AI systems through open collaboration and standardized benchmarks. Their mission includes...

Refactoring the architecture of the LangTest

* reforms the architecture * introduce the modular approach

⭐ Feature

🔧 Refacto

Implement the ChatDoctor DataSet Support for Clincial

ChatDoctor

⭐ Feature