continuous-eval

Open-Source Evaluation for GenAI Application Pipelines

Results 11 continuous-eval issues

# Add SQL Metrics Implementation

This pull request introduces the `SQLSyntaxMatch` class, which is designed to evaluate the syntactic similarity between generated SQL queries and a set of ground truth...
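
To make "syntactic similarity against a set of ground truths" concrete, here is a minimal sketch using only the Python standard library. It is not the `SQLSyntaxMatch` implementation from the pull request, whose details are not shown here; the function names `normalize_sql` and `syntax_similarity` are hypothetical, and token-level `difflib` matching is an assumed stand-in for whatever comparison the PR actually uses.

```python
import re
from difflib import SequenceMatcher

# Coarse tokenizer: identifiers/keywords, integers, single-quoted strings,
# and any other single non-space symbol. An assumption for illustration only.
_TOKEN_RE = re.compile(r"[A-Za-z_][A-Za-z0-9_]*|\d+|'[^']*'|[^\sA-Za-z0-9_]")


def normalize_sql(query: str) -> list[str]:
    """Split a query into tokens, lowercasing everything except string literals."""
    tokens = []
    for tok in _TOKEN_RE.findall(query):
        if tok == ";":
            continue  # ignore statement terminators
        tokens.append(tok if tok.startswith("'") else tok.lower())
    return tokens


def syntax_similarity(generated: str, ground_truths: list[str]) -> float:
    """Return the best token-sequence similarity (0.0 to 1.0) over all ground truths."""
    gen_tokens = normalize_sql(generated)
    return max(
        (SequenceMatcher(None, gen_tokens, normalize_sql(gt)).ratio()
         for gt in ground_truths),
        default=0.0,  # no ground truths supplied
    )
```

Taking the maximum over the ground truths means a generated query is scored against its closest reference, which suits cases where several phrasings of the same query are acceptable.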

_This change addresses [review comments](https://github.com/relari-ai/continuous-eval/pull/59#pullrequestreview-2061835523) left by [@yisz](https://github.com/yisz) on PR #59: **Add SQL Metrics Implementation**_

> :warning: We couldn't build/test your project to verify our changes. [Add a Dockerfile](https://docs.ellipsis.dev/build) to...

Issue: Reduce the need for inference providers with continuous-eval. Advantage: Allow users to use multiple LLMs with fallbacks and caching without building the core infrastructure themselves.
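
The fallback-plus-caching behaviour requested above could look roughly like the sketch below. It is only an illustration of the idea, not part of continuous-eval: the `FallbackLLM` class and the `Provider` type are hypothetical, and each provider is assumed to be a plain callable that takes a prompt and returns a completion string.

```python
from typing import Callable, Dict, List, Sequence

# Hypothetical provider interface: prompt in, completion out.
Provider = Callable[[str], str]


class FallbackLLM:
    """Try providers in order and cache successful completions by prompt."""

    def __init__(self, providers: Sequence[Provider]) -> None:
        if not providers:
            raise ValueError("at least one provider is required")
        self._providers = list(providers)
        self._cache: Dict[str, str] = {}

    def complete(self, prompt: str) -> str:
        # Serve repeated prompts from the in-memory cache.
        if prompt in self._cache:
            return self._cache[prompt]

        errors: List[Exception] = []
        for provider in self._providers:
            try:
                result = provider(prompt)
            except Exception as exc:  # any failure moves on to the next provider
                errors.append(exc)
                continue
            self._cache[prompt] = result
            return result

        raise RuntimeError(f"all {len(self._providers)} providers failed: {errors}")
```

An in-memory dictionary keeps the sketch short; a real integration would likely need a persistent or size-bounded cache and a cache key that also covers model parameters.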

enhancement