tonic_validate
Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.
Bumps [openai](https://github.com/openai/openai-python) from 1.23.2 to 1.29.0. Release notes, sourced from openai's releases. v1.29.0 (2024-05-13). Full Changelog: v1.28.2...v1.29.0. Features: api: add gpt-4o model (#1417) (4f09f8c). v1.28.2 (2024-05-13). Full Changelog:...
Bumps [litellm](https://github.com/BerriAI/litellm) from 1.35.20 to 1.37.7. Release notes, sourced from litellm's releases. v1.37.7, What's Changed: [Feat] send weekly spend reports by Team/Tag, by @ishaan-jaff in BerriAI/litellm#3609; build(deps): bump next from...
This PR adds support for models available through [OpenRouter](https://openrouter.ai/). Let me know if you find any bugs or issues.
Bumps [pytest](https://github.com/pytest-dev/pytest) from 8.1.1 to 8.2.0. Release notes, sourced from pytest's releases. pytest 8.2.0 (2024-04-27). Deprecations: #12069: A deprecation warning is now raised when implementations of one of the...
Bumps [tonic-textual](https://www.tonic.ai/) from 1.2.1 to 1.4.0. Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a...
I run into this error when I try to set up validate ``` --------------------------------------------------------------------------- ModuleNotFoundError Traceback (most recent call last) Cell In[1], line 6 4 from os import environ 5...
Hello, please add the ability to have a fixed number of runs per question instead of 1 and report average and stdev of all metrics (perhaps min/max or some sort...
Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.3 to 3.1.4. Release notes, sourced from jinja2's releases. 3.1.4: This is the Jinja 3.1.4 security release, which fixes security issues and bugs but does not otherwise...
Hello, could you please add support for more fine-grained metrics, possibly user-definable, like time to get embeddings, time to retrieve chunks from a vector database, time to get first bytes...
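The two feature requests in this list (several runs per question with average and stdev, and per-stage timing) could be prototyped outside the library along the following lines. This is only a rough sketch using the Python standard library: `summarize_runs`, `StageTimer`, `run_repeated`, and the `score_fn` callable are hypothetical names introduced here for illustration and are not part of tonic_validate's API.

```python
import statistics
import time
from collections import defaultdict
from contextlib import contextmanager


def summarize_runs(scores):
    """Return mean, sample stdev, min, and max for one metric's per-run scores."""
    return {
        "mean": statistics.mean(scores),
        # statistics.stdev needs at least two samples; report 0.0 for a single run.
        "stdev": statistics.stdev(scores) if len(scores) > 1 else 0.0,
        "min": min(scores),
        "max": max(scores),
    }


class StageTimer:
    """Collects wall-clock durations (in seconds) for named pipeline stages."""

    def __init__(self):
        self.durations = defaultdict(list)

    @contextmanager
    def stage(self, name):
        start = time.perf_counter()
        try:
            yield
        finally:
            # Record the elapsed time even if the stage raised an exception.
            self.durations[name].append(time.perf_counter() - start)


def run_repeated(score_fn, question, n_runs):
    """Call a hypothetical scoring function n_runs times and summarize each metric.

    score_fn is assumed to take a question and return a dict of metric name to float.
    """
    per_metric = defaultdict(list)
    for _ in range(n_runs):
        for metric, value in score_fn(question).items():
            per_metric[metric].append(value)
    return {metric: summarize_runs(values) for metric, values in per_metric.items()}
```

A caller would wrap each stage, for example `with timer.stage("retrieve"): ...`, and then pass `timer.durations["retrieve"]` to `summarize_runs` to get the same statistics for latency as for the quality metrics.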