agenta
The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.
**Describe the bug** Currently, there is no error message when serving the app with incorrect types for the configuration parameters. Although the app is served, it does not display the...
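One way to surface this class of bug is to validate parameter types explicitly before serving and report readable errors instead of failing silently. A minimal sketch, assuming a flat name-to-type schema (the parameter names here are illustrative, not Agenta's actual configuration):

```python
# Hypothetical parameter schema: name -> expected type.
CONFIG_SCHEMA = {"temperature": float, "max_tokens": int, "prompt_template": str}

def validate_config(raw):
    """Return a list of human-readable type errors instead of failing silently."""
    errors = []
    for name, expected in CONFIG_SCHEMA.items():
        if name not in raw:
            errors.append(f"missing parameter '{name}'")
        elif not isinstance(raw[name], expected):
            errors.append(
                f"parameter '{name}' expects {expected.__name__}, "
                f"got {type(raw[name]).__name__}"
            )
    return errors
```

Serving could then abort with the collected messages whenever the list is non-empty, so the user sees exactly which parameter has the wrong type.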
The current human evaluation and annotation view has several limitations that need to be addressed to improve user experience and functionality: 1. **Feedback Types:** Currently, only one feedback type (a...
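Supporting more than one feedback type suggests a small, extensible data model. A sketch under assumed names (the kinds and fields below are illustrative, not Agenta's actual schema):

```python
from dataclasses import dataclass
from enum import Enum

# Hypothetical feedback kinds; adding a new kind is one enum member.
class FeedbackKind(Enum):
    THUMBS = "thumbs"    # binary up/down
    RATING = "rating"    # numeric score
    COMMENT = "comment"  # free-text annotation

@dataclass
class Feedback:
    kind: FeedbackKind
    value: object        # bool for THUMBS, int for RATING, str for COMMENT
    scenario_id: str     # which evaluation scenario this annotates
```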
When errors happen in the evaluation view, there are a couple of problems: - The errors in the LLM app are shown in black and not in red as expected...
**Is your feature request related to a problem? Please describe.** We want to be able to cancel a currently running evaluation job by specifying the evaluation ID and job ID....
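A cooperative-cancellation sketch for this, keyed by `(evaluation_id, job_id)`. The registry and function names are assumptions for illustration, not Agenta's actual backend API; the worker would periodically check its flag and stop when it is set:

```python
import threading

# In-memory registry of cancellation flags, keyed by (evaluation_id, job_id).
_cancel_flags: dict = {}

def register_job(evaluation_id: str, job_id: str) -> threading.Event:
    """Called when a job starts; the worker polls the returned flag."""
    flag = threading.Event()
    _cancel_flags[(evaluation_id, job_id)] = flag
    return flag

def cancel_job(evaluation_id: str, job_id: str) -> bool:
    """Signal a running job to stop; returns False if the job is unknown."""
    flag = _cancel_flags.get((evaluation_id, job_id))
    if flag is None:
        return False
    flag.set()
    return True
```

An HTTP endpoint would translate a cancel request into a `cancel_job(evaluation_id, job_id)` call and return 404 when it yields `False`.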
#### Current Workflow Issue: At present, our workflow for evaluating datasets using the LLM (Large Language Model) application is sequential and less efficient than it could be. The process follows...
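The sequential loop can be parallelized with a worker pool. A minimal sketch, where `run_llm_app` stands in for the real per-row LLM call (an assumption here, replaced by a trivial transform so the shape is visible):

```python
from concurrent.futures import ThreadPoolExecutor

def run_llm_app(row):
    # Placeholder for the real LLM app invocation on one dataset row.
    return {"input": row, "output": row.upper()}

def evaluate_dataset(rows, max_workers=8):
    """Evaluate rows concurrently instead of one after another; order is preserved."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(run_llm_app, rows))
```

Threads fit here because each row is I/O-bound (waiting on an LLM API); `max_workers` caps concurrent requests so rate limits are respected.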
The CLI is a crucial component of Agenta. We plan to conduct unit and integration tests to guarantee the code quality of the CLI and its compatibility with any backend...
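One pattern for unit-testing a CLI without spawning processes is to expose the argument parser as a factory and assert on parsed results. A sketch assuming an argparse-style structure; the `variant serve` subcommand and `--app-name` flag are illustrative stand-ins, not a claim about the real CLI's options:

```python
import argparse

def build_parser():
    """Parser factory mirroring a hypothetical CLI layout, testable in-process."""
    parser = argparse.ArgumentParser(prog="agenta")
    sub = parser.add_subparsers(dest="command", required=True)
    variant = sub.add_parser("variant")
    variant.add_argument("action", choices=["serve", "list"])
    variant.add_argument("--app-name", default=None)
    return parser

def test_variant_serve():
    args = build_parser().parse_args(["variant", "serve", "--app-name", "demo"])
    assert args.command == "variant"
    assert args.action == "serve"
    assert args.app_name == "demo"
```

Integration tests against a live backend would sit on top of this, but parser-level tests already catch flag and subcommand regressions cheaply.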
We need to ensure that the CLI tests start running when a PR is raised. To accomplish this, implement an action workflow to run the CLI tests when a PR...
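A workflow along these lines would trigger the suite on pull requests. This is a sketch only: the directory layout, Python version, and test command are assumptions, not the repository's actual configuration:

```yaml
# .github/workflows/cli-tests.yml (illustrative paths and commands)
name: CLI tests
on:
  pull_request:
    paths:
      - "agenta-cli/**"
jobs:
  test:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.11"
      - run: pip install -e agenta-cli && pip install pytest
      - run: pytest agenta-cli/tests
```

The `paths` filter keeps unrelated PRs from paying for the CLI suite.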
RestrictedPython comes with many limitations that make many use cases infeasible. One solution is to use https://github.com/glotcode/docker-run/ which provides a quick way to create Docker containers to run...
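The container-based approach boils down to wrapping the untrusted code in a locked-down `docker run` invocation. A sketch of building such a command (image name and resource limits are illustrative defaults, not settings from the linked project):

```python
def build_docker_cmd(code: str, image: str = "python:3.11-slim") -> list:
    """Build a `docker run` argv that executes untrusted code in a throwaway container."""
    return [
        "docker", "run",
        "--rm",               # delete the container afterwards
        "--network", "none",  # no network access for untrusted code
        "--memory", "256m",   # cap memory
        "--cpus", "0.5",      # cap CPU
        image,
        "python", "-c", code,
    ]
```

The command would then be executed with something like `subprocess.run(cmd, capture_output=True, timeout=30)`, giving full-language Python inside the sandbox instead of RestrictedPython's restricted subset.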
[Edge case] After editing an evaluator config, new evaluations are not comparable to old evaluations
- We have an evaluator with setting x
- We run the evaluation
- We edit the evaluator setting to y

The results displayed are not very accurate if we display the evaluator...
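One way to handle this edge case is to fingerprint the evaluator settings at run time and store the hash with each evaluation, so results produced under different settings are never treated as comparable. A minimal sketch with illustrative field names:

```python
import hashlib
import json

def config_fingerprint(settings: dict) -> str:
    """Stable short hash of the evaluator settings at the moment of the run."""
    canonical = json.dumps(settings, sort_keys=True)
    return hashlib.sha256(canonical.encode()).hexdigest()[:12]

def comparable(eval_a: dict, eval_b: dict) -> bool:
    """Two evaluations are comparable only if they ran under identical settings."""
    return eval_a["config_hash"] == eval_b["config_hash"]
```

Editing the evaluator from x to y then produces a new fingerprint, and the UI can warn (or refuse to aggregate) when evaluations with mismatched hashes are shown side by side.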