promptfoo
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command...
**Is your feature request related to a problem? Please describe.** Some of the common math-based evaluation metrics for NLP/LLM include ROUGE (already supported), BLEU, METEOR, GLEU, and some others. See...
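For context on what such a metric computes, here is a minimal pure-Python sketch of a BLEU-style score (modified n-gram precision with a brevity penalty). This is an illustration of the idea only, not the full BLEU specification and not promptfoo's implementation:

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """All contiguous n-grams of a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def modified_precision(ref, hyp, n):
    """Clipped n-gram precision: hypothesis n-grams credited at most
    as many times as they appear in the reference."""
    hyp_counts = Counter(ngrams(hyp, n))
    ref_counts = Counter(ngrams(ref, n))
    if not hyp_counts:
        return 0.0
    clipped = sum(min(c, ref_counts[g]) for g, c in hyp_counts.items())
    return clipped / sum(hyp_counts.values())


def bleu(ref, hyp, max_n=2):
    """Geometric mean of 1..max_n precisions times a brevity penalty."""
    precisions = [modified_precision(ref, hyp, n) for n in range(1, max_n + 1)]
    if min(precisions) == 0:
        return 0.0
    log_avg = sum(math.log(p) for p in precisions) / max_n
    # Brevity penalty: punish hypotheses shorter than the reference.
    bp = 1.0 if len(hyp) >= len(ref) else math.exp(1 - len(ref) / max(len(hyp), 1))
    return bp * math.exp(log_avg)
```

An exact match scores 1.0; a hypothesis that is a correct but short prefix of the reference is discounted by the brevity penalty.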
**Is your feature request related to a problem? Please describe.** No **Describe the solution you'd like** I'd like an option to see the output texts on the eval web...
**Is your feature request related to a problem? Please describe.** When using providers, we need to export env variables in our environment to make them work. In...
**Describe the bug** When using the following SQL assert, something fails. **To Reproduce** 1. Add the following assertion to your test suite: ``` - description: "Specific SQL Assertion" vars:...
**Is your feature request related to a problem? Please describe.** I am abusing Promptfoo to perform benchmarking. If the model FAILs, I most often don't want to retry the prompt....
Hello, I deployed promptfoo on a VM using Docker Compose and the pre-built image available at ghcr.io/promptfoo/promptfoo:main However, if I run several evaluations and then explore...
**Is your feature request related to a problem? Please describe.** I am testing a large number of test cases using a specified python metric set. In the case of some...
**Describe the bug** The `context-recall` [prompt](https://github.com/promptfoo/promptfoo/blob/b08099b2c6a7bb32866d63ad9cde7d79f37423ae/src/prompts/external/ragas.ts#L11-L25) is expected by [matchesContextRecall](https://github.com/promptfoo/promptfoo/blob/b08099b2c6a7bb32866d63ad9cde7d79f37423ae/src/matchers.ts#L632-L680) to produce a list of *single*-sentence/line statements, each followed by an [Attributed|NotAttributed] marker, as illustrated in...
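To see why single-line statements matter here, this is a rough sketch of the kind of per-line parsing the matcher assumes (a simplified illustration, not promptfoo's actual code): each output line is expected to end in a marker, and recall is the fraction of statements marked `[Attributed]`. A statement that spans multiple lines leaves its continuation lines without markers, so they are silently dropped from the count:

```python
import re

# Hypothetical marker pattern matching the [Attributed|NotAttributed]
# convention described in the issue.
MARKER = re.compile(r"\[(Attributed|NotAttributed)\]\s*$")


def context_recall_score(output: str) -> float:
    """Fraction of marker-bearing lines labeled Attributed.
    Lines without a trailing marker are ignored entirely."""
    labels = []
    for line in output.splitlines():
        m = MARKER.search(line.strip())
        if m:
            labels.append(m.group(1))
    if not labels:
        return 0.0
    return labels.count("Attributed") / len(labels)
```

With one attributed and one non-attributed statement the score is 0.5; if a statement wraps onto a second line, only the line carrying the marker is counted.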
Add the ability to permalink directly to a result row in the eval view
**Describe the bug** Setting `--grader` on the command line causes a test failure. **To Reproduce** 1. promptfooconfig.yaml: ``` description: 'minimal repo' prompts: - 'test prompt {{input}}' providers: - "openai:chat:gpt-4o" defaultTest: assert: -...