rageval
Evaluation tools for Retrieval-augmented Generation (RAG) methods.
Adjust the `compute` function in tests/units.
@FBzzh @yuanpcr you can list all potential metrics for the `validate` task in this issue. For more details about the `validate` task, you can refer to issue #13.
- add jieba tokenizer
- metrics: F1, Claim Recall, ROUGE-L, and BLEU (see the sketch below)
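As a rough illustration of how a jieba tokenizer could feed a token-level F1 metric, here is a minimal sketch using the standard `jieba` package; the helper name `token_f1` is hypothetical and not part of the rageval API.

```python
from collections import Counter

import jieba


def token_f1(prediction: str, reference: str) -> float:
    """Compute token-level F1 between a prediction and a reference string."""
    pred_tokens = jieba.lcut(prediction)
    ref_tokens = jieba.lcut(reference)
    # Count overlapping tokens (multiset intersection).
    common = Counter(pred_tokens) & Counter(ref_tokens)
    num_same = sum(common.values())
    if num_same == 0:
        return 0.0
    precision = num_same / len(pred_tokens)
    recall = num_same / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


print(token_f1("检索增强生成的评估工具", "用于检索增强生成的评估工具"))
```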
List the most commonly used datasets in RAG research, and we will add them to the benchmarks (see the loading sketch after this list).
- [ ] THUDM/webglm-qa from huggingface: https://huggingface.co/datasets/THUDM/webglm-qa
- [ ] NaturalQuestions from huggingface: https://huggingface.co/datasets/natural_questions...
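A minimal sketch of pulling one of the candidate benchmark datasets with the Hugging Face `datasets` library; the split name follows the dataset card and may need adjusting.

```python
from datasets import load_dataset

# Load the WebGLM-QA test split from the Hugging Face Hub.
webglm_qa = load_dataset("THUDM/webglm-qa", split="test")
print(webglm_qa[0])
```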
Add the [DPR benchmark](https://github.com/facebookresearch/DPR) for ranking, where the model could be implemented with a BERT-based encoder. The embeddings could be [DPR](https://github.com/facebookresearch/DPR) embeddings or [BGE embeddings](https://huggingface.co/BAAI/bge-large-en).
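A minimal sketch of dense-retrieval scoring with BGE embeddings via `sentence-transformers`; the model name and cosine-similarity ranking are assumptions for illustration, not a fixed part of the benchmark.

```python
from sentence_transformers import SentenceTransformer, util

encoder = SentenceTransformer("BAAI/bge-large-en")

query = "who wrote the declaration of independence"
passages = [
    "The Declaration of Independence was primarily drafted by Thomas Jefferson.",
    "The Eiffel Tower is located in Paris, France.",
]

# Encode query and passages into normalized dense vectors.
query_emb = encoder.encode(query, normalize_embeddings=True)
passage_embs = encoder.encode(passages, normalize_embeddings=True)

# Rank passages by cosine similarity (dot product on normalized vectors).
scores = util.cos_sim(query_emb, passage_embs)[0]
ranked = sorted(zip(passages, scores.tolist()), key=lambda x: x[1], reverse=True)
for passage, score in ranked:
    print(f"{score:.3f}  {passage}")
```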
@QianHaosheng @bugtig6351 @yuanpcr you can list all potential metrics for the `generate` task in this issue. For more details about the `generate` task, you can refer to issue #12.
In this issue, we discuss the potential metrics used to evaluate the quality of an input dataset. Dataset quality is very important since there are many automatically generated...
@RZFan525 you can list all potential metrics for the `rank` task in this issue.
@youngbeauty250 you can list all available metrics for the `rewrite` task in this issue.