Yunlin Mao issues

Results 4 issues of


                                            Yunlin Mao

[WIP] Add multimodal RAG evaluation support

- [x] Add `CLIP_benchmark` - [x] zero-shot retrieval evaluation - [x] zero-shot classification evaluation - [ ] Add LLM image caption and embedding retrieval support. - [ ] Add end-to-end...

docs: add modelscope endpoint

## Description To integrate ModelScope inference API endpoints for both Embeddings, LLMs and ChatModels, install the package `langchain-modelscope-integration` (as discussed in issue #28928 ). This is necessary because the package...

🤖:docs

size:XL

[Feature] Support custom evaluation metrics

## 功能描述 / Feature Description Support custom evaluation metrics ## 需求背景 / Background 为什么需要这个功能？ / Why is this feature needed? ## 预期行为 / Expected Behavior 这个功能应该如何工作？ / How should this...

enhancement

[🎯 Roadmap] EvalScope Roadmap

## English Version ### Planned Benchmarks Support #### 1. Agent - [x] 𝜏²-Bench #959 - [ ] Terminal-Bench #### 2. Code - [ ] Multi-E - [x] SciCode - [x]...