Yunlin Mao
Yunlin Mao
- [x] Add `CLIP_benchmark` - [x] zero-shot retrieval evaluation - [x] zero-shot classification evaluation - [ ] Add LLM image caption and embedding retrieval support. - [ ] Add end-to-end...
## Description To integrate ModelScope inference API endpoints for both Embeddings, LLMs and ChatModels, install the package `langchain-modelscope-integration` (as discussed in issue #28928 ). This is necessary because the package...
## 功能描述 / Feature Description Support custom evaluation metrics ## 需求背景 / Background 为什么需要这个功能? / Why is this feature needed? ## 预期行为 / Expected Behavior 这个功能应该如何工作? / How should this...
## English Version ### Planned Benchmarks Support #### 1. Agent - [x] 𝜏²-Bench #959 - [ ] Terminal-Bench #### 2. Code - [ ] Multi-E - [x] SciCode - [x]...