ybalbert001

Results 9 issues of ybalbert001

from FlagEmbedding import BGEM3FlagModel 通过这种方式进行部署推理的,但是压测的时候,发现CPU利用率达到200%+, GPU利用率仅仅2%,T4的卡,有啥建议? [INFO ] WorkerPool - loading model bge_m3_deploy_code (PENDING) on gpu(0) ... -- [INFO ] ModelInfo - S3 url found, start downloading from s3://sagemaker-us-west-2-106839800180/LLM-RAG/workshop/bge-m3-model/ [INFO...

### Search for answers in existing issues - [X] I have searched issues, there is no issue related to the problem I encountered ### Python version python 3.10 ### Issue...

bug

# Checklist: > [!IMPORTANT] > Please review the checklist below before submitting your pull request. - [x] Please open an issue before creating a PR or link to an existing...

size:XL
🔨 feat:tools

# Checklist: > [!IMPORTANT] > Please review the checklist below before submitting your pull request. - [x] Please open an issue before creating a PR or link to an existing...

🐞 bug
size:M
lgtm

### Self Checks - [x] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general). - [x] I have searched for existing...

🐞 bug
tts

### 🚀 The feature, motivation and pitch ``` class KVTransferConfig(BaseModel): """Configuration for distributed KV cache transfer.""" # The KV connector for vLLM to transmit KV caches between vLLM instances. kv_connector:...

feature request
stale

It will always scroll down to the bottom of conversation.

The Desktop version's "New Project" button is wrapped by the conditional logic projects.length > 0, causing it to not display when there are no projects, preventing users from creating new...

**In which component is this bug present?** - [ ] 01-AgentCore-runtime - [x] 02-AgentCore-gateway - [ ] 03-AgentCore-identity - [ ] 04-AgentCore-memory - [ ] 05-AgentCore-tools - [ ] 06-AgentCore-observability...

bug
01-tutorials
02-AgentCore-gateway