Sebastian.W

Results 28 issues of Sebastian.W

I was not able to download problems after logging in. I'm running the plugin behind a corporate firewall, and I am sure the proxy setting is working fine. The LeetCode log is as below:...

Thanks for your great work and for kindly open-sourcing it to the public community! I am wondering how to fine-tune (or whatever the correct term is) this model with my own dataset? I...

Can anyone help me with this error? > RuntimeError: CUDA out of memory. Tried to allocate 136.00 MiB (GPU 0; 22.20 GiB total capacity; 21.35 GiB already allocated; 64.12 MiB...

I am using the latest vllm docker image, trying to run the Mixtral 8x7b model quantized in AWQ format. I got the error message below: ``` INFO 12-24 09:22:55 llm_engine.py:73] Initializing...

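I searched the code, and it looks like the Qwen model is currently hard-coded for content generation. Are there plans to support other models, or to integrate other LLM inference services through API calls?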

Currently, the official ollama container image doesn't contain the necessary CUDA libraries. This is really inconvenient when running it on a server. I see you have provided [rocm] images for AMD GPUs,...

### 🥰 Feature Description ChatGPT-like voice chat mode. ### 🧐 Proposed Solution LobeChat already has TTS and STT capabilities. Can we move one step forward and enable the...

Inactive
🌠 Feature Request

### Self Checks - [X] I have searched for existing issues [search for existing issues](https://github.com/langgenius/dify/issues), including closed ones. - [X] I confirm that I am using English to submit this...

💪 enhancement

I noticed that the code is licensed under Apache 2.0, but the dataset and checkpoints are licensed under CC-NC-4.0. As per my understanding, the license forbids usage of the dataset...

**Is your feature request related to a problem? Please describe.** Nowadays, embedding + reranker is the SOTA solution for improving the accuracy of RAG systems. We already have the embedding...

enhancement
up for grabs
roadmap