weiminw comments

Results 14 comments of


                                            weiminw

How to get a new fresh state machine instance which is distributed?

@jvalkeal any update on this?

OpenAI Tools / function calling v2

hope this feature release as soon as possible.

How to action when output isn't finished

@wenfengwang Do you get the solution at last. or Is there way to get full content when user input word "continue"?

部署了Qwen1.5-32B-Chat-GPTQ-Int4可以运行，但出现了CUDA extension not installed，推理速度很慢

32B-Chat-AWQ 在A100 40G上跑回复差不多相同的内容,时间大约是20秒, 14B-Chat-AWQ 在4090 24G上跑, 回复差不多6秒内. 是不是我需要做什么配置才能让32B-Chat-AWQ 推理速度快一些?

BGE-M3如何在RAG应用中使用Hybrid Retrieve

我能想到的使用流程大概是这样的, 对原始的文档, 通过BGE-M3 进行向量化, 由于M3 可以同时返回dense 和 sparse embeding. 将两种向量同时存入miluvs中不同的列, 检索的时候, 使用M3 将query同时向量化dense和sparse 同时进行检索. 获得dense+ sparse 再进行rerank. 这个思路是否正确?

BGE-M3如何在RAG应用中使用Hybrid Retrieve

非常感谢您的回复，祝新年快乐，bge越来越好

是否有多语言的embedding 模型支持

> 您好，多语言在开发中，大概还有1个月的时间发布下一版本。多语言版本是否有了? 请问是哪一个呢? 期待您的回复

More language support?

need Chinese support

Yi-34B-Chat-4bit 无法配合langchain作为agent使用

> 这是因为目前的Yi-Chat模型暂不支持funcation call 我在prompt中提示，如果没有工具，则直接回答，好像34bchat 没有完全遵从prompt。如何能让34bchat能严格遵从指令呢

200K 上下文的模型什么时候能放出 chat 模型呢？

同求Yi-34B-200K-Chat 4bits 模型。万分感激