minyichen

Results: 7 comments by minyichen

@yexing @zhudy Excuse me, I'm facing the same problem. I cloned vLLM into my project and added `nvcc_cuda_version = get_nvcc_cuda_version(CUDA_HOME)` to setup.py at line 268, but I still have the same...
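For anyone hitting the same error: below is a from-memory sketch of the helper that line relies on in older vLLM setup.py files. The helper name and `CUDA_HOME` come from vLLM's own setup.py and torch; treat the body as an approximation, not the exact upstream code.

```
import subprocess
from packaging.version import Version, parse
from torch.utils.cpp_extension import CUDA_HOME

def get_nvcc_cuda_version(cuda_dir: str) -> Version:
    # Run `nvcc -V` from the given CUDA toolkit and parse the
    # "release X.Y" token out of its version banner.
    nvcc_output = subprocess.check_output(
        [cuda_dir + "/bin/nvcc", "-V"], universal_newlines=True
    )
    output = nvcc_output.split()
    release_idx = output.index("release") + 1
    return parse(output[release_idx].split(",")[0])

# The line added around setup.py:268 records the toolkit version so
# the later build checks have something to compare against.
nvcc_cuda_version = get_nvcc_cuda_version(CUDA_HOME)
```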

@abidlabs Hi, I'm facing the same problem. Do you have any suggestions? Thanks in advance!

@ZhuJD-China @JianxinMa My results with the 14B model are also not great: it will pick a tool but never actually use it. Were 7B and 72B given extra training on the ReAct prompt, and can they even support multi-turn ReAct?

```
from langchain.chat_models import ChatOpenAI
from langchain.agents import load_tools, initialize_agent, AgentType
from langchain import SerpAPIWrapper

llm = ChatOpenAI(
    temperature=0,
    # max_tokens=90,
    streaming=...
```

Tested with Qwen-7B: it looks as if the tool is being used, but SERP_API is never actually called.

```
from langchain.chat_models import ChatOpenAI
from langchain.agents import load_tools, initialize_agent, AgentType
from langchain import SerpAPIWrapper

llm = ChatOpenAI(
    temperature=0,
    # max_tokens=90,
    streaming=True,
    openai_api_key="EMPTY",
    openai_api_base="http://localhost:8000/v1",
    model_name="/usr/src/app/model/Qwen-7B-Chat-AWQ",
)
...
```
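For reference, a fully spelled-out version of the snippet above might look like the following. The tool wiring and the final query are my own illustration (SerpAPI needs `SERPAPI_API_KEY` in the environment); `verbose=True` prints the Thought/Action/Observation trace, which is how I checked whether the tool is really invoked.

```
from langchain.chat_models import ChatOpenAI
from langchain.agents import initialize_agent, AgentType, Tool
from langchain import SerpAPIWrapper

# Point LangChain's OpenAI client at the vLLM OpenAI-compatible server.
llm = ChatOpenAI(
    temperature=0,
    streaming=True,
    openai_api_key="EMPTY",
    openai_api_base="http://localhost:8000/v1",
    model_name="/usr/src/app/model/Qwen-7B-Chat-AWQ",
)

# Wrap SerpAPI as a single explicit tool.
search = SerpAPIWrapper()
tools = [
    Tool(
        name="Search",
        func=search.run,
        description="Useful for answering questions about current events.",
    )
]

# Zero-shot ReAct agent; the verbose trace shows whether an Action
# line is ever followed by a real tool call.
agent = initialize_agent(
    tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True
)
agent.run("What is the weather in Taipei today?")
```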

@chuangzhidan In my example the backend is an OpenAI-compatible server set up with vLLM. `openai_api_base` is the endpoint of the port the backend API is served on, and `model_name` is the model name configured at launch time; for OpenAI itself that would be gpt-4, gpt-3.5-turbo, and so on. It is written as '/usr/src/app/model/Qwen-7B-Chat-AWQ' here because, when no model name is configured at launch, vLLM uses the model path as the name by default. If anything is unclear, feel free to ask!
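To make that concrete, here is a minimal sketch of the pairing. The path and served name are placeholders from my setup; `--served-model-name` is the vLLM server option for overriding the default path-based name.

```
# Launch the OpenAI-compatible server with an explicit model name:
#
#   python -m vllm.entrypoints.openai.api_server \
#       --model /usr/src/app/model/Qwen-7B-Chat-AWQ \
#       --served-model-name qwen-7b-chat-awq

from langchain.chat_models import ChatOpenAI

llm = ChatOpenAI(
    openai_api_key="EMPTY",                      # vLLM does not check the key
    openai_api_base="http://localhost:8000/v1",  # the server's endpoint
    model_name="qwen-7b-chat-awq",               # must match --served-model-name
)
```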

@Facico May I ask what the differences are between the three files finetune_chat, finetune, and finetune_deepspeed?

@ninehills Is there any update on this? Or could you tell me in which version of vLLM this issue was resolved? With vLLM 0.4.3, my tests show that the quantized...