runningREAL

3 comments by runningREAL

@j9liu Thank you for your reply. It turns out that this feature is not available. No wonder I can't find it. I thought for a moment. I probably can only...

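I ran it with the tutorial's original, unmodified configuration and get the same behavior: a single conversation works, but as soon as there are multiple conversations, the second one gets no reply until the first has finished. With other frameworks, calling the API from multiple conversations fails outright with an error like this: RuntimeError: [address=0.0.0.0:42121, pid=481583] probability tensor contains either `inf`, `nan` or element < 0. This cannot meet concurrency requirements at all. I tested with qwen1.5-1.8b-chat, and now I'm hesitant to keep using it.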

Are there any other recommended projects with RAG and agent support? lccc has gone far too long without an update.