Hao Zhang
Hao Zhang
@suquark ? easy fix?
@iRanadheer : Could you describe your use case? If neither CLI nor web server works, what kind of interface do you expect? Recently @suquark adds a feature for openAI-like API,...
openAI-like APIs have been added. Now you can use python's request library to send post to the openai-like endpoint. Closing.
I suppose you are using the web interface? This shouldn't happen when you clear the history. Could you provide more details?
we'll investigate LangChain integration and update in the thread
Please monitor #381
We won't fix this in the short term. Because this is a complicated lack of support in FSDP. It is hard for us to fix FSDP. We suggest you use...
I'm closing this issue because this seems to be a flash attention issue. We'll soon migrate to use the xformer (https://github.com/facebookresearch/xformers) in place of flashattention, as our internal tests show...
is websocket allowed on your sagemaker env?
Are you serving your model to many users? If you have much traffic, the semaphore will allow only 4 users to use. The rest of the users have to wait...