Richard Li
Hi, thanks for the bug report! I'm not sure switching from bash to a Makefile would really simplify things; my preference would be to fix the root cause. But...
Thanks for the request. Yes, it's definitely possible, and we've discussed providing importers for Swagger, Protobuf, and others. If you have a specific test case or example, we'd love to...
Hi @hello-ashleyintech! Thanks for the prompt response. Yes, I figured all the above out. The other thing that isn't mentioned is how to write your Slack event handler to handle...
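In case it helps someone else, here's a minimal sketch of the kind of event handler I mean, assuming the Python `slack_bolt` package running in Socket Mode; the `app_mention` event and the env var names are illustrative, not from my actual app:

```python
import os

from slack_bolt import App
from slack_bolt.adapter.socket_mode import SocketModeHandler

# Bot token (xoxb-...) for the Web API; app-level token (xapp-...) for Socket Mode.
app = App(token=os.environ["SLACK_BOT_TOKEN"])

@app.event("app_mention")
def handle_mention(event, say):
    # Reply in the channel where the bot was mentioned.
    say(f"You said: {event.get('text', '')}")

if __name__ == "__main__":
    SocketModeHandler(app, os.environ["SLACK_APP_TOKEN"]).start()
```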
Hi @hello-ashleyintech. Thanks for all your help. I've gotten most (?) of this working, but I have one more error I can't figure out. 1. I've created an OAuth settings...
I'm going to close this issue. I do think the documentation for Python can be improved, because there's quite a bit of magic going on behind the scenes that is...
I can trigger this error reliably when sending requests with larger token counts. I've reproduced this on both `meta-llama/Meta-Llama-3-8B-Instruct` and `mistralai/Mistral-7B-Instruct-v0.1`. In my situation, I'm deploying vLLM on a...
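For reference, this is roughly how I trigger it; a sketch only, assuming a vLLM OpenAI-compatible server on localhost:8000 (the URL, model, and prompt length here are illustrative):

```python
import requests

# Pad the prompt to push the token count up; the repetition is arbitrary.
long_prompt = "Summarize the following text. " + ("lorem ipsum dolor sit amet " * 400)

resp = requests.post(
    "http://localhost:8000/v1/completions",
    json={
        "model": "meta-llama/Meta-Llama-3-8B-Instruct",
        "prompt": long_prompt,
        "max_tokens": 256,
    },
    timeout=300,
)
print(resp.status_code, resp.text[:500])
```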
I did some additional experimentation:
* On a 64GB VM, CPU only, I was able to successfully trigger the error with a 351-token prompt.
* On a 128GB VM,...
I've had some success by increasing `ENGINE_ITERATION_TIMEOUT_S`. The offending code appears to be here: https://github.com/vllm-project/vllm/blob/main/vllm/engine/async_llm_engine.py#L630. When the engine takes too long, it times out, but then leaves the engine...
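A sketch of the workaround, assuming the env var name in the code linked above; newer vLLM versions appear to spell it `VLLM_ENGINE_ITERATION_TIMEOUT_S`, so check the version you're running:

```python
import os

# Must be set before vLLM reads it; the default in the linked code is 60 seconds.
os.environ["ENGINE_ITERATION_TIMEOUT_S"] = "180"

from vllm.engine.arg_utils import AsyncEngineArgs
from vllm.engine.async_llm_engine import AsyncLLMEngine

# Illustrative model name; use whatever you're deploying.
engine = AsyncLLMEngine.from_engine_args(
    AsyncEngineArgs(model="meta-llama/Meta-Llama-3-8B-Instruct")
)
```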
Note that GitHub Copilot Chat (https://marketplace.visualstudio.com/items?itemName=GitHub.copilot-chat) is a separate extension from the main code-completion one. I agree it would be super useful; I'm currently looking for a plug-in that does...