DB-GPT [Bug] LLMServer Generate Error, Please CheckErrorInfo.: CUDA error

[Bug] LLMServer Generate Error, Please CheckErrorInfo.: CUDA error

Open langmansh opened this issue 10 months ago • 1 comments

Search before asking

[X] I had searched in the issues and found no similar issues.

Operating system information

Windows

Python version information

3.10

DB-GPT version

main

Related scenes

[x] Chat Data
[ ] Chat Excel
[ ] Chat DB
[ ] Chat Knowledge
[ ] Model Management
[ ] Dashboard
[ ] Plugins

Installation Information

Device information

CPU and GPU

Models information

chatglm3-6b bge-large-zh-v1.5

What happened

当我向chat data提问的时候，使用cuda的时候报错：ERROR!Model server error!code=1, error msg is LLMServer Generate Error, Please CheckErrorInfo.: CUDA error: device-side assert triggered CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1. Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.

使用cpu的时候报错：ERROR!Model server error!code=1, error msg is LLMServer Generate Error, Please CheckErrorInfo.: index 26602 is out of bounds for dimension 0 with size 8192

What you expected to happen

请问这两个问题如何解决的？

How to reproduce

请问这两个问题如何解决的？

Additional context

No response

Are you willing to submit PR?

[X] Yes I am willing to submit a PR!

Apr 10 '24 03:04 langmansh

This issue has been marked as stale, because it has been over 30 days without any activity.

May 10 '24 21:05 github-actions[bot]

This issue bas been closed, because it has been marked as stale and there has been no activity for over 7 days.

May 18 '24 21:05 github-actions[bot]

我本地部署的chatglm3-6b模型也遇见这样的错误

Jun 05 '24 07:06 zhangkuo-zk

DB-GPT DB-GPT copied to clipboard

[Bug] LLMServer Generate Error, Please CheckErrorInfo.: CUDA error

Search before asking

Operating system information

Python version information

DB-GPT version

Related scenes

Installation Information

Device information

Models information

What happened

What you expected to happen

How to reproduce

Additional context

Are you willing to submit PR?

DB-GPT
DB-GPT copied to clipboard