q yao comments

Results: 318 comments by q yao

[Bug] RuntimeError: CUDA error: an illegal memory access was encountered

@zhyxun Could you please provide the reproduction steps and your environment information?

[Bug] RuntimeError: CUDA error: an illegal memory access was encountered

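@zhyxun I can't reproduce this on my side. Could you provide an image that reliably triggers the error? Also, roughly how much data do you have to run before the error occurs?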

[Bug] RuntimeError: CUDA error: an illegal memory access was encountered

@zhyxun Please give https://github.com/InternLM/lmdeploy/pull/3307 a try.

[Bug] RuntimeError: CUDA error: an illegal memory access was encountered

> ```
> lmdeploy - ERROR - async_engine.py:592 - [safe_run] exception caught: GeneratorExit
> ```
> This is usually an error related to the request connection and has little to do with the engine. @AllentDan, could you help take a look?
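
To illustrate why `GeneratorExit` tends to point at the consumer side rather than the producer, here is a minimal sketch in plain Python. It assumes the logged exception comes from a streaming generator being closed before it finished, which the log line itself does not confirm. `stream_tokens` is a hypothetical stand-in, not an lmdeploy function.

```python
def stream_tokens(log):
    """Hypothetical token stream standing in for a streaming response."""
    try:
        for i in range(5):
            yield f"token-{i}"
    except GeneratorExit:
        # Python raises GeneratorExit at the paused `yield` when the
        # consumer closes the generator before it is exhausted.
        log.append("GeneratorExit")
        raise


log = []
gen = stream_tokens(log)
first = next(gen)  # the consumer reads one item
gen.close()        # the consumer goes away early, e.g. a dropped connection
```

The producer did nothing wrong here: the exception is only the notification that nobody is reading the stream any more.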

[Bug] Error Exporting RetinaNet for Single Class Case with CrossEntropyLoss in MMDeploy

Thanks for the report. This looks like a bug introduced by a big refactor, https://github.com/open-mmlab/mmdeploy/pull/1091. Removing either of the slices should be fine.

[Bug] Error Exporting RetinaNet for Single Class Case with CrossEntropyLoss in MMDeploy

> I can create a PR for this patch too if it helps.

Sure, that would be cool.

[Bug] triton.runtime.autotuner.OutOfResources: out of resource: shared memory, Required: 108672, Hardware limit: 101376. Reducing block sizes or `num_stages` may help.

https://github.com/InternLM/lmdeploy/blob/edcdd8e36520b8bf7dbc99feecd2d2822c4cb5ba/lmdeploy/pytorch/kernels/cuda/pagedattention.py#L35
https://github.com/InternLM/lmdeploy/blob/edcdd8e36520b8bf7dbc99feecd2d2822c4cb5ba/lmdeploy/pytorch/kernels/cuda/pagedattention.py#L583
You could try changing `num_stages` to 1 at these locations.
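
For a sense of how far over the limit the kernel is, the figures in the error message can be pulled out and compared. This is a small sketch that only parses the text quoted in the issue title. `parse_shared_memory` is a hypothetical helper, and the sketch does not estimate how much memory a lower `num_stages` would actually save.

```python
import re

# Error text copied from the issue title above.
MESSAGE = (
    "out of resource: shared memory, Required: 108672, "
    "Hardware limit: 101376. Reducing block sizes or `num_stages` may help."
)


def parse_shared_memory(message):
    """Return (required_bytes, limit_bytes) parsed from the error text."""
    match = re.search(r"Required: (\d+), Hardware limit: (\d+)", message)
    if match is None:
        raise ValueError("message does not contain shared-memory figures")
    return int(match.group(1)), int(match.group(2))


required, limit = parse_shared_memory(MESSAGE)
overshoot = required - limit  # bytes requested beyond the hardware limit
```

Here the request exceeds the limit by 7296 bytes, roughly 7% over.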

[Bug] For VLMs, is using multiple LoRA adapters supported?

If the adapter modifies the vision part, that is not supported yet.

pipeline warmup

`lmdeploy serve api_server` + `profile_restful_api.py` failed with log:
```
2025-05-14 19:54:29,323 - lmdeploy - ERROR - async_engine.py:599 - [safe_run] exception caught: AttributeError 'NoneType' object has no attribute 'get'
```

[Bug] lmdeploy - ERROR - base.py:53 - RuntimeError: Internal Triton PTX codegen error: ptxas fatal : Value 'sm_120' is not defined for option 'gpu-name'

We run some environment checks before starting the engine. As the log indicates, the triton package check failed. Our triton check is a simple vector add kernel: https://github.com/InternLM/lmdeploy/blob/05914be1e0ef3ceb9d0ce37ca4912bc3ec2e2864/lmdeploy/pytorch/check_env/triton_custom_add.py#L28-L33 Update your...
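
The general shape of such a check can be sketched as a smoke test: run a tiny kernel on known inputs, compare against the expected result, and report any exception instead of crashing later. The code below is an illustration only and is not lmdeploy's check. `vector_add` is a plain-Python stand-in for the Triton kernel, and `check_kernel` and `broken_kernel` are hypothetical names.

```python
def vector_add(a, b):
    """Plain-Python stand-in for a vector add kernel."""
    if len(a) != len(b):
        raise ValueError("inputs must have the same length")
    return [x + y for x, y in zip(a, b)]


def check_kernel(kernel):
    """Run `kernel` on a tiny input and return (ok, detail)."""
    a = [1.0, 2.0, 3.0]
    b = [10.0, 20.0, 30.0]
    expected = [11.0, 22.0, 33.0]
    try:
        result = kernel(a, b)
    except Exception as exc:
        # A compile or launch failure surfaces here, before any real work.
        return False, f"{type(exc).__name__}: {exc}"
    if result != expected:
        return False, "unexpected result"
    return True, "ok"


def broken_kernel(a, b):
    """Simulates a toolchain that cannot target the installed GPU."""
    raise RuntimeError(
        "ptxas fatal : Value 'sm_120' is not defined for option 'gpu-name'"
    )


ok_status = check_kernel(vector_add)
bad_status = check_kernel(broken_kernel)
```

A failing check of this kind says the toolchain cannot build or run even a trivial kernel, so the problem lies in the environment rather than in any model-specific code.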