cudaErrorNoKernelImageForDevice: no kernel image is available for execution on the device

Open hurui79 opened this issue 3 years ago • 6 comments

Problem description

An error is raised during training.

This log file path is D:\paddlex_workspace\projects\P0006\T0030\err.log
Note: entries marked WARNING/INFO are only warnings or informational messages, not errors.
D:\Download\PaddleX_GUI_2.1.0_win10\paddle\tensor\creation.py:130: DeprecationWarning: np.object is a deprecated alias for the builtin object. To silence this warning, use object by itself. Doing this will not modify any behavior and is safe. Deprecated in NumPy 1.20; for more details and guidance: https://numpy.org/devdocs/release/1.20.0-notes.html#deprecations
  if data.dtype == np.object:
Process Process-1:2:
Traceback (most recent call last):
  File "multiprocessing\process.py", line 297, in _bootstrap
  File "multiprocessing\process.py", line 99, in run
  File "paddlexui\pms\model_tasks\tasks.py", line 73, in _call_paddlex_train
  File "paddlexui\pms\model_tasks\train\detection.py", line 224, in train
  File "D:\Download\PaddleX_GUI_2.1.0_win10\paddlex\cv\models\detector.py", line 1129, in __init__
    dcn_v2_stages=dcn_v2_stages)
  File "D:\Download\PaddleX_GUI_2.1.0_win10\paddlex\cv\models\detector.py", line 101, in _get_backbone
    backbone = getattr(ppdet.modeling, backbone_name)(**params)
  File "D:\Download\PaddleX_GUI_2.1.0_win10\paddlex\ppdet\modeling\backbones\resnet.py", line 528, in __init__
    lr=1.0))
  File "D:\Download\PaddleX_GUI_2.1.0_win10\paddlex\ppdet\modeling\backbones\resnet.py", line 69, in __init__
    bias_attr=False)
  File "D:\Download\PaddleX_GUI_2.1.0_win10\paddle\nn\layer\conv.py", line 656, in __init__
    data_format=data_format)
  File "D:\Download\PaddleX_GUI_2.1.0_win10\paddle\nn\layer\conv.py", line 135, in __init__
    default_initializer=_get_default_param_initializer())
  File "D:\Download\PaddleX_GUI_2.1.0_win10\paddle\fluid\dygraph\layers.py", line 424, in create_parameter
    default_initializer)
  File "D:\Download\PaddleX_GUI_2.1.0_win10\paddle\fluid\layer_helper_base.py", line 378, in create_parameter
    **attr._to_kwargs(with_initializer=True))
  File "D:\Download\PaddleX_GUI_2.1.0_win10\paddle\fluid\framework.py", line 3137, in create_parameter
    initializer(param, self)
  File "D:\Download\PaddleX_GUI_2.1.0_win10\paddle\fluid\initializer.py", line 366, in __call__
    stop_gradient=True)
  File "D:\Download\PaddleX_GUI_2.1.0_win10\paddle\fluid\framework.py", line 3167, in append_op
    kwargs.get("stop_gradient", False))
  File "D:\Download\PaddleX_GUI_2.1.0_win10\paddle\fluid\dygraph\tracer.py", line 45, in trace_op
    not stop_gradient)
SystemError: (Fatal) Operator gaussian_random raises an class thrust::system::system_error exception. The exception content is :parallel_for failed: cudaErrorNoKernelImageForDevice: no kernel image is available for execution on the device. (at ..\paddle\fluid\imperative\tracer.cc:221)

Reproduction

  1. Please provide the error message and the related log (see FAQ Q2 for how to find the log): image

  2. Please provide the GUI version you are using: 2.1.0

  3. Please provide your operating system (e.g. Linux/Windows/MacOS): Windows 10

  4. Which CUDA/cuDNN versions are you using? CUDA 11.1, cuDNN 8.11

hurui79 avatar Mar 11 '22 07:03 hurui79

image Partial screenshot of the pip list output. image Screenshot of run_check.

Additional note: the official dataset produces the same error.

hurui79 avatar Mar 11 '22 07:03 hurui79

image The training parameters are shown above.

hurui79 avatar Mar 11 '22 07:03 hurui79

Your CUDA and cuDNN versions appear to meet the requirements. Could you first check whether PaddleX runs correctly in API mode? https://github.com/PaddlePaddle/PaddleX

image

FlyingQianMM avatar Mar 23 '22 03:03 FlyingQianMM

> Your CUDA and cuDNN versions appear to meet the requirements. Could you first check whether PaddleX runs correctly in API mode? https://github.com/PaddlePaddle/PaddleX

image

...Is there any other way? The whole reason I chose PaddleX is that its GUI is simple and easy to use.

hurui79 avatar Mar 23 '22 03:03 hurui79

Which GPU are you using? Judging by the error, it looks like a CUDA version incompatibility.
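
To make the suspected incompatibility concrete: `cudaErrorNoKernelImageForDevice` means the installed binary contains no compiled kernel that the GPU can run. The following is a minimal sketch of that rule, not PaddleX or Paddle code. The function names, the list of built architectures, and the sample GPU lines are all hypothetical; the reporter's actual GPU appears only in a screenshot. It assumes the GPU list was obtained with something like `nvidia-smi --query-gpu=name,compute_cap --format=csv,noheader`, which recent drivers support.

```python
from typing import Iterable, List, Tuple

Capability = Tuple[int, int]  # (major, minor), e.g. (8, 6) for compute capability 8.6


def parse_compute_caps(smi_output: str) -> List[Tuple[str, Capability]]:
    """Parse lines such as 'Tesla T4, 7.5' into (name, (major, minor)) pairs."""
    gpus = []
    for line in smi_output.strip().splitlines():
        if not line.strip():
            continue
        # Split on the last comma so GPU names containing commas still parse.
        name, cap = (part.strip() for part in line.rsplit(",", 1))
        major, minor = cap.split(".")
        gpus.append((name, (int(major), int(minor))))
    return gpus


def has_kernel_image(
    device: Capability,
    sass_archs: Iterable[Capability],
    ptx_archs: Iterable[Capability],
) -> bool:
    """Return True if a binary built for the given architectures can run on `device`."""
    # Machine code (SASS) built for sm_XY runs on devices with the same
    # major version X and a minor version >= Y.
    if any(device[0] == arch[0] and device[1] >= arch[1] for arch in sass_archs):
        return True
    # Embedded PTX for compute_XY can be JIT-compiled for any device >= XY.
    return any(device >= arch for arch in ptx_archs)


# Hypothetical build: SASS for sm_60, sm_70 and sm_75, with no embedded PTX.
BUILT_SASS = [(6, 0), (7, 0), (7, 5)]
BUILT_PTX: List[Capability] = []

# Illustrative sample input, not taken from the report.
sample = "NVIDIA GeForce RTX 3060, 8.6\nTesla T4, 7.5\n"
for name, cap in parse_compute_caps(sample):
    status = "ok" if has_kernel_image(cap, BUILT_SASS, BUILT_PTX) else "no kernel image"
    print(f"{name} (compute capability {cap[0]}.{cap[1]}): {status}")
```

If a check like this reports no matching kernel image for your card, the usual remedy is a framework build compiled for that GPU architecture rather than a different cuDNN version.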

lailuboy avatar Apr 18 '22 11:04 lailuboy

> Which GPU are you using? Judging by the error, it looks like a CUDA version incompatibility.

image @lailuboy

hurui79 avatar Apr 26 '22 03:04 hurui79