Transformer跑不通
Transformer.py文件中的out = x + nn.Parameter(self.pe, requires_grad=False).to(self.device)报错:
RuntimeError: CUDA error: device-side assert triggered
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.
这是为什么
Transformer.py文件中的out = x + nn.Parameter(self.pe, requires_grad=False).to(self.device)报错: RuntimeError: CUDA error: device-side assert triggered CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1. Compile with
TORCH_USE_CUDA_DSAto enable device-side assertions. 这是为什么
参考这个看看:https://discuss.pytorch.org/t/how-to-fix-cuda-error-device-side-assert-triggered-error/137553/3