UnsupDIS-pytorch

Can you provide a pretrained model?

Open 12BEN34 opened this issue 10 months ago • 0 comments

Training of the alignment network was interrupted with the following error:

CUDA runtime error: an illegal memory access was encountered (700) in apply_lu_factor_batched_magma at /opt/conda/conda-bld/pytorch_1656352463056/work/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:1961
Images loss1 loss2 loss3 total PSNR SSIM
terminate called after throwing an instance of 'c10::CUDAError'
what(): CUDA error: an illegal memory access was encountered
CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
Exception raised from record at /opt/conda/conda-bld/pytorch_1656352463056/work/aten/src/ATen/cuda/CUDAEvent.h:115 (most recent call first):
frame #0: c10::Error::Error(c10::SourceLocation, std::string) + 0x57 (0x7f456f9a4477 in /home/ax/anaconda3/envs/pt/lib/python3.8/site-packages/torch/lib/libc10.so)
frame #1: + 0xd2b58 (0x7f45a97b3b58 in /home/ax/anaconda3/envs/pt/lib/python3.8/site-packages/torch/lib/libtorch_cuda_cpp.so)
frame #2: + 0xd60ce (0x7f45a97b70ce in /home/ax/anaconda3/envs/pt/lib/python3.8/site-packages/torch/lib/libtorch_cuda_cpp.so)
frame #3: + 0x458118 (0x7f45af78b118 in /home/ax/anaconda3/envs/pt/lib/python3.8/site-packages/torch/lib/libtorch_python.so)
frame #4: c10::TensorImpl::release_resources() + 0x175 (0x7f456f987d95 in /home/ax/anaconda3/envs/pt/lib/python3.8/site-packages/torch/lib/libc10.so)
frame #5: + 0x34cf35 (0x7f45af67ff35 in /home/ax/anaconda3/envs/pt/lib/python3.8/site-packages/torch/lib/libtorch_python.so)
frame #6: + 0x681508 (0x7f45af9b4508 in /home/ax/anaconda3/envs/pt/lib/python3.8/site-packages/torch/lib/libtorch_python.so)
frame #7: THPVariable_subclass_dealloc(_object*) + 0x2d5 (0x7f45af9b4825 in /home/ax/anaconda3/envs/pt/lib/python3.8/site-packages/torch/lib/libtorch_python.so)
frame #30: __libc_start_main + 0xf3 (0x7f45fbef9083 in /lib/x86_64-linux-gnu/libc.so.6)
te Aborted (core dumped)
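The error message itself suggests passing CUDA_LAUNCH_BLOCKING=1, since the stack trace above may point at the wrong call while kernels run asynchronously. A minimal sketch of how one might re-run training with that variable set, so the failure is reported at the call that caused it. The helper names `blocking_env` and `rerun_with_blocking` are hypothetical and not part of UnsupDIS-pytorch, and the script path is whatever training script you normally launch.

```python
import os
import subprocess
import sys


def blocking_env(base=None):
    """Return a copy of the environment with synchronous CUDA launches enabled.

    The variable must be set before the CUDA context is created, which is why
    it is passed to a fresh process instead of being set mid-run.
    """
    env = dict(os.environ if base is None else base)
    env["CUDA_LAUNCH_BLOCKING"] = "1"
    return env


def rerun_with_blocking(script, *args):
    """Re-run a training script (path supplied by the caller) in a child
    process with CUDA_LAUNCH_BLOCKING=1 and return its exit code."""
    cmd = [sys.executable, script, *args]
    return subprocess.run(cmd, env=blocking_env(), check=False).returncode
```

Calling `rerun_with_blocking("<your_training_script>.py")` runs slower than normal training, but the resulting trace should name the operation that actually triggered the illegal memory access.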

12BEN34, Apr 16 '24 06:04