PocketFlow
PocketFlow copied to clipboard
InternalError (see above for traceback): Blas SGEMM launch failed : m=802816, n=64, k=32
When I perform channel pruning on the mobilenet at ilsvrc12 dataset,this error occured. But the pruning at cifar10 dataset can be done normally.
Maybe something related to the GPU memory? https://stackoverflow.com/questions/37337728/tensorflow-internalerror-blas-sgemm-launch-failed
Maybe something related to the GPU memory? https://stackoverflow.com/questions/37337728/tensorflow-internalerror-blas-sgemm-launch-failed
My machine is GTX2080,the GPUmemory is 8G,I dont know if i can finish the pruning...
Could you try solutions provided in the above stack-overflow link, and see if anything helps?
anything I'm sure I only run a tensorflow program at the same time and have reinstalled the tensorflow-gpu,it didn't worked.
Maybe this one? https://stackoverflow.com/a/43130779/10611647
gpu_options = tf.GPUOptions(per_process_gpu_memory_fraction=0.3)
sess = tf.Session(config=tf.ConfigProto(
allow_soft_placement=True, log_device_placement=True))
Could you try solutions provided in the above stack-overflow link, and see if anything helps?
I'm sure I only run a tensorflow program at the same time and have reinstalled the tensorflow-gpu,it didn't worked.
Maybe this one? https://stackoverflow.com/a/43130779/10611647 gpu_options = tf.GPUOptions(per_process_gpu_memory_fraction=0.3) sess = tf.Session(config=tf.ConfigProto( allow_soft_placement=True, log_device_placement=True))
I have tried,although I'm not sure where to put it.

How many GPU cards do you have?
How many GPU cards do you have?
only one...
Try to reduce the batch size?
Try to reduce the batch size?
I have reduced the batch_size_eval to 1
If the error occurs in the training process, then you should reduce FLAGS.batch_size instead of FLAGS.batch_size_eval.
If the error occurs in the training process, then you should reduce FLAGS.batch_size instead of FLAGS.batch_size_eval.
It didn't work...
Any updates? Still not working?
Hey bro, have u figured it out ? I met the same issue
plz if you solve this problem, let me know how to solve it,,,
I encountered the same issue when I run my code at the machine of the GTX2080(the signal GPU memory is 8G, total have two card), the error info as the following:
InternalError (see above for traceback): Blas SGEMM launch failed : m=53290, n=80, k=64
[[node while/AdvInceptionV3/AdvInceptionV3/Conv2d_3b_1x1/Conv2D (defined at /home/suy/.pyenv/versions/mypython3.6/lib/python3.6/site-packages/tensorflow/contrib/layers/python/layers/layers.py:1057) = Conv2D[T=DT_FLOAT, data_format="NHWC", dilations=[1, 1, 1, 1], padding="VALID", strides=[1, 1, 1, 1], use_cudnn_on_gpu=true, _device="/job:localhost/replica:0/task:0/device:GPU:0"](while/AdvInceptionV3/AdvInceptionV3/MaxPool_3a_3x3/MaxPool, while/AdvInceptionV3/AdvInceptionV3/Conv2d_3b_1x1/kernel/Regularizer/l2_regularizer/L2Loss/Enter)]]
[[{{node while/Exit/_791}} = _Recv[client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_4223_while/Exit", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:CPU:0"]()]]
However, I could run the same code at another machine of the GTX2080(the signal GPU memory is 10G, total have two card).
I still don't know why.
I fixed this issue just by installing the patches of CUDA_Toolkit @Donald-Su @0113bernoyoun
I fixed this issue just by installing the patches of CUDA_Toolkit @Donald-Su @0113bernoyoun
Hi ShuteLee, the machine installed the CUDA_Toolkit, but still have the issue
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2017 NVIDIA Corporation
Built on Fri_Sep__1_21:08:03_CDT_2017
Cuda compilation tools, release 9.0, V9.0.176
I fixed this issue just by installing the patches of CUDA_Toolkit @Donald-Su @0113bernoyoun
Hi ShuteLee, the machine installed the CUDA_Toolkit, but still have the issue
nvcc: NVIDIA (R) Cuda compiler driver Copyright (c) 2005-2017 NVIDIA Corporation Built on Fri_Sep__1_21:08:03_CDT_2017 Cuda compilation tools, release 9.0, V9.0.176
Please be sure that you have installed the four PATCHES
https://developer.nvidia.com/cuda-90-download-archive?target_os=Linux&target_arch=x86_64&target_distro=Ubuntu&target_version=1604&target_type=runfilelocal
I fixed this issue just by installing the patches of CUDA_Toolkit @Donald-Su @0113bernoyoun
Hi ShuteLee, the machine installed the CUDA_Toolkit, but still have the issue
nvcc: NVIDIA (R) Cuda compiler driver Copyright (c) 2005-2017 NVIDIA Corporation Built on Fri_Sep__1_21:08:03_CDT_2017 Cuda compilation tools, release 9.0, V9.0.176Please be sure that you have installed the four PATCHES
https://developer.nvidia.com/cuda-90-download-archive?target_os=Linux&target_arch=x86_64&target_distro=Ubuntu&target_version=1604&target_type=runfilelocal
There is not the package for my OS of the ubuntu 18.04
I fixed this issue just by installing the patches of CUDA_Toolkit @Donald-Su @0113bernoyoun
Hi ShuteLee, the machine installed the CUDA_Toolkit, but still have the issue
nvcc: NVIDIA (R) Cuda compiler driver Copyright (c) 2005-2017 NVIDIA Corporation Built on Fri_Sep__1_21:08:03_CDT_2017 Cuda compilation tools, release 9.0, V9.0.176Please be sure that you have installed the four PATCHES https://developer.nvidia.com/cuda-90-download-archive?target_os=Linux&target_arch=x86_64&target_distro=Ubuntu&target_version=1604&target_type=runfilelocal
There is not the package for my OS of the ubuntu 18.04
So, maybe the CUDA Tookit 9.0 is not so compatible with your Ubuntu 18.04. you can choose a more recent version.