paper-end2end-dl

Replicating experiments with CUDA version > 8.0

Open • zehanort opened this issue 4 years ago • 3 comments

Hello Chris!

I am trying to replicate your first case study, on OpenCL heterogeneous mapping, using CUDA. When I import tensorflow from the Python 3 CLI on my machine, everything works as expected and my version of the CUDA library (libcudart.so.11.2.72) is loaded. When I import tensorflow from inside one of your Jupyter notebooks (which are launched from inside the virtualenv you create using make run), I get an ImportError saying that the file libcudart.so.8.0 could not be found.

As far as I understand, the tensorflow version used in your virtualenv explicitly requires CUDA 8.0. Is that right? Is there any way to make it work with my CUDA version? I tried a workaround with symlinks, but there seem to be deeper dependencies and it still does not work.

Thanks a lot in advance!

zehanort, Jan 05 '21 14:01
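The mismatch described above (one tensorflow in the system Python, another in the notebook's virtualenv) can be checked from both places with a short diagnostic. The following is only a sketch: the function names `can_load_shared_library` and `describe_environment` are made up for illustration, and the two library names are simply the ones quoted in this thread. Running it once from the Python 3 CLI and once in a notebook cell should show which interpreter is in use and which CUDA runtime files the dynamic linker can actually find.

```python
import ctypes
import sys


def can_load_shared_library(soname):
    """Return True if the dynamic linker can load `soname`, else False."""
    try:
        ctypes.CDLL(soname)
    except OSError:
        # Raised when the library (or one of its dependencies) is not found.
        return False
    return True


def describe_environment(sonames=("libcudart.so.8.0", "libcudart.so.11.2.72")):
    """Report the active interpreter and which of `sonames` are loadable.

    The default names are the two files mentioned in this issue; pass
    others to probe a different CUDA installation.
    """
    report = {
        "executable": sys.executable,  # which Python binary is running
        "prefix": sys.prefix,          # points into the virtualenv if one is active
    }
    for soname in sonames:
        report[soname] = can_load_shared_library(soname)
    return report


if __name__ == "__main__":
    for key, value in describe_environment().items():
        print(f"{key}: {value}")
```

If the notebook reports a `prefix` inside the virtualenv and `libcudart.so.8.0` as not loadable, that matches the ImportError above: the virtualenv's tensorflow is looking for a CUDA 8.0 runtime that is not installed.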

Hi Sotiris,

Apologies for my slow response! I haven't touched this codebase in 4 years. If I were you, I would try removing the dated tensorflow from the virtualenv and just using the working tensorflow version on your system, though there may be API changes that cause the tensorflow code to break. Hope that helps.

Cheers, Chris

ChrisCummins, Jan 14 '21 21:01

Hi,

One useful solution is to run this code in a Docker or Singularity container. The image docker://tensorflow/tensorflow:1.14.0-gpu-py3 is available and can execute the device mapping code. Hope it's helpful.

Cheers,

xshaun, Jan 22 '21 03:01
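As a rough illustration of the container suggestion above, the sketch below assembles a `docker run` command line for the image named in that comment. Several details are assumptions rather than anything stated in the thread: the helper name `build_docker_command` is hypothetical, the `/workspace` mount point is arbitrary, and the `--gpus all` flag assumes Docker 19.03 or newer with the NVIDIA Container Toolkit installed on the host. The script only prints the command; it does not start a container.

```python
import shlex
from pathlib import Path

# Image tag taken from the comment above.
IMAGE = "tensorflow/tensorflow:1.14.0-gpu-py3"


def build_docker_command(repo_dir, image=IMAGE, use_gpu_flag=True):
    """Build an argv list that opens a shell in `image` with `repo_dir` mounted.

    `/workspace` is an arbitrary mount point chosen for this example.
    """
    repo_dir = Path(repo_dir).resolve()
    command = ["docker", "run", "--rm", "-it"]
    if use_gpu_flag:
        # Assumes Docker >= 19.03 and the NVIDIA Container Toolkit on the host.
        command += ["--gpus", "all"]
    command += [
        "-v", f"{repo_dir}:/workspace",  # mount the checked-out repository
        "-w", "/workspace",              # start in the mounted directory
        image,
        "bash",
    ]
    return command


if __name__ == "__main__":
    # Print a copy-pasteable command line (shlex.join needs Python 3.8+).
    print(shlex.join(build_docker_command(".")))
```

The point of the container is that the image ships its own CUDA runtime, so the host only needs a compatible NVIDIA driver rather than a matching CUDA toolkit.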

Thanks @xshaun that's a great suggestion!

ChrisCummins, Jan 22 '21 10:01