paper-end2end-dl Replicating experiments with CUDA version

Replicating experiments with CUDA version > 8.0

Open zehanort opened this issue 4 years ago • 3 comments

Hello Chris!

I am trying to replicate your first case study regarding OpenCL heterogeneous mapping and I am trying to use CUDA. When I import tensorflow from the Python 3 CLI on my machine, everything works as expected, loading my version of the CUDA library (libcudart.so.11.2.72), but when I import tensorflow from inside of one of your Jupyter notebooks (which are launched from inside the virtualenv you create using make run) I get an ImportError that the file libcudart.so.8.0 could not be found.

To my understanding, the tensorflow version used in your virtualenv needs CUDA version 8.0 explicitly ( ? ). Is there any workaround for this to work with my CUDA version? Tried a workaround with symlinks, but it seems that there are deeper dependencies and it still does not work.

Thanks a lot in advance!

Jan 05 '21 14:01 zehanort

Hi Sotiris,

Apologies for my slow response! I haven't touched this codebase in 4 years. If I were you I would try removing the dated tensorflow in the virtualenv and just use the working tensorflow version on your system, though there may be API changes that causes the tensorflow code to break. Hope that helps

Cheers, Chris

Jan 14 '21 21:01 ChrisCummins

Hi,

One of the useful solution is to use docker or singularity container to run this code. docker://tensorflow/tensorflow:1.14.0-gpu-py3 is available to execute the device mapping code. hope it's helpful.

Cheers,

Jan 22 '21 03:01 xshaun

Thanks @xshaun that's a great suggestion!

Jan 22 '21 10:01 ChrisCummins

paper-end2end-dl paper-end2end-dl copied to clipboard

Replicating experiments with CUDA version > 8.0

paper-end2end-dl
paper-end2end-dl copied to clipboard