[feature-request] cuDNN support for HuggingFace image

Open robinsujob opened this issue 3 years ago • 0 comments

Checklist

[x] I've prepended issue tag with type of change: [feature]
[ ] (If applicable) I've documented below the DLC image/dockerfile this relates to
[ ] (If applicable) I've documented the tests I've run on the DLC image
[x] I'm using an existing DLC image listed here: https://docs.aws.amazon.com/deep-learning-containers/latest/devguide/deep-learning-containers-images.html
[ ] I've built my own container based off DLC (and I've attached the code used to build my own image)

Concise Description: When I depoly DallE model from Huggingface, it shown on : 2022-11-30T16:46:08,647 [INFO ] W-model-1-stdout com.amazonaws.ml.mms.wlm.WorkerLifeCycle - mms.service.PredictionException: FAILED_PRECONDITION: Couldn't get ptxas version string: INTERNAL: Couldn't invoke ptxas --version : 400

DLC image/dockerfile: 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-tensorflow-inference:2.6.3-transformers4.17.0-gpu-py38-cu112-ubuntu20.04

Is your feature request related to a problem? Please describe. A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]

Describe the solution you'd like A clear and concise description of what you want to happen. There no nvcc and ptxas on this image. Pls support cuDNN for this docker image.

Describe alternatives you've considered A clear and concise description of any alternative solutions or features you've considered.

Additional context Add any other context or screenshots about the feature request here.

Nov 30 '22 16:11 robinsujob