DeepLearningExamples icon indicating copy to clipboard operation
DeepLearningExamples copied to clipboard

[DLRM/PyTorch] repository name (library/image-machine-DGX-A100) must be lowercase

Open fuhailin opened this issue 1 year ago • 0 comments

Related to Model/Framework(s) In the DeepLearningExamples/PyTorch/Recommendation/DLRM folder

Describe the bug A clear and concise description of what the bug is. I want to follow the tutorial to generate the Criteo dataset, when I build the docker image using the Dockerfile_preprocessing file, I got the error.

To Reproduce Steps to reproduce the behavior:

  1. git clone https://github.com/NVIDIA/DeepLearningExamples.git
  2. cd DeepLearningExamples/PyTorch/Recommendation/DLRM
  3. docker build -t nvidia_dlrm_preprocessing -f Dockerfile_preprocessing . --build-arg DGX_VERSION=DGX-A100

Expected behavior A clear and concise description of what you expected to happen.

# docker build -t nvidia_dlrm_preprocessing -f Dockerfile_preprocessing . --build-arg DGX_VERSION=DGX-A100
[+] Building 0.1s (1/1) FINISHED                                                                                                                                                                                                                                                                               docker:default
 => [internal] load build definition from Dockerfile_preprocessing                                                                                                                                                                                                                                                       0.0s
 => => transferring dockerfile: 3.07kB                                                                                                                                                                                                                                                                                   0.0s
Dockerfile_preprocessing:75
--------------------
  73 |     ENV NUMBER_OF_GPUS 8
  74 |
  75 | >>> FROM image-machine-${DGX_VERSION} AS final
  76 |     RUN echo "spark.worker.resource.gpu.amount    ${NUMBER_OF_GPUS}" >> /opt/spark/conf/spark-defaults.conf
  77 |
--------------------
ERROR: failed to solve: failed to parse stage name "image-machine-DGX-A100": invalid reference format: repository name (library/image-machine-DGX-A100) must be lowercase

Environment Please provide at least:

  • Repo version: commit id = 38934f9f26d76e9d53a456d0a2ea532e86213d66
  • Docker version:
Client: Docker Engine - Community
 Version:           25.0.0
 API version:       1.44
 Go version:        go1.21.6
 Git commit:        e758fe5
 Built:             Thu Jan 18 17:11:07 2024
 OS/Arch:           linux/amd64
 Context:           default

Server: Docker Engine - Community
 Engine:
  Version:          25.0.0
  API version:      1.44 (minimum version 1.24)
  Go version:       go1.21.6
  Git commit:       615dfdf
  Built:            Thu Jan 18 17:10:01 2024
  OS/Arch:          linux/amd64
  Experimental:     false
 containerd:
  Version:          1.6.27
  GitCommit:        a1496014c916f9e62104b33d1bb5bd03b0858e59
 nvidia:
  Version:          1.1.11
  GitCommit:        v1.1.11-0-g4bccb38
 docker-init:
  Version:          0.19.0
  GitCommit:        de40ad0

fuhailin avatar Mar 05 '24 07:03 fuhailin