text-generation-inference icon indicating copy to clipboard operation
text-generation-inference copied to clipboard

Update Dockerfile to use devel image for compatibility

Open YaserJaradeh opened this issue 10 months ago • 9 comments

What does this PR do?

The TGI server fails to start due to missing Python headers during the compilation of Triton indexing kernels. The solution is to change the base image to nvidia/cuda:12.4.1-devel-ubuntu22.04 to match the builder image, ensuring the necessary headers are included. This change increases the image size but resolves the startup issue.

Fixes # (issue) This pull request addresses the issue #2838

Before submitting

  • [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • [x] Did you read the contributor guideline, Pull Request section?
  • [ ] Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
  • [x] Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
  • [ ] Did you write any new necessary tests?

YaserJaradeh avatar Dec 16 '24 13:12 YaserJaradeh