amazon-sagemaker-examples icon indicating copy to clipboard operation
amazon-sagemaker-examples copied to clipboard

deploy llama-2-13b on inf2 using TorchServe

Open lxning opened this issue 9 months ago • 20 comments

Issue #, if available:

Description of changes: This example demonstrates deploying llama-2-13b on inf2 using TorchServe.

Testing done: Done

Merge Checklist

Put an x in the boxes that apply. You can also fill these out after creating the PR. If you're unsure about any of them, don't hesitate to ask. We're here to help! This is simply a reminder of what we are going to look for before merging your pull request.

  • [X] I have read the CONTRIBUTING doc and adhered to the example notebook best practices
  • [X] I have updated any necessary documentation, including READMEs
  • [X] I have tested my notebook(s) and ensured it runs end-to-end
  • [X] I have linted my notebook(s) and code using black-nb -l 100 {path}/{notebook-name}.ipynb

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

lxning avatar Sep 21 '23 04:09 lxning