continue icon indicating copy to clipboard operation
continue copied to clipboard

Connection for use sagemaker Endpoint

Open theja0473 opened this issue 10 months ago • 5 comments

Validations

  • [X] I believe this is a way to improve. I'll try to join the Continue Discord for questions
  • [X] I'm not able to find an open issue that requests the same enhancement

Problem

Currently Ollama or any self hosted services only allowed my suggestion lets connect to end point of Sagemaker and serve Copilot use case

Solution

No response

theja0473 avatar Apr 16 '24 02:04 theja0473

Hi @theja0473 can you share more about what API format you expect for the Sagemaker endpoint?

sestinj avatar Apr 19 '24 17:04 sestinj

We are also interested in hosting the model in Sagemaker. The format of the API is:

https://runtime.sagemaker.us-west-2.amazonaws.com/endpoints/my-model-id/invocations

For boto, the documentation of how to invoke the endpoint: https://boto3.amazonaws.com/v1/documentation/api/latest/reference/services/sagemaker-runtime/client/invoke_endpoint.html

The credentials can be provided the same way as you have it implemented for bedrock: https://docs.continue.dev/reference/Model%20Providers/bedrock

mikhail-khodorovskiy avatar Jun 19 '24 21:06 mikhail-khodorovskiy

I am also very interested in hosting a model in Sagemaker. I think the main thing to add (as @mikhail-khodorovskiy pointed out) is credentials handling identical to how Bedrock is done.

tobiasbartsch avatar Jun 24 '24 21:06 tobiasbartsch

We actually were able to import the model we want into Bedrock where the model id but it still does not work: https://github.com/continuedev/continue/issues/1749

mikhail-khodorovskiy avatar Jul 16 '24 22:07 mikhail-khodorovskiy