main icon indicating copy to clipboard operation
main copied to clipboard

Enhancement: Custom api endpoints

Open FetchFast opened this issue 1 year ago • 4 comments

Suggestion: Enhancement

An example of a custom endpoint would be: LLaMA 13B https://api.runpod.ai/v2/yourServer/runsync

It would:

  1. allow Intellibar to be used in places where the OpenAI API is not available.
  2. cheaper than openai, possibly by a lot

FetchFast avatar Oct 31 '23 01:10 FetchFast

We've been thinking about this idea ourselves.

I'll be glad if more people join the discussion and share their use cases. That will help us with the details, direction, and importance.

astoilkov avatar Oct 31 '23 06:10 astoilkov

Sure. I'd like to

  1. point the endpoint at my serverless runpod llama, because it's vastly cheaper than openAI
  2. point the endpoint to my server running memgpt, loaded with my own reference material to give me an unlimited context window.

Does that sound useful?

FetchFast avatar Nov 01 '23 02:11 FetchFast

Yes, thanks! I hope others join the discussion as well.

astoilkov avatar Nov 01 '23 08:11 astoilkov

+1 Ollama and more external endpoints.

rb81 avatar Apr 27 '24 12:04 rb81

i need openai compatible endpoint to connect my self-hosted models

bikevit2008 avatar Jan 22 '25 00:01 bikevit2008