openai-cookbook Initial commit of vector database example with new embeddings

trafficstars

This PR contains a notebook running through an example of using our embeddings to embed Simple Wikipedia and then indexing and searching it in both Weaviate and Pinecone.

Jan 05 '23 09:01 colin-openai

Hi, would you mind adding Qdrant as another option? I can provide a working example similar to the created ones.

Qdrant https://github.com/qdrant/qdrant is a high-performant vector search database written in Rust. The fastest open-source solution available a the moment according to benchmarks. https://qdrant.tech/benchmarks/

Jan 17 '23 12:01 kacperlukawski

Couple more suggestions:

Add installation instructions for pinecone, weaviate, and qdrant_client, as most people won't have them. I think it's especially valuable here since the package names and import names aren't the same. Looks like pinecone-client and weaviate-client and qdrant-client, vs pinecone, weaviate, and qdrant_client.
Add qdrant to the table of contents/outline at the top
After running pip install --upgrade pinecone-client I immediately run into an error when importing it. Not sure why, but I want to figure it out before we merge

Jan 20 '23 01:01 ted-at-openai

^on the error I'm hitting, I've emailed pinecone support and will wait to hear back.

Jan 20 '23 02:01 ted-at-openai

One last suggestion: I think it would be helpful if you precomputed the embeddings, stored them on our CDN, and let people download them, so that they don't have to pay $5 each time they run the example. This is what we've done in some of the other examples. Feel free to pick any file format you like. DM me to discuss how to upload to our CDN and what URL we'll want to give it.

Then, if everything runs on your end, we can merge. Thanks again for all the work on this!

Jan 26 '23 22:01 ted-at-openai

One last suggestion: I think it would be helpful if you precomputed the embeddings, stored them on our CDN, and let people download them, so that they don't have to pay $5 each time they run the example. This is what we've done in some of the other examples. Feel free to pick any file format you like. DM me to discuss how to upload to our CDN and what URL we'll want to give it.

Then, if everything runs on your end, we can merge. Thanks again for all the work on this!

@ted-at-openai these are now resolved

Feb 06 '23 11:02 colin-openai

openai-cookbook openai-cookbook copied to clipboard

Initial commit of vector database example with new embeddings

openai-cookbook
openai-cookbook copied to clipboard