pgai [Feature]: jina-clip-v1

[Feature]: jina-clip-v1

Open ppaanngggg opened this issue 3 months ago • 5 comments

What problem does the new feature solve?

jina-clip-v1 is the best multi-modal embedding model now.

What does the feature do?

It can be used to build better image retrieval application.

Implementation challenges

According to the api https://jina.ai/?sui&model=jina-clip-v1

We need to pass plain text as

{
  "text": "A blue cat"
}

or image from url or base64 encoded as

{
  "image": "https://i.pinimg.com/600x315/21/48/7e/21487e8e0970dd366dafaed6ab25d8d8.jpg"
},
{
  "image": "R0lGODlhEAAQAMQAAORHHOVSKudfOulrSOp3WOyDZu6QdvCchPGolfO0o/XBs/fNwfjZ0frl3/zy7////wAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACH5BAkAABAALAAAAAAQABAAAAVVICSOZGlCQAosJ6mu7fiyZeKqNKToQGDsM8hBADgUXoGAiqhSvp5QAnQKGIgUhwFUYLCVDFCrKUE1lBavAViFIDlTImbKC5Gm2hB0SlBCBMQiB0UjIQA7"
}

Are you going to work on this feature?

🆘 No, could someone else please consider working on it?

Oct 31 '24 14:10 ppaanngggg

pgai pgai copied to clipboard

[Feature]: jina-clip-v1

What problem does the new feature solve?

What does the feature do?

Implementation challenges

Are you going to work on this feature?

pgai
pgai copied to clipboard