M. Yusuf Sarıgöz comments

Results: 86 comments of M. Yusuf Sarıgöz

python bindings🐍: Support for accepting list of Input in the encoding methods

Hey @Yossef-Dawoad, with #75 we're now using GGUF, and older model files are no longer usable. Can you please update the notebook?

python bindings🐍: Support for accepting list of Input in the encoding methods

Batch inference is working on the C/C++ side, but it still needs to be exposed to Python. I still have some considerations about how to expose it. The issue is, if we...
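As a rough illustration of what "accepting a list of inputs" could look like on the Python side, here is a minimal sketch. It is not the clip.cpp bindings' API: `encode`, `encode_batch`, and `fake_encode_batch` are hypothetical names, and the batch backend is a toy stand-in for the native call. The only idea shown is accepting either one input or a list and returning a result of the same shape.

```python
from typing import Callable, List, Sequence, Union

Embedding = List[float]


def encode(
    inputs: Union[str, Sequence[str]],
    encode_batch: Callable[[List[str]], List[Embedding]],
) -> Union[Embedding, List[Embedding]]:
    """Accept one input or a list of inputs and mirror that shape in the result."""
    single = isinstance(inputs, str)
    batch = [inputs] if single else list(inputs)
    embeddings = encode_batch(batch)  # hypothetical call into the native batch path
    if len(embeddings) != len(batch):
        raise ValueError("backend returned a different number of embeddings")
    return embeddings[0] if single else embeddings


# Hypothetical stand-in for the native batch call: one 2-d vector per input.
def fake_encode_batch(texts: List[str]) -> List[Embedding]:
    return [[float(len(t)), 1.0] for t in texts]
```

With this shape, `encode("cat", fake_encode_batch)` returns a single vector and `encode(["cat", "bird"], fake_encode_batch)` returns a list of vectors, so existing single-input callers would not need to change.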

not enough space in the context's memory pool (on Apple M1 Max, 32GB RAM, clip-vit-b-32)

It requests ~12 MB instead of the 8 MB that I set as a fixed value here: https://github.com/monatis/clip.cpp/blob/e2eee8e9b11afe4fc9fdb22d1f6d0ea53df9552a/clip.cpp#L24-L30 You can slightly increase them: 8 is for patch32 and 16 is for patch16,...

not enough space in the context's memory pool (on Apple M1 Max, 32GB RAM, clip-vit-b-32)

> That worked!
That's great! I'll try to find the root cause of this difference and patch it later on.
> Not very low-end in the grand scheme of things...

Can u please make exe of this project?

Sure. There's a major refactoring coming soon in #75. I can release binaries shortly after it's merged.

Metal support?

Hi @kchro3, thanks for reaching out! It needs a Conv2D implementation on Metal. I'm currently implementing [LLaVA](https://github.com/haotian-liu/LLaVA/) with llama.cpp and clip.cpp combined (initial release in a few days). The CLIP part...

Metal support?

The WIP PR for the initial release of large multimodal model inference is [in another repo](https://github.com/monatis/lmm.cpp/pull/2). The Metal implementation of Conv2D will require another PR to the main GGML repo after this...

Implement bicubic interpolation

Yes, I upstreamed clip.cpp to llama.cpp for LLaVA, and now there's some functionality that needs to be downstreamed from llama.cpp back to this repo. Currently I don't have enough time...

Implement bicubic interpolation

@lithdew That would be awesome. And feel free to ping me if you need to ask anything about the implementation.

Support image-only

The zsl (zero-shot labeling) example does that. You pass candidate class names with the `--text` argument, and zsl scores the image against those classes.
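For intuition, CLIP-style zero-shot scoring can be sketched as below. This is a generic illustration, not the zsl example's actual code: the function names are made up for this sketch, the embeddings are toy values, and the `logit_scale` of 100 is an assumption based on the scale CLIP models commonly use. Each candidate label's text embedding is compared with the image embedding by cosine similarity, and a softmax turns the scaled similarities into per-label scores.

```python
import math
from typing import Dict, List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def softmax(logits: Sequence[float]) -> List[float]:
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def zero_shot_scores(
    image_embedding: Sequence[float],
    text_embeddings: Sequence[Sequence[float]],
    labels: Sequence[str],
    logit_scale: float = 100.0,  # assumed value, not read from any model file
) -> Dict[str, float]:
    """Score each candidate label against one image embedding."""
    logits = [
        logit_scale * cosine_similarity(image_embedding, text)
        for text in text_embeddings
    ]
    return dict(zip(labels, softmax(logits)))


# Toy 2-d embeddings; real CLIP embeddings have hundreds of dimensions.
scores = zero_shot_scores([1.0, 0.0], [[0.9, 0.1], [0.0, 1.0]], ["a cat", "a dog"])
best_label = max(scores, key=scores.get)
```

The scores sum to 1 across the candidate labels, so they rank the given classes against each other rather than measuring absolute confidence.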