Alexander Dibrov
I was going to gather some results to include with the pull request, but the move to new weights broke my llama-13b-4bit. My llama-7b has attention deficit (and with a...
### **Dual Captioning Test** Images were sent to llama-13b-4bit-128g running with the "LLaMA's primary function is to interact with users..." character card loaded and no chat history. **Relevant parameters:** ```...
Reopening the branch, as I closed it by accident. Was trying to clean things up before adding a code update, but I'm clearly still working my way through the GitHub...
So, renaming branches does not work the way I expected - I just wanted a more descriptive name than 'patch-1' for it. Thank you for bearing with me.
### **Image Strips and Plain Language Wrapper** I've been able to further improve results using procedures loosely inspired by [retinotopic mapping](https://en.wikipedia.org/wiki/Retinotopy): **1.** Split input into two horizontal strips and two...
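The strip-splitting step can be pictured roughly as below. This is a minimal sketch in plain Python using a nested list of pixel rows as a stand-in for a real image; the function name, the overlap option, and the strip counts are my illustrative assumptions, not the PR's exact parameters:

```python
def split_into_strips(rows, n_strips=2, overlap=0):
    """Split an image (a list of pixel rows) into n horizontal strips.

    `overlap` keeps extra rows on each side of a cut so that objects
    straddling a boundary are not lost -- an assumption on my part,
    not necessarily what the extension does.
    """
    height = len(rows)
    strip_h = height // n_strips
    strips = []
    for i in range(n_strips):
        start = max(0, i * strip_h - overlap)
        # The last strip always extends to the bottom of the image.
        stop = height if i == n_strips - 1 else min(height, (i + 1) * strip_h + overlap)
        strips.append(rows[start:stop])
    return strips

# Example: a 4-row "image" split into two strips of 2 rows each.
image = [[0, 0], [1, 1], [2, 2], [3, 3]]
top, bottom = split_into_strips(image)
# top == [[0, 0], [1, 1]], bottom == [[2, 2], [3, 3]]
```

Each strip would then be captioned separately, and the per-strip captions merged into the final prompt.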
### **DeepDanbooru Integration and UI Options** This version of the extension can now use [DeepDanbooru](https://github.com/KichangKim/DeepDanbooru) tagging to help generate prompts, and it can do so in any combination with the...
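The "any combination" behaviour can be sketched as a small prompt builder. This is a hypothetical illustration; the function name, flags, and joining format are mine, not the extension's actual option names:

```python
def build_prompt(caption=None, tags=None, use_caption=True, use_tags=True):
    """Combine a captioner's output with DeepDanbooru tags into one prompt.

    Either source can be toggled off independently, mirroring the idea of
    using the two in any combination.  All names here are illustrative.
    """
    parts = []
    if use_caption and caption:
        parts.append(caption)
    if use_tags and tags:
        # DeepDanbooru emits underscore_separated tags; join them readably.
        parts.append(", ".join(t.replace("_", " ") for t in tags))
    return " | ".join(parts)

print(build_prompt("a dog on a beach", ["outdoors", "blue_sky"]))
# a dog on a beach | outdoors, blue sky
```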
I would love to see LangChain support. The application I'd be most interested in is [chatbot memory](https://www.pinecone.io/learn/langchain-conversational-memory/). As it stands, I've written a memory extension that does something resembling their...
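A memory mechanism of that kind can be sketched as a rolling buffer that keeps recent turns verbatim and folds older ones into a running summary. This is a toy illustration of the general idea only; the "summary" here is a naive truncation, whereas LangChain (and my extension) would summarise with a model:

```python
from collections import deque


class ConversationMemory:
    """Keep the last `window` exchanges verbatim; fold older ones into a
    crude running summary.  A toy stand-in for model-based summarisation."""

    def __init__(self, window=3):
        self.recent = deque(maxlen=window)
        self.summary = []

    def add(self, user, bot):
        if len(self.recent) == self.recent.maxlen:
            # This exchange is about to be evicted from the verbatim window;
            # keep only a clipped topic line as its "summary".
            old_user, _ = self.recent[0]
            self.summary.append(old_user[:40])
        self.recent.append((user, bot))

    def context(self):
        """Render the summary plus recent turns as prompt context."""
        lines = []
        if self.summary:
            lines.append("Earlier topics: " + "; ".join(self.summary))
        for user, bot in self.recent:
            lines.append(f"User: {user}")
            lines.append(f"Bot: {bot}")
        return "\n".join(lines)
```

The appeal of the LangChain version is that the summarisation step is itself a model call, so the condensed history stays semantically meaningful instead of just clipped.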
This is perhaps not as fast as performing all model computation on GPU, but you can get a substantial boost to generation rate if you compile llama.cpp with CUBLAS enabled...
I struggled with this myself, and the steps differ from system to system, so your mileage may vary. That said, you can try something like this: **1.** Obtain...
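Since how much of the model fits on the GPU depends on your card, a rough way to reason about the layer split is sketched below. The helper and every number in it are entirely hypothetical; llama.cpp's actual per-layer memory use varies with model size and quantisation, so measure on your own system:

```python
def layers_to_offload(free_vram_mib, per_layer_mib, total_layers, reserve_mib=512):
    """Estimate how many transformer layers fit in free VRAM.

    All figures are rough assumptions, not llama.cpp internals.
    `reserve_mib` leaves headroom for the context/KV cache.
    """
    usable = max(0, free_vram_mib - reserve_mib)
    return min(total_layers, usable // per_layer_mib)

# e.g. 8 GiB free, ~200 MiB per 4-bit 13B layer, 40 layers total:
print(layers_to_offload(8192, 200, 40))  # -> 38
```

Whatever estimate you land on, start lower and work upward: an over-aggressive split fails with an out-of-memory error rather than degrading gracefully.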
Attached is a rather lengthy example of the sort of output I get with this PR. More than anything, it illustrates the difficulties that my local model has staying on...