Results: 1 comment by Carlos Zambrana
> > > I am using "llama-2-7b-chat.ggmlv3.q2_K.bin" with "LlamaCpp()" in LangChain. The "Llama.generate: prefix-match hit" message repeats many times, but I want the answer only once. How can...