langchain
LlamaCpp loading time
Can I improve the loading time of Llama 7B/13B models with LlamaCpp? I am using the LlamaCpp class with LLMChain and RetrievalQA.from_chain_type in my Python code.