Ab

Results 8 comments of Ab

@sshu2017 Can you tell me what is average response time to a question with this? I have it close to 20-45 seconds on an N series Azure VM. Also, the...

@samanemami I got 5 people to ask it 50 questions. It came out to be be close to 50-60%. No, It can cut it off when it wants. It prints...

@samanemami truncated.

@samanemami Nopes

llm_config={"config_list": [ { "model": "gpt-4", "api_key": "sk-" }, { "model": "gpt-3.5-turbo", "api_key": "sk-", "api_version": "2023-03-01-preview" } ]} I am still getting this error, This llm_config seems correct as per the...

@sonichi Some agent may not get proper llm_config , This was it. Thanks a lot for the pointer

> I have fine tuned llama using this repo and a few text documents I had with me. If I provide 3-4 consecutive words from input text, it amazingly completes...

load INSTRUCTOR_Transformer max_seq_length 512 Using embedded DuckDB with persistence: data will be stored in: db CUDA extension not installed. The safetensors archive passed at C:\Users\Administrator/.cache\huggingface\hub\models--TheBloke--Llama-2-7b-Chat-GPTQ\snapshots\b7ee6c20ac0bba85a310dc699d6bb4c845811608\gptq_model-4bit-128g.safetensors does not contain metadata. Make...