Xiao
I use the code below to run one query.

```python
import torch
# from transformers import BitsAndBytesConfig
from llama_index.llms.huggingface import HuggingFaceLLM
from llama_index.core.prompts import PromptTemplate
from llama_index.core import Settings
from llama_index.core...
```
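Since the script is truncated above, this is roughly what the setup looks like (a minimal sketch only; the model name, embedding model, and data path here are placeholders, not my exact code):

```python
import torch
from llama_index.core import Settings, SimpleDirectoryReader, VectorStoreIndex
from llama_index.embeddings.huggingface import HuggingFaceEmbedding
from llama_index.llms.huggingface import HuggingFaceLLM

# Local HF model wrapped for llama-index; generation runs on the GPU.
Settings.llm = HuggingFaceLLM(
    model_name="meta-llama/Llama-2-7b-chat-hf",      # placeholder model
    tokenizer_name="meta-llama/Llama-2-7b-chat-hf",  # placeholder tokenizer
    context_window=3900,
    max_new_tokens=256,
    model_kwargs={"torch_dtype": torch.float16},
    device_map="auto",
)
# Local embedding model so retrieval does not call any remote API.
Settings.embed_model = HuggingFaceEmbedding(model_name="BAAI/bge-small-en-v1.5")

# Build an index and run a single query; retrieval runs on the CPU.
documents = SimpleDirectoryReader("./data").load_data()
index = VectorStoreIndex.from_documents(documents)
query_engine = index.as_query_engine()
response = query_engine.query("What did the author do growing up?")
print(response)
```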
My log:

```
Loading checkpoint shards: 0%| | 0/2 [00:00
```
I found that there are multiple LLM calls, and I don't understand why we need them. First, we generate multiple new queries; this is one LLM call. Then we have...
> You are using a step decompose query transform
>
> So it's taking the original query and decomposing it into multiple
>
> The other queries are because it's...
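For context, my multi-step setup is wired up roughly like this (a sketch based on the llama-index multi-step query engine docs; the data path, summary string, and question are placeholders):

```python
from llama_index.core import Settings, SimpleDirectoryReader, VectorStoreIndex
from llama_index.core.indices.query.query_transform.base import StepDecomposeQueryTransform
from llama_index.core.query_engine import MultiStepQueryEngine

index = VectorStoreIndex.from_documents(SimpleDirectoryReader("./data").load_data())

# One LLM call decomposes the original question into a new sub-query at each step,
# one LLM call answers each sub-query over its retrieved context, and a final call
# synthesizes the combined answer -- which is why a single .query() shows several LLM calls.
step_decompose = StepDecomposeQueryTransform(llm=Settings.llm, verbose=True)
multi_step_engine = MultiStepQueryEngine(
    query_engine=index.as_query_engine(),
    query_transform=step_decompose,
    index_summary="Used to answer questions about the loaded documents",
    num_steps=3,  # cap on the number of decomposition steps
)
response = multi_step_engine.query("How many LLM calls does one query trigger?")
```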
And `from llama_index.core.callbacks import CallbackManager, LlamaDebugHandler` does not work for me, even though I followed the doc: https://docs.llamaindex.ai/en/stable/examples/callbacks/LlamaDebugHandler/
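This is the wiring I expected to work based on that page (a sketch; the last loop is just how I wanted to count and inspect the individual LLM calls):

```python
from llama_index.core import Settings
from llama_index.core.callbacks import CallbackManager, LlamaDebugHandler

# Register the debug handler globally (before building the index / query engine)
# so every query prints a trace of its retrieve and LLM events.
llama_debug = LlamaDebugHandler(print_trace_on_end=True)
Settings.callback_manager = CallbackManager([llama_debug])

# ... run a query, then inspect each LLM call's inputs and outputs:
for start_event, end_event in llama_debug.get_llm_inputs_outputs():
    print(end_event.payload.keys())
```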
I used nsys to profile llama-index. It seems the retrieval is on the CPU side and the LLM call is on the GPU side. Are there other things that are CPU...
> @lambda7xx when running a model locally like you are, there is no advantage to async, since it is all compute bound. Async only makes sense for
>
> a)...
Why is there the weird LLM call?
I think the text below is not related to my query.

```
Question: How many Grand Slam titles does the winner of the 2020 Australian Open have?
Knowledge source context: Provides...
```
> It seems like your LLM just barfed while generating sub-queries (this "odd" query is a refine step, but the input to the refine step is part of the prompt...
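If someone else hits this, the baked-in example questions can be seen by dumping the query engine's prompt templates (a sketch; `multi_step_engine` is the engine from the sketch above):

```python
# Print every prompt template the query engine uses; the step-decompose and
# refine templates include few-shot examples such as the Grand Slam question,
# so that text shows up in the LLM inputs even though it is unrelated to my query.
prompts_dict = multi_step_engine.get_prompts()
for prompt_key, prompt in prompts_dict.items():
    print(prompt_key)
    print(prompt.get_template())
    print("-" * 40)
```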