KUNPENG GUO
KUNPENG GUO
Hello! I would like to ask about the meaning of this line: https://github.com/lm-sys/FastChat/blob/a26db3c814889035d92c8ae80d6defbd7381ee55/fastchat/serve/inference.py#L189 `max_new_tokens` is for the space for the new generation but what's the `8` for? Thanks in advance...
# 🚀 Feature request Adapter Support For the Longformer models ## Motivation For question-answering over long documents, especially when the answers are long (surpass the `384` or `512` max sequence...
Hey there, In the cached results section, you put two links to the cached retrieval results of SPARTA retriever and BM25... but I found that these two links link to...