Yuchao Zhang
A user id, conversation id, or session id may also be useful for vector store retrieval
@ajndkr Glad to contribute if I have time. One question: currently the langchain object is created before being passed to the `LangChainRouter` and wrapped in a simple `create_langchain_dependency` for dependency injection. Actually,...
@ajndkr In the current implementation, the request and response models are dynamically determined by `langchain_object` with `create_request_from_langchain_dependency`. What if the construction of `langchain_object` needs to parse the request first to get something like...
The linked code uses all the features except the `add_langchain_api_route` part of `LangchainRouter`. So is that part also immature, since it uses `create_request_from_langchain_dependency`?
I'm wondering whether dora is the same as ray, or whether there's something special about it?
@wyfo I guess I'm a bit lucky. I just wanted to write an async data loading framework for deep learning today. After searching all day, I found your promising work....
Thanks for your detailed reply. I will give PyO3 master a try. One question: how is the performance? I don't know which is better: 1. use `AllowThreads` with stream...
Which part do you mean? The Triton backend should have a parameter like `stop_words`
@poddamatt98 I will take a look at this when I have time.