
add single execution option for ConversationalRetrievalChain

Open jpzhangvincent opened this issue 2 years ago • 9 comments


  • The current default workflow for ConversationalRetrievalChain is to rephrase the question based on the chat history and then use the rephrased question for retrieval and question answering.
  • This two-step process has trade-offs: it helps stay within the context size limit, but it adds latency and the rephrasing step can lose information.
  • We can make the question_generator (chain) argument optional so the chain can skip the rephrasing step and instead use a custom prompt that combines the chat_history, context, and question in a single execution (see the sketch below). This gives users more flexibility for their use cases and requirements.
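
To make the proposal concrete, here is a rough sketch of how the single-execution option could be used, assuming this PR's change that makes question_generator optional; the prompt wording, model, and vector store below are illustrative placeholders rather than anything taken from the PR itself:

```python
# Rough sketch of the proposed usage, assuming this PR's change that makes
# `question_generator` optional. Prompt wording, model, and vector store are
# placeholders, not part of the PR.
from langchain.chains import ConversationalRetrievalChain, LLMChain, StuffDocumentsChain
from langchain.llms import OpenAI
from langchain.prompts import PromptTemplate

llm = OpenAI(temperature=0)

# A single prompt that folds the chat history, retrieved context, and the new
# question together, so no separate rephrasing call is needed.
prompt = PromptTemplate(
    input_variables=["chat_history", "context", "question"],
    template=(
        "Conversation so far:\n{chat_history}\n\n"
        "Retrieved context:\n{context}\n\n"
        "Question: {question}\nAnswer:"
    ),
)

combine_docs_chain = StuffDocumentsChain(
    llm_chain=LLMChain(llm=llm, prompt=prompt),
    document_variable_name="context",
)

qa = ConversationalRetrievalChain(
    retriever=vectorstore.as_retriever(),  # assumes an existing vector store
    combine_docs_chain=combine_docs_chain,
    question_generator=None,  # proposed: skip the question-rephrasing step
)

result = qa({"question": "What did we discuss about latency?", "chat_history": []})
```

With question_generator left out, the chain would go straight from retrieval to answering, trading the condensed standalone question for a single LLM call.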

Fixes #5123

Before submitting

I updated the notebook to include the "## ConversationalRetrievalChain with only custom combine_docs_chain" section

Who can review?

Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: @hwchase17 @dev2049

jpzhangvincent avatar May 21 '23 18:05 jpzhangvincent

I believe this feature provides practical value. Can I get a review on this one? @hwchase17 @dev2049

jpzhangvincent avatar May 26 '23 17:05 jpzhangvincent

Hello guys, we're having a similar issue with the ConversationalRetrievalQAChain class. When adding a Pinecone vector store for memory, it can even reach 45 seconds of total response time, which is far from ideal. This PR does seem to have the potential to mitigate the issue, is it being actively looked at?

Thanks in advance.

franco-roura avatar May 29 '23 13:05 franco-roura

I think we just need better documentation around conversational retrieval chains in general.

hwchase17 avatar Jun 03 '23 21:06 hwchase17

> I think we just need better documentation around conversational retrieval chains in general.

I have updated the notebook to show how to set question_generator to None and use a custom prompt. Feel free to let me know if anything else is needed to get this PR merged. Thanks!

jpzhangvincent avatar Jun 06 '23 18:06 jpzhangvincent

Hoping this can get merged soon since I need this patch for a work project! @hwchase17 @dev2049 any feedback?

jpzhangvincent avatar Jun 11 '23 17:06 jpzhangvincent

👀👀👀

italojs avatar Jun 13 '23 23:06 italojs

@dev2049 @hwchase17 Can you take a look at this PR? Sorry for chasing.

jpzhangvincent avatar Jun 27 '23 16:06 jpzhangvincent

How can I implement this solution in my project?

anthonycoded avatar Jul 01 '23 18:07 anthonycoded

A much-needed feature. I am looking for this.

kowsikgelli avatar Aug 31 '23 06:08 kowsikgelli

@jpzhangvincent Hi, could you please resolve the merge conflicts and address the last comments (if needed)? After that, ping me and I'll push this PR for review. Thanks!

leo-gan avatar Sep 13 '23 20:09 leo-gan

@jpzhangvincent The merge conflicts are back :( Could you please resolve them? After that, ping me and I'll push this PR for review. Thanks!

leo-gan avatar Sep 15 '23 01:09 leo-gan

> @jpzhangvincent The merge conflicts are back :( Could you please resolve them? After that, ping me and I'll push this PR for review. Thanks!

@leo-gan I believe I have fixed the merge conflicts. I tried to add a test or an example in the notebook (if needed), but I'm not sure where/which file to add it to specifically. Open to suggestions!

jpzhangvincent avatar Sep 15 '23 17:09 jpzhangvincent

@jpzhangvincent There are failing unit tests and pydantic compatibility tests. Once you fix them, we will be ready.

leo-gan avatar Sep 15 '23 18:09 leo-gan

@leo-gan I believe I have resolved the test issues. Please review again and feel free to make code changes directly!

jpzhangvincent avatar Sep 19 '23 06:09 jpzhangvincent

@dev2049 @baskaryan Can we get a merge for this? Sorry for jumping into the conversation, but I'm waiting to use this feature ASAP in production code.

techcentaur avatar Sep 22 '23 08:09 techcentaur

@baskaryan Could you please review it? It looks like it's urgent for users.

leo-gan avatar Sep 22 '23 16:09 leo-gan

So any update on this PR? Thanks. - @leo-gan @baskaryan

jpzhangvincent avatar Oct 04 '23 13:10 jpzhangvincent

Slow for me as well: 30 seconds.

pixeldu avatar Oct 12 '23 05:10 pixeldu

When will this be fixed? This problem is very urgent. @leo-gan @baskaryan

yuanpusheng avatar Nov 01 '23 06:11 yuanpusheng

just adding myself to the chorus of those begging for this PR to be merged @leo-gan @baskaryan

DevOtts avatar Nov 02 '23 18:11 DevOtts

@baskaryan @leo-gan Okay, I have addressed the feedback and added a unit test. Hopefully we can get this merged soon, since it's a low-hanging but useful feature.

jpzhangvincent avatar Dec 23 '23 03:12 jpzhangvincent

> @baskaryan @leo-gan Okay, I have addressed the feedback and added a unit test. Hopefully we can get this merged soon, since it's a low-hanging but useful feature.

@baskaryan @leo-gan @hwchase17 thoughts on merging this PR?

jpzhangvincent avatar Jan 05 '24 19:01 jpzhangvincent

Can you please review again? Thanks! @baskaryan @leo-gan

jpzhangvincent avatar Mar 06 '24 00:03 jpzhangvincent

Thank you for this. Closing because this is readily achievable with create_retrieval_chain. See the example snippet in the API reference. You can also reference this guide on migrating from ConversationalRetrievalChain to LCEL implementations. Please let me know if you have any other concerns.
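
For reference, a minimal sketch of the create_retrieval_chain approach described above, assuming an OpenAI chat model and an existing vector store; the prompt wording and example inputs are illustrative only:

```python
# Minimal sketch of the create_retrieval_chain approach mentioned above.
# The prompt wording, model, and vector store are illustrative placeholders.
from langchain.chains import create_retrieval_chain
from langchain.chains.combine_documents import create_stuff_documents_chain
from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(model="gpt-3.5-turbo", temperature=0)

# One prompt that sees the chat history, the retrieved context, and the new
# question, so each turn is answered in a single LLM call.
prompt = ChatPromptTemplate.from_messages(
    [
        ("system", "Answer using the following context:\n\n{context}"),
        MessagesPlaceholder("chat_history"),
        ("human", "{input}"),
    ]
)

combine_docs_chain = create_stuff_documents_chain(llm, prompt)
retrieval_chain = create_retrieval_chain(
    vectorstore.as_retriever(),  # assumes an existing vector store
    combine_docs_chain,
)

chat_history = []  # prior turns as a list of messages; empty for the first turn
result = retrieval_chain.invoke(
    {"input": "What did we discuss about latency?", "chat_history": chat_history}
)
print(result["answer"])
```

Because the prompt receives the chat history and retrieved context directly, there is no separate question-rephrasing call, which is the single-execution behavior this PR originally asked for.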

ccurme avatar Jul 08 '24 13:07 ccurme