
provide visibility into final prompt

Open wskish opened this issue 1 year ago • 11 comments

For debugging or other traceability purposes it is sometimes useful to see the final prompt text as sent to the completion model.

It would be good to have a mechanism that logged or otherwise surfaced (e.g. for storing to a database) the final prompt text.

wskish avatar Feb 06 '23 20:02 wskish

this should be possible with tracing! have you tried it out? https://langchain.readthedocs.io/en/latest/tracing.html
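For reference, a rough sketch of what enabling tracing looked like at the time; the langchain-server step and the LANGCHAIN_HANDLER variable are taken from that docs page and may differ across versions:

import os

# start the local tracing server first (in a separate shell):
#   $ langchain-server
# then enable the tracing handler before constructing any chains
os.environ["LANGCHAIN_HANDLER"] = "langchain"

# every chain/LLM call from here on is recorded in the tracing UI,
# including the final prompt text and the completion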

hwchase17 avatar Feb 07 '23 04:02 hwchase17

I've looked at the tracing/debug system documentation; nevertheless, a minimal requirement could be just to have some "very verbose" flag for LLMs and/or chains, to print out the LLM prompts (+ completions).

BTW, this is not an issue but a feature request.

Consider the following chunk (Weather and Datetime are custom tools; the full script is in a later comment):

from langchain.agents import initialize_agent
from langchain.llms import OpenAI
from langchain import LLMChain
from langchain.prompts import PromptTemplate

# custom tools
from weather_tool import Weather
from datetime_tool import Datetime

llm = OpenAI(temperature=0)

template='''\
Please respond to the questions accurately and succinctly. \
If you are unable to obtain the necessary data after seeking help, \
indicate that you do not know.
'''

prompt = PromptTemplate(input_variables=[], template=template)

llm_weather_chain = LLMChain(llm=llm, prompt=prompt, verbose=True)

tools = [Weather, Datetime]

agent = initialize_agent(tools, llm, agent="zero-shot-react-description", verbose=True)

The output of the above program shows the agent's behavior, in nicely colorized text.

  • but it doesn't show the text prompts sent to the LLM (or the completions coming back). This is what I miss most. I'd like an llm() argument verbose=True so that the LLM interaction is printed too.

  • BTW, what does the chain flag verbose=True do? I don't see anything printed.

solyarisoftware avatar Feb 07 '23 17:02 solyarisoftware

there is a verbose flag you can pass into the llm! llm = OpenAI(temperature=0, verbose=True) should print it out

hwchase17 avatar Feb 08 '23 15:02 hwchase17

Thanks Harrison,

  • I set the LLM verbose flag to true, but I don't see the LLM prompt+completion printed out.
  • Also, the LLMChain verbose flag seems to do nothing?!
  • Instead, the agent verbose flag works as expected, printing nice coloured traces.

$ cat agent.py

#
# tools_agent.py
#
# zero-shot react agent that answers questions using available tools
# - Weather
# - Datetime
#
import sys

from langchain.agents import initialize_agent
from langchain.llms import OpenAI
from langchain import LLMChain
from langchain.prompts import PromptTemplate

# import custom tools
from weather_tool import Weather
from datetime_tool import Datetime

llm = OpenAI(temperature=0, verbose=True)

template='''\
Please respond to the questions accurately and succinctly. \
If you are unable to obtain the necessary data after seeking help, \
indicate that you do not know.
'''

prompt = PromptTemplate(input_variables=[], template=template)

# Load the tool configs that are needed.
llm_weather_chain = LLMChain(
    llm=llm,
    prompt=prompt,
    verbose=True
)

tools = [
    Weather,
    Datetime
]

# Construct the react agent type.
agent = initialize_agent(
    tools,
    llm,
    agent="zero-shot-react-description",
    verbose=True
)


if __name__ == '__main__':
    if len(sys.argv) > 1:
        question = ' '.join(sys.argv[1:])
        print('question: ' + question)

        # run the agent
        agent.run(question)
    else:
        print('Agent answers questions using Weather and Datetime custom tools')
        print('usage: py tools_agent.py <question sentence>')
        print('example: py tools_agent.py what time is it?')

$ py agent.py "how is the weather today in Genova?"

question: how is the weather today in Genova?

> Entering new AgentExecutor chain...
 I need to get the weather forecast for Genova
Action: weather
Action Input: {"when":"today","where":"Genova"}
Observation: {"forecast": "sunny", "temperature": "20 degrees Celsius"}
Thought: I now know the weather forecast for Genova
Final Answer: The weather today in Genova is sunny with a temperature of 20 degrees Celsius.

> Finished chain.

solyarisoftware avatar Feb 08 '23 16:02 solyarisoftware

good catch - we need to fix this bug probably, but currently the way to do it would actually be to set

agent.agent.llm_chain.verbose=True
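In context, that looks something like this (reusing the setup from the earlier snippet):

agent = initialize_agent(
    tools,
    llm,
    agent="zero-shot-react-description",
    verbose=True
)

# reach into the inner LLMChain (the chain that actually formats
# and sends the prompt) and flip its verbose flag
agent.agent.llm_chain.verbose = True

agent.run("how is the weather today in Genova?")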

hwchase17 avatar Feb 11 '23 07:02 hwchase17

Thanks. The workaround works, but yes I think it's a bug.

solyarisoftware avatar Feb 13 '23 15:02 solyarisoftware

I am studying the project and wanted to make some contributions, and fixing some bugs/issues seemed like a good start, so I read through this issue and the related code. I think the issue happens because there are actually two chains:

  • an outer one (AgentExecutor is a chain) returned by initialize_agent(...)
  • an inner one created when initializing the agent; this inner one is the chain that really talks to the LLM

initialize_agent(..., verbose=True) just makes the outer chain verbose; the workaround agent.agent.llm_chain.verbose=True is for the inner chain.

Based on the above analysis, I think there are two ways to fix the issue:

  1. When initialize_agent is called with verbose=True, also set the inner chain's verbose flag. This makes the API simple and easy to understand; the problem is that the output might be overly verbose.
  2. Call initialize_agent like initialize_agent(..., agent_kwargs={"verbose": True}, verbose=True), where agent_kwargs is for the inner chain; we can use it to control the inner chain's output. This gives precise control over the output, but the API might be a little confusing.

@hwchase17 do you have some suggestions?

MacYang555 avatar Mar 01 '23 08:03 MacYang555

Thanks! I'd add a note on the functional meaning of "verbose":

  • when applied to an LLM instance, the expected behavior (in my mind) is to show ALL interactions (prompt+completion) with the LLM.
  • when applied to an agent instance, the expected behavior is the working logs that explain what the agent is doing.

So, I see the LLM verbosity as something different (at a lower level) from the agent verbosity. Does that make sense?

solyarisoftware avatar Mar 01 '23 09:03 solyarisoftware

Maybe adding a verbose_llm parameter to initialize_agent would make the API easier to understand.

MacYang555 avatar Mar 01 '23 10:03 MacYang555

Well, it could be a way, but currently, when you set verbose=True I see these different cases:

  • llm => does nothing (a bug?)
  • chain => should print the prompt+completion logs, but does nothing, presumably because of the same bug (?)
  • agent => prints the agent behavior logs

So llm, chain, and agent already have their own distinct verbose flags, with possibly different meanings. I'd prefer to keep the same flag name "verbose". What is not fully clear is what verbose means for each of llm, chain, and agent.

solyarisoftware avatar Mar 01 '23 12:03 solyarisoftware

Yep, right now it's impossible to see the final executed prompt :(

maxbaluev avatar Apr 22 '23 12:04 maxbaluev

Is there any update on this? I think it is critical to be able to see the final prompt sent to the LLMs. Currently, working with LangChain is too opaque; it makes it really difficult to build complex chains without making mistakes.

alberduris avatar May 25 '23 19:05 alberduris

Having the same issue; I need to see the final prompt too.

medram avatar Jun 12 '23 23:06 medram

It looks like one possible workaround to get the final prompt is to attach a StdOutCallbackHandler to the chain:

from langchain.callbacks import StdOutCallbackHandler

handler = StdOutCallbackHandler()
chain.run(..., callbacks=[handler])

forin87 avatar Jun 21 '23 15:06 forin87

Setting agent.agent.llm_chain.verbose=True worked for me with the latest version (langchain==0.0.216). I agree that expected behavior is that setting verbose=True on the LLM would do this.

rogerbock avatar Jun 27 '23 17:06 rogerbock

Setting langchain.debug=True will print every prompt the agent executes, with all the details possible. Use this code:

import langchain
langchain.debug=True
response = agent.run(prompt)
langchain.debug=False

The output may not be as pretty as verbose. I think verbose is designed to work at a higher level, for individual queries, while debug is more useful for debugging and granular control.

Im-Himanshu avatar Aug 01 '23 07:08 Im-Himanshu

Using langchain (0.0.256)

Building on forin87's answer, the snippet below logs the messages to the console:

import langchain
langchain.verbose=True

If you want the prompt as a variable, I'd suggest using callbacks:

from langchain.chains import LLMChain
from langchain.llms import OpenAI
from langchain.prompts import PromptTemplate
from langchain.callbacks.base import BaseCallbackHandler


class MyCustomHandler(BaseCallbackHandler):
    def on_chain_start(self, serialized, inputs, **kwargs):
        # parse `serialized` and `inputs` here, e.g. save the prompt to a database
        pass


handler = MyCustomHandler()
llm = OpenAI()
prompt = PromptTemplate.from_template("1 + {number} = ")

chain = LLMChain(llm=llm, prompt=prompt, callbacks=[handler])
chain.run(number=2)
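Note that on_chain_start only sees the raw input variables. To capture the final formatted prompt string exactly as it is sent to the model, a handler on on_llm_start should work as well; a sketch, with FinalPromptHandler as an illustrative name:

class FinalPromptHandler(BaseCallbackHandler):
    def on_llm_start(self, serialized, prompts, **kwargs):
        # `prompts` is the list of fully formatted prompt strings
        # about to be sent to the LLM
        for p in prompts:
            print(p)  # or persist to a database instead


chain = LLMChain(llm=llm, prompt=prompt, callbacks=[FinalPromptHandler()])
chain.run(number=2)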

Hopefully this helps!

utbdankar avatar Aug 17 '23 13:08 utbdankar

Is there an update to this? On top of the final prompt, I believe the final response coming from OpenAI would be helpful too: things like prompt token count, completion token count, stop reason, etc.

mikelam14 avatar Sep 07 '23 04:09 mikelam14

Hi, @wskish

I'm helping the LangChain team manage their backlog and am marking this issue as stale. The issue you raised requests a mechanism to provide visibility into the final prompt text sent to the completion model for debugging and traceability purposes. The comments discuss various workarounds and potential solutions, including setting the verbose flag for the LLM and agent instances, using callback handlers, and modifying the langchain debug setting. There is also a suggestion to add a verbose_llm parameter to initialize_agent for better control over the output. The issue has garnered significant interest and support from the community, with multiple users expressing the need for this feature.

Could you please confirm if this issue is still relevant to the latest version of the LangChain repository? If it is, please let the LangChain team know by commenting on the issue. Otherwise, feel free to close the issue yourself, or the issue will be automatically closed in 7 days. Thank you!

dosubot[bot] avatar Dec 07 '23 16:12 dosubot[bot]

Would be interesting to see if there are updates on this issue

cryoff avatar Jan 25 '24 17:01 cryoff

If anyone is looking for a simple string output of a single prompt, you can use the .format() method of ChatPromptTemplate; it should work with any BaseChatPromptTemplate subclass.

I struggled to find this as well. In my case I wanted the final formatted prompt string being used inside of the API call.

Example usage:

from langchain.prompts import (
    ChatPromptTemplate,
    HumanMessagePromptTemplate,
    MessagesPlaceholder,
    PromptTemplate,
)
from langchain.schema import SystemMessage

# Define a partial variable for the chatbot to use
my_partial_variable = """APPLE SAUCE"""

# Initialize your chat template with partial variables
prompt_messages = [
    # System message
    SystemMessage(content=("""You are a hungry, hungry bot""")),
    # Instructions for the chatbot to set context and actions
    HumanMessagePromptTemplate(
        prompt=PromptTemplate(
            template="""Your life goal is to search for some {conversation_topic}. If you encounter food in the conversation below, please eat it:\n###\n{conversation}\n###\nHere is the food: {my_partial_variable}""",
            input_variables=["conversation_topic", "conversation"],
            partial_variables={"my_partial_variable": my_partial_variable},
        )
    ),
    # Placeholder for additional agent notes
    MessagesPlaceholder("agent_scratchpad"),
]

prompt = ChatPromptTemplate(messages=prompt_messages)
prompt_as_string = prompt.format(
    conversation_topic="Delicious food",
    conversation="Nothing about food to see here",
    agent_scratchpad=[],
)
print(prompt_as_string)

Output:
System: You are a hungry, hungry bot
Human: Your life goal is to search for some Delicious food. If you encounter food in the conversation below, please eat it:
###
Nothing about food to see here
###
Here is the food: APPLE SAUCE

nathanjones4323 avatar Feb 09 '24 05:02 nathanjones4323

I ended up using callbacks (like StdOut / self-implemented loguru-based / langfuse / arize-phoenix / mlflow / wandb)

cryoff avatar Feb 23 '24 18:02 cryoff

Wait, what is the final solution for this though? I can't wrap my head around making things complex for something that should have been basic.

krishna-praveen avatar Feb 24 '24 15:02 krishna-praveen

@krishna-praveen for me it's using the community-provided / self-implemented langchain callback mechanism

cryoff avatar Mar 09 '24 13:03 cryoff

chain.prompt.format_prompt(**input)
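Spelled out a bit more (a sketch; format_prompt returns a PromptValue, and the question keyword stands in for whatever input variables your prompt declares):

prompt_value = chain.prompt.format_prompt(question="how is the weather today in Genova?")
print(prompt_value.to_string())    # the final prompt as plain text
print(prompt_value.to_messages())  # or as a list of chat messages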

pabloespana avatar Apr 03 '24 13:04 pabloespana