llama-stack
llama-stack copied to clipboard
add token usage metadata to agents turn execution results
🚀 Describe the new functionality needed
add token usage metadata from inference steps to agents when getting turn execution results
💡 Why is this needed? What if we don't build it?
for majority of applications it is required to track token usage to estimate costs associated with the agent utilization
Other thoughts
No response