spring-ai Added suport to anthropic prompt cache

trafficstars

Thank you for taking time to contribute this pull request! You might have already read the [contributor guide][1], but as a reminder, please make sure to:

issue https://github.com/spring-projects/spring-ai/issues/1403

I added support to user prompt use cache, added test only to api because Chatmodel/AnthropicChatOptions not can set header beta to works cache

Sign the contributor license agreement
Rebase your changes on the latest main branch and squash your commits
Add/Update unit tests as needed
Run a build and make sure all tests pass prior to submission

Sep 26 '24 01:09 Claudio-code

thanks, can you explain the comment "Chatmodel/AnthropicChatOptions not can set header beta to works cache" please?

Sep 27 '24 17:09 markpollack

Anthropic models added caching as a beta feature that requires a custom header to be used. I couldn't find a way to send a custom header using the options in the Chatmodel/AnthropicChatOptions classes.

That's why I added integration tests only in the AnthropicApiIT.java class.

It would be nice if in the future there was an option to send a custom header in ChatModel class to test beta features like langchain4j https://github.com/langchain4j/langchain4j/issues/1591#issuecomment-2357792634.

With this type of option it would be easier to add integration tests for beta features in the AnthropicChatModelIT.java class.

Sep 27 '24 17:09 Claudio-code

Hi. Thanks for the clarification. I think it is a bit too much to get into M3 only because it modifies the AbstractMessage class and I'd like to think through the implications. I guess much like function calling, this is going to "table stakes" for leading model providers. I'll schedule it for M4 with the hopes we have a 5-ish week release cadence on our march to GA.

thanks for the nice PR.

Oct 06 '24 14:10 markpollack

@Claudio-code @markpollack @tzolov any chance this be merged any sooner than March?

Dec 16 '24 20:12 RRajdev

We are waiting for prompt cache feature too. It should help us to save tokens. Our app sends screenshots to Anthropic and we chose Spring AI framework to deal with LLMs

Dec 19 '24 09:12 piratovi

I saw that the cache left beta, now I can add more complete tests to validate the creation of token caches and the reading of these caches.

Dec 23 '24 22:12 Claudio-code

Thanks, just going through the issues, will aim to merge for m7. thanks for you patience.

Mar 25 '25 15:03 markpollack

@Claudio-code , thanks for your patience. I've rebased your PR with recent changes, cherry-picking some parts and creating a new updated PR: https://github.com/spring-projects/spring-ai/pull/2669 You remain the author. Could you please review to ensure nothing was broken during this process?

Apr 08 '25 09:04 tzolov

Replacing this PR by https://github.com/spring-projects/spring-ai/pull/2669

Apr 08 '25 09:04 tzolov

spring-ai spring-ai copied to clipboard

Added suport to anthropic prompt cache

spring-ai
spring-ai copied to clipboard