Paddle
Support dynamic cache kv size and prefix_caches input in fused_multi_transformer
PR types
Function optimization
PR changes
OPs
Describe
Make fused_multi_transformer support dynamically setting the cache_kv size and accepting prefix_caches as input.
Your PR has been submitted. Thanks for your contribution to the open-source project! Please wait for the CI results first. See the Paddle CI Manual for details.