tvm
tvm copied to clipboard
[Relax] Handle dynamic arguments in legalization of nn.attention
trafficstars
Prior to this commit, when using causal_mask="BottomRight" in R.nn.attention, the legalization would assume that the query and key/value sequence lengths were static integers. This commit updates the legalization to allow dynamic shapes.