torchscale

Where is the offset implemented in multi-head dilated attention?

Open: AshStuff opened this issue 2 months ago • 0 comments

AshStuff, Apr 20 '24 19:04