Implementation ideas for equivalent replacement from xformers to pytorch

Open tzayuan opened this issue 2 months ago • 3 comments

❓ Questions and Help

Re-implementing xformers operators in plain PyTorch seems like a good engineering exercise for understanding how these networks work under the hood. Could you suggest a systematic approach for writing PyTorch equivalents of the various operator modules (for example, xformers.ops.fmha.BlockDiagonalMask.from_seqlens)? The documentation does not provide an equivalent PyTorch reference for some modules, so how can I explore and debug my own equivalent implementations efficiently? I would also appreciate pointers on what background to learn and which papers or code to read to trace where these operators come from. Thank you.
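To make the question concrete, here is a minimal sketch of what a plain-PyTorch stand-in for a block-diagonal attention mask could look like, plus one way to check it. It assumes PyTorch 2.0 or later (for `torch.nn.functional.scaled_dot_product_attention`) and the `[batch, heads, tokens, head_dim]` layout that function expects, which differs from the `[batch, tokens, heads, head_dim]` layout used by xformers' memory-efficient attention. The helper names `block_diagonal_bool_mask`, `packed_attention`, and `per_sequence_attention` are hypothetical, and this is not how xformers implements the mask internally.

```python
import torch
import torch.nn.functional as F


def block_diagonal_bool_mask(seqlens, device=None):
    """Boolean [total, total] mask, True where query and key are in the same sequence.

    Hypothetical helper: it only mimics the masking pattern of a
    block-diagonal bias built from sequence lengths.
    """
    lengths = torch.tensor(seqlens, device=device)
    # seq_ids[i] is the index of the sequence that packed token i belongs to,
    # e.g. seqlens [2, 3] gives [0, 0, 1, 1, 1].
    seq_ids = torch.repeat_interleave(
        torch.arange(len(seqlens), device=device), lengths
    )
    return seq_ids[:, None] == seq_ids[None, :]


def packed_attention(q, k, v, seqlens):
    """Attention over sequences packed along the token axis.

    q, k, v: [1, heads, total_tokens, head_dim] (assumed layout).
    """
    mask = block_diagonal_bool_mask(seqlens, device=q.device)
    # With a boolean attn_mask, True marks positions that may be attended to.
    return F.scaled_dot_product_attention(q, k, v, attn_mask=mask)


def per_sequence_attention(q, k, v, seqlens):
    """Slow reference: run unmasked attention on each sequence separately."""
    outputs = []
    start = 0
    for n in seqlens:
        sl = slice(start, start + n)
        outputs.append(
            F.scaled_dot_product_attention(q[:, :, sl], k[:, :, sl], v[:, :, sl])
        )
        start += n
    return torch.cat(outputs, dim=2)
```

Comparing the packed version against the slow per-sequence reference on small random inputs (for instance with `torch.allclose` and a tolerance such as `atol=1e-5`) is one way to debug an equivalent implementation before comparing against xformers itself.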

Apr 09 '24 02:04 tzayuan
