MISA
tensor transpose kernels
Requirements:
- [x] input/output: nchw -> nchw-vecc, nchw-vecc -> nchw
- [ ] weight: nchw -> chwn-vecc
- [ ] padding transpose: when c % vecc != 0, pad zeros at the tail of the vecc dimension
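
As a rough host-side reference for the input/output and padding items, here is a minimal sketch in plain Python. It assumes that `nchw-vecc` means a `[n, ceil(c/vecc), h, w, vecc]` layout, with `vecc` consecutive channels packed innermost; the list above does not spell the layout out, so treat that as an assumption. The function names are hypothetical and are not taken from the kernels.

```python
def nchw_to_nchw_vecc(src, n, c, h, w, vecc):
    """Pack a flat NCHW buffer into the assumed [n, ceil(c/vecc), h, w, vecc] layout.

    When c % vecc != 0, the unused tail slots of the last channel block stay 0.
    """
    c_blocks = (c + vecc - 1) // vecc            # number of channel blocks, rounded up
    dst = [0] * (n * c_blocks * h * w * vecc)    # zero-filled, so padding comes for free
    for i_n in range(n):
        for i_c in range(c):
            cb, cv = divmod(i_c, vecc)           # block index, position inside the block
            for i_h in range(h):
                for i_w in range(w):
                    s = ((i_n * c + i_c) * h + i_h) * w + i_w
                    d = (((i_n * c_blocks + cb) * h + i_h) * w + i_w) * vecc + cv
                    dst[d] = src[s]
    return dst


def nchw_vecc_to_nchw(src, n, c, h, w, vecc):
    """Inverse of nchw_to_nchw_vecc: unpack to flat NCHW and drop the padded slots."""
    c_blocks = (c + vecc - 1) // vecc
    dst = [0] * (n * c * h * w)
    for i_n in range(n):
        for i_c in range(c):
            cb, cv = divmod(i_c, vecc)
            for i_h in range(h):
                for i_w in range(w):
                    s = (((i_n * c_blocks + cb) * h + i_h) * w + i_w) * vecc + cv
                    d = ((i_n * c + i_c) * h + i_h) * w + i_w
                    dst[d] = src[s]
    return dst


# n=1, c=3, h=1, w=2, vecc=2: channel 2 has no partner, so its slot is padded with 0.
packed = nchw_to_nchw_vecc([1, 2, 3, 4, 5, 6], 1, 3, 1, 2, 2)
print(packed)                                    # [1, 3, 2, 4, 5, 0, 6, 0]
print(nchw_vecc_to_nchw(packed, 1, 3, 1, 2, 2))  # [1, 2, 3, 4, 5, 6]
```

A reference like this can serve to check a kernel's output element by element; the index arithmetic is the same per-element mapping a transpose kernel would compute.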