Haicheng Wu
Haicheng Wu
okay, could you please file a PR to us?
this should be fixed now.
Could you please elaborate the input and output of every step? Do you want to fuse two gemms into one kernel similar as what our ex.13 does?
have you tested them if they are working? we would need some unit tests or adding them to the profiler to verify them.
you could check this https://github.com/hpcgarage/cuASR cc @thakkarV
@Junkai-Wu
we are open to suggestions and prs.