hanklin
Results
2
issues of
hanklin
Hi, I want to know how to merge the transformer layers and input embedding layer only. I want to keep the original `lm_head`. Is this possible?
Hi Nathan, I’m currently preparing to release a new repository that contains the code used in my paper. As part of our experiments, we made some slight modifications to the...