hanklin

2 issues by hanklin

Hi, how can I merge only the transformer layers and the input embedding layer while keeping the original `lm_head`? Is this possible?
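
For illustration only, here is a rough sketch of what "merge everything except `lm_head`" could look like on plain parameter dictionaries. It is not the API of any particular merging tool: the function `merge_except`, the `keep_prefixes` argument, and the parameter names in the example (`model.embed_tokens.weight`, `lm_head.weight`) are all hypothetical, and the merge used is a simple linear interpolation chosen for brevity. Note that if a model ties `lm_head` to the input embedding, the two cannot be treated separately like this.

```python
from typing import Dict, List, Tuple

# A toy "state dict": parameter name -> flat list of weights.
StateDict = Dict[str, List[float]]


def merge_except(
    base: StateDict,
    other: StateDict,
    alpha: float = 0.5,
    keep_prefixes: Tuple[str, ...] = ("lm_head.",),  # hypothetical naming
) -> StateDict:
    """Linearly interpolate base and other, copying protected tensors from base."""
    merged: StateDict = {}
    for name, base_w in base.items():
        if name.startswith(keep_prefixes) or name not in other:
            # Protected (or unmatched) parameter: keep the original weights.
            merged[name] = list(base_w)
        else:
            # Merged parameter: (1 - alpha) * base + alpha * other, elementwise.
            merged[name] = [
                (1 - alpha) * b + alpha * o for b, o in zip(base_w, other[name])
            ]
    return merged


if __name__ == "__main__":
    base = {
        "model.embed_tokens.weight": [0.0, 2.0],
        "model.layers.0.mlp.weight": [1.0, 1.0],
        "lm_head.weight": [5.0, 5.0],
    }
    other = {
        "model.embed_tokens.weight": [2.0, 4.0],
        "model.layers.0.mlp.weight": [3.0, 5.0],
        "lm_head.weight": [9.0, 9.0],
    }
    merged = merge_except(base, other, alpha=0.5)
    print(merged["lm_head.weight"])  # unchanged copy of base's lm_head
```

With real checkpoints the same filtering idea would apply to tensor-valued state dicts, with the elementwise list arithmetic replaced by tensor operations.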

Hi Nathan, I’m currently preparing to release a new repository that contains the code used in my paper. As part of our experiments, we made some slight modifications to the...