mergekit
mergekit copied to clipboard
Merge only the transformer parts (including the input embedding layer)
Hi,
I want to know how to merge the transformer layers and input embedding layer only. I want to keep the original lm_head
. Is this possible?