Medusa
Medusa copied to clipboard
Token-wise the same generalization?
Is Medusa1 model generalize token-wise the same as the base model w.o. medusa head?
I found change medusa choices will change the output.