Hongkun Chen
Results
12
comments of
Hongkun Chen
@Ageliss Awesome paper on scaling law on spec decoding!! But I still have some questions in the paper, which only used EAGLE2 configuration, and exclude EAGLE3 train-time test + feature...
Thanks for the reply!! Seems like the norm layer plays a critical role in the scaling! It is also mentioned at this [issue](https://github.com/SafeAILab/EAGLE/issues/220) in EAGLE repo