
Questions on EAGLE-3 training-time test and request for training code release

Open junghye01 opened this issue 7 months ago • 1 comments

Hi, thank you for your great work on EAGLE-3. I am writing to ask a few questions about the training process, especially the training-time test technique, and to ask about the code release timeline.

I understand that EAGLE-3 reuses draft model outputs as inputs during training to simulate inference-time behavior. I have a few questions on this:

  1. Since this simulation resembles autoregressive decoding, I assume it makes full batch parallelism difficult. If so, did it significantly slow down training?
     • Or are there specific techniques used to maintain training efficiency despite this?
  2. At what point in training is the inference simulation applied (e.g., from the beginning, after warm-up steps, etc.)?
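
To make the questions above concrete, here is a minimal sketch of how I picture the training-time test. It is a toy under my own assumptions, not the authors' code: `draft_step`, `cross_entropy`, `training_time_test_losses`, the single linear layer standing in for the draft model, and the shift-by-one labelling are all hypothetical. The point it illustrates is that each simulated step can still run over the whole batch and all positions at once, so only the `num_steps` passes are sequential.

```python
import numpy as np

def draft_step(hidden, W):
    # Hypothetical stand-in for the draft model: one linear layer + tanh,
    # applied to every (batch, position) pair in a single call.
    return np.tanh(hidden @ W)

def cross_entropy(logits, labels):
    # Mean token-level cross-entropy with a numerically stable log-softmax.
    logits = logits - logits.max(axis=-1, keepdims=True)
    logp = logits - np.log(np.exp(logits).sum(axis=-1, keepdims=True))
    picked = np.take_along_axis(logp, labels[..., None], axis=-1)
    return float(-picked.mean())

def training_time_test_losses(target_hidden, tokens, W, head, num_steps=3):
    """Return one loss per simulated draft step.

    target_hidden: (batch, seq, dim) hidden states from the target model.
    tokens:        (batch, seq) integer token ids.
    Step 0 consumes the target model's hidden states; every later step
    consumes the draft's own outputs from the previous step.
    """
    inputs = target_hidden
    losses = []
    for k in range(num_steps):
        # Drop the last position: it has no label k+1 tokens ahead.
        out = draft_step(inputs[:, :-1], W)      # (batch, seq-k-1, dim)
        logits = out @ head                      # (batch, seq-k-1, vocab)
        losses.append(cross_entropy(logits, tokens[:, k + 1:]))
        inputs = out                             # reuse draft outputs as inputs
    return losses

# Usage with random data (shapes only; the values carry no meaning).
rng = np.random.default_rng(0)
hidden = rng.normal(size=(2, 6, 4))
tokens = rng.integers(0, 5, size=(2, 6))
W = rng.normal(size=(4, 4))
head = rng.normal(size=(4, 5))
per_step_losses = training_time_test_losses(hidden, tokens, W, head)
```

If the actual implementation looks roughly like this, the extra cost would scale with the number of simulated steps rather than with sequence length, which is what my first question is trying to confirm.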

Also, could you kindly share:

  • The code used to generate the EAGLE-3 training dataset

  • The training code that implements the training-time test technique or, if it is not yet available, an estimate of when it will be released

These would be extremely helpful for fully understanding and reproducing EAGLE-3.

Thank you again for your time and contribution.

junghye01 avatar Apr 29 '25 08:04 junghye01

I have the same questions as @junghye01. Did you solve these problems, or can anybody help with this? Thanks.

SiqiLi-Fighting avatar May 14 '25 12:05 SiqiLi-Fighting

The training code has been released.

hongyanz avatar Jun 13 '25 04:06 hongyanz