Dong Li
Dong Li
Following your steps, I divided cnews into two tag sets and trained two teacher models, but when I am ready to run monte_carlo.py by myself, I do not find the...
Can you share your prompt about code scoring data production? I want to make a c and c++ dataset for pre-training using this prompt. Of course, if you have already...
First of all, thank you for open-sourcing the implementation of speculative decoding at batch size > 1. I would like to ask if it is possible to adapt directly to...