Experiments for speculative_decoding

Open taoxunqiang opened this issue 2 years ago • 3 comments

"We tested the speculative inference using the first 100 inputs from alpaca test dataset as prompts. When model=gpt2-xl, draft_model=gpt2".

I want to test the speedup for my own model and draft_model. Where can I find the scripts for this?

Thank you in advance.

taoxunqiang avatar Nov 20 '23 07:11 taoxunqiang

Will work on this ASAP🏃‍♂️. Just to clarify: do you want to test speculative inference in that specific setting (first 100 inputs from the alpaca test dataset as prompts) with model=your_model and draft_model=your_draft_model, or in some other setting?

wheresmyhair avatar Nov 20 '23 17:11 wheresmyhair

Thanks for the reply.
There are no special requirements at the moment. I think the first setting is enough for me.

taoxunqiang avatar Nov 21 '23 07:11 taoxunqiang

I am wondering if the problem has been resolved. If you need anything, please feel free to let us know 😄

research4pan avatar Jan 14 '24 07:01 research4pan
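
For the setting discussed in this thread (the first 100 prompts, one target model and one draft model), a speedup measurement could be sketched roughly as below. This is not LMFlow's script, which the thread does not show. The names `time_generation` and `measure_speedup` are hypothetical, and the two `generate` callables are assumed to be thin wrappers you write around your own model with and without the draft model.

```python
import time
from typing import Callable, Dict, Iterable


def time_generation(generate: Callable[[str], object], prompts: Iterable[str]) -> float:
    """Return the total wall-clock seconds spent calling `generate` on every prompt."""
    start = time.perf_counter()
    for prompt in prompts:
        generate(prompt)
    return time.perf_counter() - start


def measure_speedup(
    baseline: Callable[[str], object],
    speculative: Callable[[str], object],
    prompts: Iterable[str],
    num_prompts: int = 100,
) -> Dict[str, float]:
    """Time both callables on the first `num_prompts` prompts and report the ratio.

    `baseline` is assumed to run the target model alone; `speculative` is
    assumed to run the same target model together with the draft model.
    """
    selected = list(prompts)[:num_prompts]
    baseline_s = time_generation(baseline, selected)
    speculative_s = time_generation(speculative, selected)
    # Guard against a zero denominator when the timed work is negligible.
    speedup = baseline_s / speculative_s if speculative_s > 0 else float("inf")
    return {
        "num_prompts": len(selected),
        "baseline_s": baseline_s,
        "speculative_s": speculative_s,
        "speedup": speedup,
    }


if __name__ == "__main__":
    # Stand-in callables that only sleep; replace them with real model calls.
    demo_prompts = ["prompt one", "prompt two", "prompt three"]
    report = measure_speedup(
        baseline=lambda p: time.sleep(0.02),
        speculative=lambda p: time.sleep(0.01),
        prompts=demo_prompts,
    )
    print(report)
```

With real models, it is usually worth running a few untimed warm-up prompts first and, on a GPU, making sure pending work has finished before reading the clock, since otherwise the timings may not reflect actual generation cost. Both callables should use the same decoding settings and the same maximum number of new tokens so that the comparison is fair.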