bigcodebench icon indicating copy to clipboard operation
bigcodebench copied to clipboard

🤗 [REQUEST] - multi-token-prediction

Open ethanc8 opened this issue 1 year ago • 0 comments
trafficstars

Model introduction

This is a family of four models, where two of the models have been trained to generate 4 tokens per forward pass instead of only a single token like most current LLMs. Multi-token prediction shows significant growths compared to single-token prediction in older benchmarks, so it'd be good to see how much growth can be found in newer benchmarks like BigCode-Bench. These models are not particularly strong, having been trained on 1T tokens or even less.

Model URL

https://huggingface.co/facebook/multi-token-prediction

Additional instructions (Optional)

Inference seems to currently require using Meta's example code.

Author

No

Security

  • [X] I confirm that the model is safe to run which does not contain any malicious code or content.

Integrity

  • [X] I confirm that the model comes from unique and original work and does not contain any plagiarism.

ethanc8 avatar Jul 04 '24 15:07 ethanc8