guided-decoding topic

List guided-decoding repositories

dash-infer

270
Stars
27
Forks
270
Watchers

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.

learning-from-rewards-llm-papers

59
Stars
2
Forks
59
Watchers

A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward models and learning strategies across training, inference, and post-...