
Complex beam search

Open haramjo opened this issue 2 years ago • 4 comments

Hello! Thank you for this incredible project. It has been extremely useful.

I've noticed that while the current beam search method is quite effective, there are some variations like "group beam search" and "diverse beam search" that might provide improved results in some scenarios. Would it be possible to consider supporting these methods in future updates?

haramjo avatar Oct 23 '23 23:10 haramjo

"Diverse beam search" is supported in TensorRT-LLM; you can enable it through the beam_search_diversity_rate parameter.

As I recall, "group beam search" is part of "diverse beam search", where you can decide how many groups to use. That is what TensorRT-LLM does not support now: the number of groups is fixed when beam_search_diversity_rate is set to a non-default value. We will consider how to extend it in the future.
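For readers unfamiliar with the idea, here is a minimal sketch of how a diversity rate can push beam groups apart in a single decoding step. It is a generic Hamming-style penalty written in plain Python, not TensorRT-LLM's kernel; the function names, the `num_groups` and `beams_per_group` arguments, and the simplification that every group scores the same next-token distribution are all assumptions made for illustration.

```python
import math
from collections import Counter


def apply_diversity_penalty(log_probs, tokens_taken, diversity_rate):
    """Subtract diversity_rate once for each earlier-group pick of a token."""
    counts = Counter(tokens_taken)
    return [lp - diversity_rate * counts[tok] for tok, lp in enumerate(log_probs)]


def diverse_step(log_probs, num_groups, beams_per_group, diversity_rate):
    """One illustrative step: groups pick their top tokens one after another.

    Assumption: all groups see the same next-token log-probabilities,
    as they would on the first step from a shared prompt.
    """
    taken = []   # tokens already picked by earlier groups in this step
    groups = []
    for _ in range(num_groups):
        scores = apply_diversity_penalty(log_probs, taken, diversity_rate)
        ranked = sorted(range(len(scores)), key=lambda t: scores[t], reverse=True)
        picks = ranked[:beams_per_group]
        groups.append(picks)
        taken.extend(picks)
    return groups


# Toy vocabulary of four tokens with probabilities 0.5, 0.3, 0.15, 0.05.
toy_log_probs = [math.log(p) for p in (0.5, 0.3, 0.15, 0.05)]

no_penalty = diverse_step(toy_log_probs, num_groups=2, beams_per_group=1, diversity_rate=0.0)
with_penalty = diverse_step(toy_log_probs, num_groups=2, beams_per_group=1, diversity_rate=1.0)
print(no_penalty, with_penalty)
```

With a rate of 0.0 both groups pick the most likely token; with a rate of 1.0 the second group is steered to a different token. How many groups exist is exactly the knob described above as fixed in TensorRT-LLM.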

byshiue avatar Oct 24 '23 00:10 byshiue

Oh, sorry, I meant to say "constraint beam search" and "group beam search".

haramjo avatar Oct 24 '23 02:10 haramjo

We don't support constraint beam search, but we support group beam search now.

byshiue avatar Nov 24 '23 09:11 byshiue

Hi folks! Are there updates on constraint beam search? It's something we would like to integrate.

ChristianPala avatar Feb 13 '24 19:02 ChristianPala

Likewise. Imposing constraints on beam search (like HF's decoding strategies) would be invaluable.

dhruvmullick avatar May 15 '24 21:05 dhruvmullick

Thank you for the request. To avoid mixing several topics in one issue and causing confusion, please open a separate feature request for constraint beam search. Thank you for your help.

byshiue avatar May 17 '24 07:05 byshiue