Add RELLM experimental LLM decoding

Open vowelparrot opened this issue 1 year ago • 2 comments

RELLM is a library that wraps local HuggingFace pipeline models for structured decoding.

It works by generating tokens one at a time. At each step, it masks tokens that don't conform to the provided partial regular expression.
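
A minimal sketch of that masking loop, for illustration only. This is not RELLM's actual API: the toy vocabulary, the fixed scores standing in for model logits, and the helper names (`is_valid_prefix`, `toy_scores`, `constrained_decode`) are all assumptions. Real partial-regex matching is approximated here with one character class per position, equivalent to the pattern `\d{3}-\d{4}`.

```python
import re

# Hypothetical fixed-length pattern: one character class per position.
# Equivalent to r"\d{3}-\d{4}"; a stand-in for true partial regex matching.
POSITION_CLASSES = [r"\d"] * 3 + [r"-"] + [r"\d"] * 4

# Toy vocabulary with scores standing in for model logits (assumed values).
# Unconstrained, this "model" would always pick "hello".
TOKEN_SCORES = {
    "hello": 5.0, "a": 4.0, " ": 3.5, "55": 3.0,
    "5": 2.0, "-": 1.5, "12": 1.0, "1234": 0.5,
}


def is_valid_prefix(text):
    """Return True if text could still be extended into a full match."""
    if len(text) > len(POSITION_CLASSES):
        return False
    return all(
        re.fullmatch(cls, ch) is not None
        for cls, ch in zip(POSITION_CLASSES, text)
    )


def toy_scores(prefix):
    """Stand-in for a model forward pass; ignores the prefix entirely."""
    return dict(TOKEN_SCORES)


def constrained_decode(max_steps=16):
    """Greedy decoding, one token per step, with non-conforming tokens masked."""
    text = ""
    for _ in range(max_steps):
        if len(text) == len(POSITION_CLASSES):
            break  # pattern fully matched
        scores = toy_scores(text)
        # Mask: keep only tokens that leave the text a valid partial match.
        allowed = {
            tok: score
            for tok, score in scores.items()
            if is_valid_prefix(text + tok)
        }
        if not allowed:
            break  # no token can continue the pattern
        text += max(allowed, key=allowed.get)
    return text


print(constrained_decode())  # 555-5555
```

Even though the toy scores always favor `hello`, the mask removes it at every step, so the output is forced into the pattern.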

I tried it out and got a small HuggingFace model to generate structured output easily :)

vowelparrot, May 05 '23 16:05

@vowelparrot Is the output from the first cell correct? I think it's supposed to illustrate that the response that's generated isn't correct, but I don't see it generating an auto-completion response

image

eyurtsev, May 09 '23 14:05

Looks good to me @vowelparrot, @nfcampos left a comment.

eyurtsev, May 09 '23 14:05

Ya the model just repeats the question there. It's a really small model to fit on my laptop

vowelparrot, May 14 '23 19:05