LLaDA
Does LLaDA support batch inference now?
Should this “1” be batch_size?
Apologies, we do not provide batch-inference code at the moment. If you want to run batch inference, you will need to adapt the attention-mask design yourself.
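For anyone attempting this themselves, one possible starting point is sketched below: left-pad variable-length prompts to a common length, append `gen_length` mask tokens per sequence, and build a matching attention mask so padded positions are ignored. This is only an illustration of the general idea, not code from the LLaDA repo; `PAD_ID` is a placeholder, and `MASK_ID` should be taken from your actual tokenizer/config rather than hard-coded.

```python
import torch

# Hypothetical constants -- verify both against your LLaDA checkpoint's
# tokenizer/config before using. These are NOT guaranteed to be correct.
MASK_ID = 126336  # mask token id used in the public LLaDA repo (verify)
PAD_ID = 0        # placeholder pad token id

def build_batched_input(prompts, gen_length):
    """Left-pad a list of 1-D prompt tensors to a common length, append
    gen_length mask tokens to each, and return (input_ids, attention_mask)."""
    batch_size = len(prompts)
    max_prompt_len = max(p.shape[0] for p in prompts)
    total_len = max_prompt_len + gen_length

    # Start with everything set to the mask token; the generation region
    # (last gen_length positions) stays masked for the diffusion process.
    x = torch.full((batch_size, total_len), MASK_ID, dtype=torch.long)
    attn_mask = torch.ones((batch_size, total_len), dtype=torch.long)

    for i, p in enumerate(prompts):
        pad = max_prompt_len - p.shape[0]
        if pad > 0:
            x[i, :pad] = PAD_ID
            attn_mask[i, :pad] = 0  # padded positions should be ignored
        x[i, pad:max_prompt_len] = p
    return x, attn_mask
```

The batched `x` would then replace the single-sequence tensor currently built with a leading dimension of 1, and `attn_mask` would be passed through to the model's forward call so attention does not attend to padding.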
Quick follow-up on this: why should the 1 not be batch_size? If you have suggestions on how to update this, we'd be happy to debug the issue ourselves and propose a fix.