gpt-fast
Batching / dynamic batching
Thanks for the amazing work! It really is super fast at bs=1.
Could batched use cases (bs>1) or dynamic batching be supported?