Add a client for Gazelle

Open handrew opened this issue 9 months ago • 2 comments

I created an abstraction over some of the code in the example notebooks (infer, infer-quantized) so that the user can simply write:

from gazelle import GazelleClient
# `audio` is assumed to have been loaded beforehand (loading is not shown here)
client = GazelleClient(quantization="8-bit")
resp = client.infer(audio, prompt="What does the following audio say? \n <|audio|>")
print(resp)
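
For readers who want a feel for the shape of such a wrapper, here is a minimal sketch of the pattern, not the actual GazelleClient. Everything in it is an assumption made for illustration: the `SketchClient` class, the `echo_backend` placeholder that stands in for the model, the set of accepted quantization values, and the check that the prompt contains the `<|audio|>` placeholder.

```python
from typing import Callable, Optional, Sequence

AUDIO_TOKEN = "<|audio|>"
# Assumed set of options; the original post only shows "8-bit".
SUPPORTED_QUANTIZATION = (None, "8-bit")

# A backend takes (audio, prompt, quantization) and returns text.
Backend = Callable[[Sequence[float], str, Optional[str]], str]


class SketchClient:
    """Hypothetical stand-in for a client wrapper; not the real GazelleClient."""

    def __init__(self, backend: Backend, quantization: Optional[str] = None):
        if quantization not in SUPPORTED_QUANTIZATION:
            raise ValueError(f"unsupported quantization: {quantization!r}")
        self.quantization = quantization
        self._backend = backend

    def infer(self, audio: Sequence[float], prompt: str) -> str:
        # Assumption: the prompt must mark where the audio goes.
        if AUDIO_TOKEN not in prompt:
            raise ValueError(f"prompt must contain {AUDIO_TOKEN}")
        return self._backend(audio, prompt, self.quantization)


def echo_backend(audio: Sequence[float], prompt: str, quantization: Optional[str]) -> str:
    # Placeholder: a real backend would load the model and run inference here.
    return f"{len(audio)} samples, quantization={quantization}"


client = SketchClient(echo_backend, quantization="8-bit")
resp = client.infer(
    [0.0, 0.1, -0.1],
    prompt="What does the following audio say? \n <|audio|>",
)
print(resp)
```

Passing the backend in as a callable keeps the wrapper easy to test on a machine without a GPU, which may help given the limited resources mentioned in this post.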

Because of limited resources, I have not yet been able to test it extensively. It seems to work reliably with quantization, but without quantization I occasionally get errors from the conv layer.

Would be happy to take feedback!

handrew, May 10 '24 22:05