inference
ENH: accelerate PyTorch model generate stream
Optimize the tokenizer.decode call used when streaming generated tokens.
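The issue does not say which optimization was intended. One common approach, sketched here as an assumption rather than as this project's implementation, is to stop re-decoding the full token sequence on every step and decode only a short trailing window. The names `stream_text` and `toy_decode` are hypothetical, and `toy_decode` is a byte-level stand-in for a real `tokenizer.decode`.

```python
from typing import Callable, Iterable, Iterator, List


def toy_decode(ids: List[int]) -> str:
    # Hypothetical stand-in for tokenizer.decode: each token id is one UTF-8 byte.
    # An incomplete multi-byte character decodes to U+FFFD, much like a
    # byte-level BPE tokenizer does on a partial character.
    return bytes(ids).decode("utf-8", errors="replace")


def stream_text(
    token_ids: Iterable[int],
    decode: Callable[[List[int]], str],
) -> Iterator[str]:
    """Yield newly generated text, decoding only a small window per step."""
    ids: List[int] = []
    prefix_offset = 0  # start of the context window that gets re-decoded
    read_offset = 0    # tokens before this index have already been emitted
    for tok in token_ids:
        ids.append(tok)
        # Decode the already-emitted part of the window and the whole window,
        # then emit only the difference between the two strings.
        prefix_text = decode(ids[prefix_offset:read_offset])
        window_text = decode(ids[prefix_offset:])
        if len(window_text) > len(prefix_text) and not window_text.endswith("\ufffd"):
            yield window_text[len(prefix_text):]
            prefix_offset = read_offset
            read_offset = len(ids)
        # Otherwise the last character is still incomplete: wait for more tokens.


if __name__ == "__main__":
    sample_ids = list("héllo €".encode("utf-8"))
    for piece in stream_text(sample_ids, toy_decode):
        print(piece, end="", flush=True)
    print()
```

Each step decodes at most the tokens since the previous emission plus a short prefix, so the per-token cost stays roughly constant instead of growing with the sequence length. The prefix is kept because some tokenizers decode a token differently depending on its left neighbor. This sketch does not flush a trailing incomplete character when the stream ends.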
This issue is stale because it has been open for 7 days with no activity.
This issue was closed because it has been inactive for 5 days since being marked as stale.