text-generation-inference
text-generation-inference copied to clipboard
feat(server): avoid manipulating position_ids for non-applicable models
Currently, position_ids are always maintained/updated in the CausalLM case but this is unnecessary for models like BLOOM which don't use them.