Norm Inui
Norm Inui
### [Question] As [here](https://pytorch.org/serve/management_api.html#scale-workers) mentioned: > max_worker is the parameter that TorchServe will make no more than this number of workers for the specified model. Does that mean that TorchServe...
I notice the attention mask in the function `generate_and_tokenize_prompt` is weird. If we set attention masks like ```python "attention_mask": [1] * (len(full_tokens)) ``` then, the padded token will be counted...
Thank you very much for your awesome work. Would you mind providing the annotated image download links?
### Issue Description I host an Image-to-Text pipeline with [this](https://huggingface.co/Norm/nougat-latex-base) model for a while. The Inference API widget worked quite well until recently when some developers reported that the inference...