Nicolas Patry comments

Results 978 comments of


                                            Nicolas Patry

Output `past_key_values` from `TextGenerationPipeline`.

Oh no that cannot change. But the idea, is that you can call it for a very long range (like `max_new_tokens=100`) which will use the past_key_values over and over without...

Output `past_key_values` from `TextGenerationPipeline`.

Yes, it's intended goal is to decide when to stop generating tokens (hence the return type, false means continue generating, true means stop, iteration will stop when ANY criteria wants...

[WHISPER] Add language to whisper output

Would that work for you ?

[WHISPER] Add language to whisper output

> into a simple function that you call in the preprocess? Sure, I'm not sure I understand how that cleans up the audio trimming, but we can definitely abstract away.

[WHISPER] Add language to whisper output

> could be awesome to have model.detect_language instead of all the mess above and dependencies on whisper! If you have some good ideas, please suggest them instead of waving them...

[Pipelines] Problems with an image-to-text fine-tuned model

I'm not well versed with `Git` as a model. Pipelines are usually agnostic to actual models. As long as model X is `AutoModelForVision2Seq` it should work out of the box....

[Pipelines] Problems with an image-to-text fine-tuned model

Yes that's exactly it. In the absence of tags the hub will check the config and assign a pipeline based on architecture format `ForXX`, just like the pipeline does.

[Pipelines] Problems with an image-to-text fine-tuned model

Do you have a sample script to make it work for captionning ?

[Pipelines] Problems with an image-to-text fine-tuned model

Seems to me that the colab does pretty much what the pipeline does: https://github.com/huggingface/transformers/blob/main/src/transformers/pipelines/image_to_text.py#L114 Any reason not to implement `ForVision2Seq` ?

[Pipelines] Problems with an image-to-text fine-tuned model

> It is a custom model but has the same API as the AutoModelForVision2Seq class So make it `ForVision2Seq`, no ? As long as it upholds the invariant (signature +...