transformers icon indicating copy to clipboard operation
transformers copied to clipboard

Blip dynamic input resolution

Open zafstojano opened this issue 9 months ago • 1 comments

What does this PR do?

This PR adds ability to handle dynamic input image resolutions to the BLIP family of models.

Addresses #30579

Before submitting

  • [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • [x] Did you read the contributor guideline, Pull Request section?
  • [x] Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
  • [x] Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
  • [x] Did you write any new necessary tests?

Ping: @amyeroberts

zafstojano avatar May 08 '24 20:05 zafstojano

@amyeroberts good point!

the tests now also have a check for the textual content.

zafstojano avatar May 11 '24 08:05 zafstojano

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.