Matthijs Hollemans
Add support for Vision's `imageCropAndScaleOption` as an alternative for defining the crop rect by hand.
# What does this PR do? Add the SpeechT5 model to Transformers. See also https://github.com/huggingface/transformers/issues/17569
# What does this PR do? Adds fine-tuning support for SpeechT5, in particular the TTS model. The loss function is a combination of L1 loss for the mel-spectrograms, BCE for...
# What does this PR do? Recently the `audio_utils.py` file was added to Transformers to provide shared functions for audio processing such as STFT. This PR aims to clean up...
# What does this PR do? Adds the VITS model for text-to-speech, in particular to support the MMS-TTS checkpoints (which use the same model architecture but a different tokenizer). Fixes...
Hi, I've added a link to my new book. Feel free to tweak the description! :smiley:
Hi there, I've added a link to my new book. Cheers!
Hi, I was trying to run your very interesting model on the input images from the project page, but the generated IRs are always kind of the same and do...
I think this may be related to the OpenGL context. The information I received from the user is: - OpenGL 2, DirectX 12, Chipset (graphics) Intel GMA...
I messed up and added the `BinaryData.cpp` files to `git lfs` since I had expected them to be larger than they were. Turns out LFS wasn't actually needed for these...