blog
blog copied to clipboard
Public repo for HF blog posts
This link [smangrul/peft-lora-sd-dreambooth](https://huggingface.co/spaces/smangrul/peft-lora-sd-dreambooth) from https://huggingface.co/blog/peft doesn't appear to exist anymore.
I followed this blog post and used it to fine tune the whisper model using a custom data set, but after training when trying to run this command `trainer.push_to_hub(**kwargs)` it...
The UNet model should be epsilon_theta, not z. z is noise.
typo
I followed this blog post and used it to fine tune the whisper model using a custom data set, but after training when trying to run this command `trainer.push_to_hub(**kwargs)` it...
"I read the blog post (https://huggingface.co/blog/zh/time-series-transformers) and I'm really confused about the evaluation metrics. I would be extremely grateful if I could get some help. Specifically, I'm unsure about how...
Hello, quick question about the Mistral-8x7b inference, i've read it online that during inference it's using a router network to select just 2 of the models to produce the outputs,...
Dear @sanchit-gandhi, I was following your tutorial, [Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers](https://huggingface.co/blog/fine-tune-whisper), to fine-tune Whisper with a dataset in the Amharic language. Amharic is used in Whisper...
Thanks for the great blog (and quick turnaround) explaining MoE. Clearly MoE have a speed advantage over 2017 transformers, because their inference activates only a portion of the model. As...
Unity 2023 has moved Textmesh pro functionality to com.unity.ugui. Requiring the Textmeshpro package conflicts with this resource.