Greysuki

Results 16 issues of Greysuki

I have noticed that when the input audio file exceeds approximately 30 seconds in duration, the resulting output file contains only the first 10 seconds or the last 10 seconds...

- [x] [Stabilizing Timestamps for Whisper](https://github.com/jianfch/stable-ts) - [ ] [whisper-webui](https://huggingface.co/spaces/aadnk/whisper-webui) - [ ] [faster-whisper](https://huggingface.co/spaces/aadnk/faster-whisper-webui) This should be more easy to import model than modify every thing.

enhancement
long-term goal

- [ ] [speechbrain](https://github.com/speechbrain/speechbrain) - [ ] [Fine-tuning or using Whisper, wav2vec2, HuBERT](https://colab.research.google.com/drive/17Hu1pxqhfMisjkSgmM2CnZxfqDyn2hSY?usp=sharing) - [ ] [Wav2vec 2](https://ai.facebook.com/blog/wav2vec-20-learning-the-structure-of-speech-from-raw-audio/) - [ ] [wav2vec-U](https://github.com/facebookresearch/fairseq/blob/main/examples/wav2vec/unsupervised/README.md)

enhancement
long-term goal

enhancement
next version

enhancement
future work

enhancement
long-term goal

https://www.assemblyai.com/blog/getting-started-with-huggingfaces-gradio/