Forkoz comments

Results 474 comments of


                                            Forkoz

Add option for fp16 kv cache

Changed the fp32s near KV_cache to FP16 and didn't experience any loss in quality. Unfortunately the memory stayed about the same. A lot of other calculations are also done in...

Add option for fp16 kv cache

Where else did you change it? I added both model.half and changed the float32s to float16s. The memory can still spike.

Added gradio app

It worked for me but I'm not sure why it was locked to only the demo audio. I just made it editable and then the UI works.

``` with gr.Row(): with gr.Column(scale=2): input_audio = gr.Audio(value="./demo/84_121550_000074_000000.wav", label="Input Audio", type="filepath", interactive=False) with gr.Group(): ``` I just changed interactive to true.

Added gradio app

It worked like this. I didn't check the console, sometimes there are exceptions. Usually reloading fixes it. I was using it all day yesterday after making it listen on more...

Added gradio app

New version is great. The voices sound a lot better. Only thing is that I get a bit of the last word it's continuing from in my actual prompt and...

Added gradio app

whisperX is missing some arguments so I haven't been able to try it. Using larger whisper models also helped with the timestamps. I guess the only other foible I noticed...

Added gradio app

I got it all working, had to use the correct version of whisperx. The one from pip didn't work. Had to use git version. Sometimes I get a bug that...

Added gradio app

BTW, I had already used the new model this morning. Output is fairly similar. I have not tried with less batches. Any ideas on why sometimes the output is really...

Added gradio app

>The new model should work well with small batch sizes, and therefore requires less VRAM and inference time. It still gives better results with 4. Gave *ok* results at 1...