Alexey Korepanov comments

Results: 43 comments by Alexey Korepanov

[feature request] Text to 3d stable dreamfusion !!

text2video and text/image to 3D object are different tasks, and this issue is still relevant. It would be great if there was a stable-dreamfusion in a1111. I tried...

Raw examples of running DeepFilterNet3 on the Web (in JS)

These errors can occur if model inference is too slow on your computer. Also, this is a very basic example that does not handle all the possible problems. There are some hacks...

Raw examples of running DeepFilterNet3 on the Web (in JS)

Right now I believe that the best way to improve the efficiency of the model is to work on model speed. The model runs slowly on older processors, so some hacks with some...

I've made a torch reimplementation for both offline and streaming implementation. Would you be interested in accepting this contribution?

Yeah, I've tried it using onnxruntime-web in a web worker. It works fine. I didn't look at the footprint, only at CPU usage. I can calculate it later.

I've made a torch reimplementation for both offline and streaming implementation. Would you be interested in accepting this contribution?

My code is currently in my private gitlab repository. I think I'll be able to share this soon.

I've made a torch reimplementation for both offline and streaming implementation. Would you be interested in accepting this contribution?

So, I've created a draft PR to show my changes. I'll be glad to hear your feedback. This PR requires more changes to be compatible with the current code, so it's a draft...

I've made a torch reimplementation for both offline and streaming implementation. Would you be interested in accepting this contribution?

@Penguin168 The `Offline` model processes the whole audio at once - we put the full audio spectrogram into the model. The `Streaming` model processes audio frame by frame. Only the streaming model can...
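
The offline versus frame-by-frame distinction in the comment above can be illustrated with a minimal sketch. This is not the DeepFilterNet code: `toy_step`, `run_offline`, and `StreamingRunner` are hypothetical names, and the "model" is only an exponential smoothing over time, chosen because it carries state from one frame to the next the way a recurrent model would.

```python
from typing import List, Tuple

Frame = List[float]  # one spectrogram frame: a value per frequency bin


def toy_step(frame: Frame, state: Frame, alpha: float = 0.5) -> Tuple[Frame, Frame]:
    """Hypothetical per-frame 'model': exponential smoothing across time.

    Returns (output, new_state). A real model would be a neural network;
    this stand-in only shows how state is threaded between frames.
    """
    new_state = [alpha * x + (1.0 - alpha) * s for x, s in zip(frame, state)]
    return new_state, new_state


def run_offline(frames: List[Frame]) -> List[Frame]:
    """Offline mode: the full spectrogram is available before processing starts."""
    state = [0.0] * len(frames[0])
    outputs = []
    for frame in frames:
        out, state = toy_step(frame, state)
        outputs.append(out)
    return outputs


class StreamingRunner:
    """Streaming mode: frames arrive one at a time and state is kept between calls."""

    def __init__(self, n_bins: int) -> None:
        self.state: Frame = [0.0] * n_bins

    def push(self, frame: Frame) -> Frame:
        out, self.state = toy_step(frame, self.state)
        return out


if __name__ == "__main__":
    frames = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
    runner = StreamingRunner(n_bins=2)
    streamed = [runner.push(f) for f in frames]
    # For this toy model both modes produce identical output.
    print(streamed == run_offline(frames))
```

For this toy model the two modes agree exactly; whether that holds for a real network depends on whether any layer looks ahead in time, which this sketch does not model.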

I've made a torch reimplementation for both offline and streaming implementation. Would you be interested in accepting this contribution?

I haven't tested it on phones, but I'm going to try it soon. I think that changing hop_size can reduce quality. I thought about trying quantization to reduce...

I've made a torch reimplementation for both offline and streaming implementation. Would you be interested in accepting this contribution?

Hi! @zeynepgulhanuslu I got a robotic voice when the model did not run fast enough and the AudioWorklet received audio samples too late. So it can be due to web...

I've made a torch reimplementation for both offline and streaming implementation. Would you be interested in accepting this contribution?

Yeah, that would be great. Do you build the wasm module using C++? 7 ms seems to be enough, because you have a 10 ms window to run the model.
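
The timing argument above (7 ms of inference inside a 10 ms window) can be written out as a small check. This is only a sketch of the arithmetic: `realtime_headroom` is a hypothetical helper, and the numbers are the ones quoted in the comment, not new measurements.

```python
from typing import Tuple


def realtime_headroom(inference_ms: float, window_ms: float) -> Tuple[float, float, bool]:
    """Return (real-time factor, slack in ms, fits-in-window flag).

    The real-time factor is inference time divided by the audio duration
    covered by one window; a value below 1.0 means the model keeps up.
    """
    if window_ms <= 0:
        raise ValueError("window_ms must be positive")
    rtf = inference_ms / window_ms
    slack_ms = window_ms - inference_ms
    return rtf, slack_ms, rtf < 1.0


if __name__ == "__main__":
    rtf, slack_ms, fits = realtime_headroom(inference_ms=7.0, window_ms=10.0)
    print(f"real-time factor: {rtf:.2f}, slack: {slack_ms:.1f} ms, fits: {fits}")
```

The slack is what remains for everything other than inference, such as passing samples between threads, so an average of 7 ms fits the window but leaves limited room for slower frames.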