Alexey Korepanov

Results 43 comments of Alexey Korepanov

text2video and text / image to 3d object are different tasks and this issue is still relevant it would be great if there was a stable-dreamfusion in a1111 I tried...

These errors can be caused if model inference is too slow on your computer. Also there is very basic example without handling all the possible problems. There are some hacks...

Right now I believe that best way to improve efficiency of the model is to work on model speed. Model works slow for older processors, so some hacks with some...

Yeah, I've tried it using onnxruntime-web in web worker. It works fine. I didn't look at footprint, only on CPU usage. I can calculate it later

My code is currently in my private gitlab repository. I think I'll be able to share this soon.

So, I've created draft PR to show my changes. I'll be glad to hear your feedback. This PR requires more changes to be compatible with current code, so it's draft...

@Penguin168 The `Offline` model processes audio completely - we put the full audio spectrogram into the model. The `Streaming` model processes audio frame by frame. Only the streaming model can...

I didn't tested it on phones, but soon i'm going to try it. I think, that changing hop_size can lead to quallity reducing. I thought about trying quantization to reduce...

Hi! @zeynepgulhanuslu I got a robotic voice if the model did not work fast enough and the AudioWorklet recieved audio samples too late. So it can be due to web...

Yeah, that would be great. Do you build wasm module using CPP? 7ms seems enough, because you have 10ms window to run model