Gabriel Mongaras comments

Results: 27 comments of Gabriel Mongaras

i have problem with Initializing custom image movement module

Nice!! Glad you got it working. I know LFS is a bit weird sometimes.

i have problem with Initializing custom image movement module

I initially had a feature where you could change the voice model given sample audio, but since the custom voice model was way too large and way too slow, I...

i have problem with Initializing custom image movement module

Sorry, but at the moment I don't have too much time to work on getting the custom voice module to work. The main issue is it was very experimental and...

i have problem with Initializing custom image movement module

Sorry, but I'm not quite sure what you mean by this.

i have problem with Initializing custom image movement module

Aside from blinking and lip syncing, I didn't add any other movement to the image. It would be a cool feature to add though!

p_mean_variance mean calculation

Oh yeah, that makes sense! As I've learned more about diffusion models, it looks like predicting x_0 produces better results as one can skip steps like in DDIM.
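To make the step-skipping point concrete, here is a minimal scalar sketch of a deterministic DDIM-style jump (eta = 0) driven by an x_0 prediction. The names `ddim_jump`, `abar_t`, and `abar_s` are illustrative assumptions and are not taken from the repository; `abar` stands for the cumulative product of alphas in the usual DDPM notation.

```python
import math

def ddim_jump(x_t, x0_pred, abar_t, abar_s):
    """Jump from noise level abar_t to any less noisy level abar_s in one step.

    Hypothetical helper: x0_pred is the model's estimate of the clean sample.
    """
    # Noise implied by the current sample and the x_0 estimate.
    eps = (x_t - math.sqrt(abar_t) * x0_pred) / math.sqrt(1.0 - abar_t)
    # Re-noise the x_0 estimate to the target level with the same noise.
    return math.sqrt(abar_s) * x0_pred + math.sqrt(1.0 - abar_s) * eps

# Toy check with a known clean value and known noise.
x0, noise = 0.8, -0.3
abar_t, abar_s = 0.1, 0.6  # abar_s need not be the adjacent timestep
x_t = math.sqrt(abar_t) * x0 + math.sqrt(1.0 - abar_t) * noise
x_s = ddim_jump(x_t, x0, abar_t, abar_s)
expected = math.sqrt(abar_s) * x0 + math.sqrt(1.0 - abar_s) * noise
```

With a perfect x_0 estimate the jump lands exactly on the sample at the target noise level, which is why intermediate timesteps can be skipped. With a learned estimate it is only approximate.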

RuntimeError: einsum() operand subscript must be in range [a, z] but found C for operand 1

Thanks for letting me know! Are you using an older version of PyTorch? I think einsum used to be limited to lowercase characters as this GitHub issue shows: https://github.com/pytorch/pytorch/issues/21412 I...

RuntimeError: einsum() operand subscript must be in range [a, z] but found C for operand 1

Changing capital C to lowercase c will probably run into errors since the einsum will do the multiplication incorrectly. Try changing it from capital C to lowercase d: X =...
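As a rough illustration of why the replacement letter matters, the sketch below uses NumPy, whose einsum accepts uppercase subscripts, so all three variants can be compared side by side. The subscript strings and shapes are made up for the example and are not the ones from the repository.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 3))  # labelled "nc"
B = rng.standard_normal((4, 3))  # labelled "nC" in this hypothetical original

# Hypothetical original: c and C are two independent axes.
original = np.einsum("nc,nC->cC", A, B)

# Renaming C to an unused letter keeps the same contraction.
renamed_to_d = np.einsum("nc,nd->cd", A, B)

# Renaming C to c reuses a label, so the two axes are tied together
# and the result collapses to the diagonal of the original.
renamed_to_c = np.einsum("nc,nc->c", A, B)
```

Here `renamed_to_d` matches `original`, while `renamed_to_c` has a different shape and only holds the diagonal entries. If the two axes had different sizes, the reused label would raise an error instead.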

RuntimeError: einsum() operand subscript must be in range [a, z] but found C for operand 1

I added a training log here: https://github.com/gmongaras/Diffusion_models_from_scratch/blob/main/results/res_res_partial_log.out What do you mean when you say you moved this project to a segmentation task? Are you using a pre-trained model and finetuning...

RuntimeError: einsum() operand subscript must be in range [a, z] but found C for operand 1

That's just for logging. Instead of outputting the latest loss value (-1), I output the mean of the latest 10 losses (-10:) to reduce noise in the output loss value.
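A minimal sketch of that logging choice, assuming the losses are kept in a plain Python list (the helper name `smoothed_loss` is hypothetical):

```python
from statistics import mean

def smoothed_loss(losses, window=10):
    """Mean of the most recent `window` losses (fewer if the history is shorter)."""
    return mean(losses[-window:])

losses = [float(i) for i in range(1, 21)]  # stand-in loss history: 1.0 .. 20.0
latest = losses[-1]                # single most recent value, noisy in practice
smoothed = smoothed_loss(losses)   # average of the last 10 values
```

The slice `[-10:]` simply returns the whole list when fewer than 10 losses have been recorded, so the helper also works early in training as long as the list is not empty.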