Gabriel Mongaras
Nice catch! Looks like I got the indexing wrong here, encoding the batch dimension as the "time" dimension. I suppose that means the position in the diffusion process becomes useless...
I added a few warnings to mention the issue when running the code: https://github.com/gmongaras/Diffusion_models_from_scratch/blob/f2e76317d70eb565a953f4959f5781a010177318/src/blocks/PositionalEncoding.py#L19 I also changed the code to correctly index the batch and time dimensions: https://github.com/gmongaras/Diffusion_models_from_scratch/blob/f2e76317d70eb565a953f4959f5781a010177318/src/blocks/PositionalEncoding.py#L34 However since...
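For context, a minimal sketch of the intended behavior (this is an illustrative reimplementation, assuming a standard sinusoidal encoding over diffusion timesteps, not the repo's exact code): each timestep `t` in the batch should get its own encoding row, so the time values index the batch dimension per element rather than the batch being treated as the time axis.

```python
import math
import numpy as np

def timestep_encoding(t: np.ndarray, dim: int) -> np.ndarray:
    """t: (batch,) integer timesteps -> (batch, dim) sinusoidal encodings.

    Each batch element's timestep selects its own encoding row, which is
    the indexing the fix linked above restores.
    """
    half = dim // 2
    # Geometrically spaced frequencies, as in the original Transformer encoding
    freqs = np.exp(-math.log(10000.0) * np.arange(half) / half)
    args = t.astype(float)[:, None] * freqs[None, :]  # (batch, half)
    return np.concatenate([np.sin(args), np.cos(args)], axis=-1)

enc = timestep_encoding(np.array([0, 10, 500]), 128)
# enc.shape == (3, 128); distinct timesteps map to distinct rows
```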
> Apparently (tell me if I am wrong):
> — "loadImagenet64.py" needs "Imagenet64_train_part1.zip" and "Imagenet64_train_part2.zip".
> Imagenet64x64 does not have these files. It rather has:
> train_data_batch_1, train_data_batch_2, train_data_batch_3... etc...
Oh, that makes sense. So the Kaggle dataset is probably in a different format from the ImageNet dataset, which is why you're running into issues loading the data. I...
I think `128116x12288` is batch size by image size. So you have a batch of `128116` images, each flattened to `12288` values, which is exactly 3x64x64.
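A quick sanity check of that arithmetic (the shapes here are taken from the comment above; the tiny stand-in batch is just for illustration):

```python
import numpy as np

batch_size, flat_size = 128116, 12288
# 12288 flattened values per image correspond to 3-channel 64x64 images
assert flat_size == 3 * 64 * 64

# A small stand-in batch with the same per-image size reshapes cleanly:
data = np.zeros((4, flat_size), dtype=np.uint8)  # pretend 4 images
images = data.reshape(-1, 3, 64, 64)
print(images.shape)  # (4, 3, 64, 64)
```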
If I'm remembering correctly, when I was testing out the voice cloning model, I used this path to reference the model I wanted to load in. However, since the voice...
I had this exact same issue. The main problem stems from the internal tokenizer padding on the right, thus using the pad tokens to generate the output. You can fix...
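To illustrate the padding issue in plain Python (the actual fix depends on the tokenizer library in use; for Hugging Face tokenizers it is typically setting `tokenizer.padding_side = "left"` for generation): with right padding, the pad tokens sit between the prompt and the generated continuation, so the model continues from pads instead of from the real prompt.

```python
PAD = 0  # hypothetical pad token id for illustration

def pad_batch(seqs, side="right"):
    """Pad token-id sequences to equal length on the given side."""
    max_len = max(len(s) for s in seqs)
    out = []
    for s in seqs:
        pads = [PAD] * (max_len - len(s))
        out.append(s + pads if side == "right" else pads + s)
    return out

prompts = [[5, 6], [7, 8, 9, 10]]
pad_batch(prompts, "right")  # [[5, 6, 0, 0], ...] — generation would start after pads
pad_batch(prompts, "left")   # [[0, 0, 5, 6], ...] — last token is a real prompt token
```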