Marianna comments

Results 24 comments of


                                            Marianna

[COMMUNITY] Call for contributions / tutorials / examples

Hi! Can I do the image registration? :)

Shard generator

> This is more efficient since it doesn't bring the data in memory: > > ```python > for i in range(len(dset) // batch_size) > start = i * batch_size >...

add audio spectrogram transformer, and full audio clip

Hi @lucidrains ! Can you use riffusion spectrogram as input in the `encode_image` function?

add audio spectrogram transformer, and full audio clip

@lucidrains no, unfortunately I get this error: `RuntimeError: Given groups=1, weight of size [768, 3, 16, 16], expected input[2, 1, 32, 1024] to have 3 channels, but got 1 channels...

add audio spectrogram transformer, and full audio clip

@lucidrains I checked again now it works! (I just forgot that I've made changes to the code) sorry, that's my bad!

add audio spectrogram transformer, and full audio clip

@lucidrains yes, I changed back to 1 channel and it worked, but also I tried to run it over a batch of images but it didn't work :(

add audio spectrogram transformer, and full audio clip

> @marianna13 i'll add the `MulanCoCa` version tomorrow too, so we can possibly leap frog the state of the art going on within google That's great! Thank you :)

add audio spectrogram transformer, and full audio clip

Hi @lucidrains ! Sorry for the late reply. Here's the code I'm using: ```python import torch import cv2 from src.open_clip import AudioCLIP, CLIPAudioCfg, CLIPTextCfg import webdataset as wds import sys...

add audio spectrogram transformer, and full audio clip

@lucidrains it works! Thank you! :)

add audio spectrogram transformer, and full audio clip

Hey @lucidrains, I tried to train a model with a small fraction of the dataset but it gets stuck at the first epoch and then gets killed. I can post...