EDGE icon indicating copy to clipboard operation
EDGE copied to clipboard

Why use 2 time tokens?

Open KevinGoodman opened this issue 2 years ago • 2 comments

self.to_time_tokens = nn.Sequential(
    nn.Linear(latent_dim * 4, latent_dim * 2),  # 2 time tokens
    Rearrange("b (r d) -> b r d", r=2),
)

In L278-L281 of model/model.py, what is the purpose of making 2 time tokens instead of just 1 time token ?

KevinGoodman avatar Jun 29 '23 12:06 KevinGoodman

I'm also confuse about it. And in L376c = torch.cat((cond_tokens, t_tokens), dim=-2), why concat t tokens and cond tokens in time dimension?

xuzheyuan624 avatar Dec 27 '23 03:12 xuzheyuan624

@xuzheyuan624 it just concatenates the music condition and the time tokens along the sequence dimension to create a combined conditioned sequence that uses both music and time information

anshulkc avatar Sep 06 '24 22:09 anshulkc