JY
Results
1
issues of
JY
self.time_weight = nn.Parameter(torch.ones(self.n_heads, self.block_length, self.block_length)) what is the block_size?