0205090923

Results 2 issues of 0205090923

hi, i wonder how to get all hidden_states of selective_scan_cuda, it seems only the last hidden_state can be used, out, x, *rest = selective_scan_cuda.fwd(u, delta, A, B, C, D, z,...

Hello, I would like to know how long convolution ensures causal language modeling. It seems that I couldn't find any explicit padding applied in the code.