Ryan Taylor
Results: 2 issues of Ryan Taylor
Currently, token sampling for the MoD Infini-Former at inference time can produce sequences of different lengths for each observation in the batch. The current workaround is to force the batch...
Labels: enhancement, help wanted
While running some tests, I noticed several memory inefficiencies that nullify the advantages of the Infini-Transformer. In particular:
- The loop over segments needs to be moved out of the...
Labels: bug