compressed-transformer

Compression of NMT transformer model with tensor methods

2 compressed-transformer issues

Hi, I tried to integrate the TTLinear layer into TransformerXL, but I found that it consumes much more memory than usual. I couldn't even train it. Model before compression was 151M...

Hi, I am puzzled by the inference time of the compressed model. Why does the compressed model take more time? Shouldn't it be faster with fewer parameters (about half...