Tao Lei comments

Results: 41 comments by Tao Lei

Confusion about computation in paper 'Simple Recurrent Units for Highly Parallelizable Recurrence' ?

@liziru Version 2 is the default, but you can test v1 by passing `v1=True` to the `SRU` or `SRUCell` constructor. See https://github.com/asappresearch/sru/blob/master/sru/modules.py#L444

cuda runtime error: an illegal memory access was encountered

Hi, could you give me more context? For example, the code or the input specification.

FAILED: sru_cpu_impl.o

It seems a lot of people are having issues compiling the cpp code using `ninja`. This is only needed for inference on CPU. I'll release a version that makes this optional.

FAILED: sru_cpu_impl.o

Have a PR open to address the issue: https://github.com/taolei87/sru/pull/76

build error

It seems a lot of people are having issues compiling the cpp code using `ninja`. This is only needed for inference on CPU. I'll release a version that makes this optional.

build error

Have a PR open to address the issue: https://github.com/taolei87/sru/pull/76

parameter names for SRUCell incompatible with eg nn.GRUCell

I will rename the parameter for consistency.

how fast on inference/testing for seq2seq task?

@loveJasmine I haven't measured the inference time. As a rough estimate, each LSTM layer has eight d*d matrix multiplications, while each SRU layer has three, so SRU should have some advantage unless...
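
The rough estimate above can be written out as arithmetic. The following is a minimal sketch, not a benchmark: it counts only the multiply-adds spent in the d*d matrix products, using the counts quoted in the comment (eight for LSTM, three for SRU), and assumes the input and hidden sizes are both d. The function name `estimate_matmul_cost` and the sample sizes are hypothetical and chosen for illustration; element-wise operations, memory traffic, and parallelism across time steps are ignored, so real speedups may differ.

```python
# Back-of-the-envelope count of multiply-adds in the d x d matrix products
# of one recurrent layer. This is an illustration only, not a timing result.

LSTM_MATRICES = 8  # count quoted in the comment (input size == hidden size == d)
SRU_MATRICES = 3   # count quoted in the comment


def estimate_matmul_cost(d: int, seq_len: int) -> tuple[int, int, float]:
    """Return (lstm_cost, sru_cost, ratio) for one layer and one sequence.

    Each d x d matrix applied to a length-d vector costs d * d multiply-adds,
    and this is repeated once per time step.
    """
    per_matrix = d * d * seq_len
    lstm_cost = LSTM_MATRICES * per_matrix
    sru_cost = SRU_MATRICES * per_matrix
    return lstm_cost, sru_cost, lstm_cost / sru_cost


if __name__ == "__main__":
    # Hypothetical sizes, chosen only to make the numbers concrete.
    lstm, sru, ratio = estimate_matmul_cost(d=512, seq_len=100)
    print(f"LSTM multiply-adds: {lstm}")
    print(f"SRU multiply-adds:  {sru}")
    print(f"LSTM / SRU ratio:   {ratio:.2f}")
```

Under these assumptions the ratio is 8/3 regardless of d or the sequence length, which is why the comment treats it only as a rough advantage rather than a measured speedup.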

generate-dependencies-with-compile in RTX3060 Cuda11.1

Hi @v-nhandt21, compilation arguments such as "--generate-dependencies-with-compile" are added automatically by `ninja` / `nvcc`. Looking at your first screenshot, ninja/nvcc attempts to build the code using `--sm_75` and...

Unknown builtin op: sru_cuda::sru_bi_forward_simple

Hi @ctlaltdefeat, could you also provide some details:
- are you trying to use the torchscript model in Python or C++?
- are you trying to use the torchscript...